This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
METR
metr.org
did:plc:dll3hepzq76nymel5c3yt6nk
METR previously estimated that the time horizon of AI agents on software tasks is doubling every 7 months.
We have now analyzed 9 other benchmarks for scientific reasoning, math, robotics, computer use, and self-driving; we observe generally similar rates of improvement.
[contains quote post or other embedded content]
2025-07-14T18:22:08.238Z