This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Scott McGrath
smcgrath.phd
did:plc:lrzkd5exmxqrblbruvjieofj
🧪 "Humanity's Last Exam" sets a new benchmark for AI: 3,000 expert-crafted questions spanning 100+ subjects. Current LLMs perform poorly, revealing a gap in expert-level knowledge and calibration, but it would be difficult to build a harder test. 🩺💻 #MLSky
https://lastexam.ai
2025-01-23T17:22:32.557Z