Stanford HAI
stanfordhai.bsky.social
did:plc:x3v7jwickkzpc37wed7ckyyv
A recent Stanford paper reveals that many popular AI benchmarks are fundamentally flawed: They can be outdated, easily gamed, or inaccurate. Stanford HAI Graduate Fellow @ankareuel.bsky.social talks about how researchers are rethinking AI benchmarks: https://www.emergingtechbrew.com/stories/2025/03/24/researchers-ai-intelligence-benchmarks
2025-03-25T21:26:13.988Z