This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Pasquale Minervini
neuralnoise.com
did:plc:tmclxozyb3anks7zzf5gmd6f
Garbage in, garbage out -- nice gem for the Italian-speaking folks on this platform 😅 TLDR, in arxiv.org/abs/2406.04127 we found that MMLU contains TONS of errors, and looks like all these seamlessly propagated to this new "Global MMLU" dataset
[contains quote post or other embedded content]
2024-12-06T13:17:27.518Z