This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
FBK - NLP Research Group
fbk-nlp.bsky.social
did:plc:qniqtmrom6byy2ueeldvfgdj
🚀 **Exciting News!** 🎉 Evalita-LLM is here! 🇮🇹 A new benchmark for evaluating LLMs—offering native Italian tasks, generative challenges, and fair multi-prompt evaluations. Now also available in lm-evaluation harness by @eleutherai.bsky.social !
ArXiv: arxiv.org/abs/2502.02289
#NLProc #LLM #Evaluation
https://arxiv.org/abs/2502.02289
2025-02-24T17:07:16.858Z