This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
JHU Computer Science
jhucompsci.bsky.social
did:plc:rnvr6ggexusur72mzupbabv4
and @jwuphysics.bsky.social, @jegpeek.bsky.social, @ziangxiao.bsky.social, @anjalief.bsky.social, and more seek to improve evaluation procedures by building an understanding of how users evaluate LLMs in “From Queries to Criteria: Understanding How Astronomers Evaluate LLMs”: (3/3)
https://arxiv.org/abs/2507.15715
2025-10-06T19:11:58.715Z