This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Andrew White 🐦⬛
andrew.diffuse.one
did:plc:5lmwpxyligfn4bmpeb5a4ejf
HLE has recently become the benchmark to beat for frontier agents. We at FutureHouse took a closer look at the chem and bio questions and found about 30% of them are likely invalid based on our analysis and third-party PhD evaluations. 1/7
2025-07-23T16:29:03.844Z