This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Kyle Lo @ COLM 2025 🍁
kylelo.bsky.social
did:plc:xl4nejvjc52uhp25mceh3f7q
scathing takedown of recent K2 Think model
"evaluates on data it was trained on, relies on an external model and additional samples for its claimed performance gains, and artificially reduces the scores of compared models"
www.sri.inf.ethz.ch/blog/k2think
https://www.sri.inf.ethz.ch/blog/k2think
2025-09-12T20:57:51.602Z