Why
why.bsky.team
did:plc:vpkhqolt662uhesyj6nxm7ys
The "reduced reward hacking" stuff they're talking about is also noticeable: it much less often stubs out things it can't quite do right. Retrying a set of prompts from a month ago, instead of wrapping some data loaders in try/catch blocks, it is happier to let things fail and propagate errors up.
2025-06-01T04:52:01.348Z
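
To make the contrast in the post concrete, here is a rough sketch of the two data-loader styles. It is written in Python, so try/except stands in for try/catch. The function names and the JSON input are hypothetical, made up for illustration, and are not taken from the prompts mentioned in the post.

```python
import json


# Hypothetical loader names, used only for illustration.
def load_records_swallowing(raw: str) -> list:
    """Wrap the loader in try/except and hide the failure."""
    try:
        return json.loads(raw)
    except Exception:
        # The error is swallowed here, so the caller gets an empty
        # but valid-looking result and never learns the load failed.
        return []


def load_records_propagating(raw: str) -> list:
    """Let the failure surface to the caller."""
    # No try/except: a json.JSONDecodeError raised on bad input
    # propagates up the call stack.
    return json.loads(raw)
```

With bad input, the first version quietly returns an empty list, while the second raises, so the problem shows up where it happened instead of further downstream.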