This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Greg Durrett
gregdnlp.bsky.social
did:plc:36teqxj4ycvmuzwayor6ocir
...established a critical weakness of RLHF with open reward models: spurious correlation with length (COLM 2024)
2025-01-03T14:39:29.759Z