This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Johannes Gasteiger🔸
gasteigerjo.bsky.social
did:plc:pisjuoavhugpvyjvero7mooj
New Anthropic blog post: Subtle sabotage in automated researchers.
As AI systems increasingly assist with AI research, how do we ensure they're not subtly sabotaging that research? We show that malicious models can undermine ML research tasks in ways that are hard to detect.
2025-03-25T16:03:46.882Z