Simon Willison
simonwillison.net
did:plc:kft6lu4trxowqmter2b6vg6z
Made some notes on the new DeepMind paper "Video models are zero-shot learners and reasoners" - it makes a convincing case that generative video models are to vision problems what LLMs were to NLP problems: single models that can solve a wide array of challenges https://simonwillison.net/2025/Sep/27/video-models-are-zero-shot-learners-and-reasoners/
2025-09-28T00:29:05.681Z