Put together some notes on the new DeepMind paper "Video models are zero-shot learners and reasoners" - it makes a very convincing case that generative video models are to vision problems what LLMs were to NLP problems: single models that can solve a wide array of challenges simonwillison.net/2025/Sep/27/

