<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"><channel><description>Breakthrough AI to solve the world&#39;s biggest problems.&#xA;&#xA;› Join us: http://allenai.org/careers&#xA;› Get our newsletter: https://share.hsforms.com/1uJkWs5aDRHWhiky3aHooIg3ioxm</description><link>https://bsky.app/profile/ai2.bsky.social</link><title>@ai2.bsky.social - Ai2</title><item><link>https://bsky.app/profile/ai2.bsky.social/post/3miw72pj3is2c</link><description>Today we&#39;re releasing WildDet3D—an open model for monocular 3D object detection in the wild.&#xA;&#xA;It works with text, clicks, or 2D boxes, and on zero-shot evals it nearly doubles the best prior scores. 🧵</description><pubDate>07 Apr 2026 16:27 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3miw72pj3is2c</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3miceejeknx2g</link><description>Thrilled to have Ai2’s VP of Engineering Jeremy Tryba on stage at @geekwire.com&#39;s Agents of Transformation event last week. &#xA;&#xA;He painted a vivid picture of what agentic AI can do for science, and cancer research in particular. 🧵</description><pubDate>30 Mar 2026 19:08 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3miceejeknx2g</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mi2pdr2q5324</link><description>MolmoBot, our open robotic manipulation suite trained entirely in simulation, now has code, training data, a data generation pipeline, &amp; evals all available. &#xA;&#xA;This puts our robotics models within reach of any research lab—no extensive real-world data collection required. 🧵</description><pubDate>27 Mar 2026 18:04 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mi2pdr2q5324</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mhsucnydgm2g</link><description>Today we&#39;re releasing MolmoWeb, an open source agent that can navigate + complete tasks in a browser on your behalf. &#xA;&#xA;Built on Molmo 2 in 4B &amp; 8B sizes, it sets a new open-weight SOTA across four major web-agent benchmarks &amp; even surpasses agents built on proprietary models. 🧵</description><pubDate>24 Mar 2026 15:11 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mhsucnydgm2g</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mhr46p7aav2g</link><description>We were at #NVIDIAGTC last week! Across panels, livestreams, &amp; expo floor demos, we shared work on Olmo Hybrid, SERA, Asta AutoDiscovery, MolmoBot, &amp; more—all grounded in the same idea: truly open AI means sharing the full pipeline, not just the weights. 🧵&#xA;&#xA;buff.ly/rGAEUh5&#xA;https://buff.ly/rGAEUh5</description><pubDate>23 Mar 2026 22:27 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mhr46p7aav2g</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mhgxwhkhtk2i</link><description>📢 Introducing vla-evaluation-harness—a unified, fully open framework to evaluate any VLA model on any robot simulation benchmark. &#xA;&#xA;Integrate your model once. Integrate the benchmark once. The full cross-evaluation matrix fills itself. 🧵</description><pubDate>19 Mar 2026 21:44 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mhgxwhkhtk2i</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mhe2k3t5vc2v</link><description>Grounding lets vision-language models do more than describe—they can point to where a robot should grasp, which button to click, or which object to track across video frames.&#xA;&#xA;Today we&#39;re releasing MolmoPoint, a better way for models to point. 🧵</description><pubDate>18 Mar 2026 17:53 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mhe2k3t5vc2v</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mh37slg5fs2o</link><description>🔎 Deep research agents like Asta ScholarQA are transforming how we perform literature review.&#xA;&#xA;But how do we know if the way we evaluate them is actually meaningful?&#xA;&#xA;Announcing our new paper: “Deep Research, Shallow Evaluation: A Case Study in Meta-Evaluation for Long-Form QA Benchmarks” 🧵</description><pubDate>15 Mar 2026 05:33 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mh37slg5fs2o</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mgsaic5i2k22</link><description>Today, a step forward in open robotics - our results show that sim-to-real zero shot transfer for manipulation is possible. MolmoBot is our open model suite for robotics, trained entirely in simulation on MolmoSpaces.🧵</description><pubDate>11 Mar 2026 15:51 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mgsaic5i2k22</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mgda2kkyn22n</link><description>Introducing Olmo Hybrid, a 7B fully open model combining transformer and linear RNN layers. It decisively outperforms Olmo 3 7B across evals, w/ new theory &amp; scaling experiments explaining why. 🧵</description><pubDate>05 Mar 2026 16:34 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mgda2kkyn22n</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mg6elxhnrk2h</link><description>📢 Update: the Molmo 2 codebase is now open source. &#xA;&#xA;We&#39;re releasing the code behind Molmo 2—our open model family for video &amp; image understanding, pointing, tracking, &amp; more. Now you can easily train Molmo 2 on your own data. 🧵</description><pubDate>03 Mar 2026 18:12 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mg6elxhnrk2h</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mg44t6zmx52q</link><description>In just a few weeks, researchers used AutoDiscovery to generate 20K+ hypotheses across oncology, climate science, marine ecology, entomology, cybersecurity, music cognition, social sciences, &amp; more. &#xA;&#xA;Now we&#39;re extending access for three more months—and refreshing credits. 👇</description><pubDate>02 Mar 2026 20:47 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mg44t6zmx52q</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mfubtunv5w2c</link><description>We analyzed 250K+ queries &amp; 430K+ clickstream interactions from Asta, our AI-powered research assistant—and today we&#39;re releasing the full dataset. How do researchers actually use AI science tools? Here&#39;s what we found. 🧵</description><pubDate>27 Feb 2026 17:56 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mfubtunv5w2c</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mfp5qeieoj2i</link><description>Can AI predict what scientists will do next—not just one piece, but the whole research process? PreScience is our new model eval for forecasting how science unfolds end-to-end, from how research teams form to a paper&#39;s eventual impact. Built with UChicago, supported by NSF.</description><pubDate>25 Feb 2026 16:59 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mfp5qeieoj2i</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mfkkx2kpyu2g</link><description>Less than a week left to try AutoDiscovery. 🔬&#xA;&#xA;Most AI tools for science wait for a question. AutoDiscovery starts with your data—generating hypotheses, running experiments, and surfacing surprising findings with reproducible code.&#xA;&#xA;Get 1,000 Hypothesis Credits through Feb 28. 👇</description><pubDate>23 Feb 2026 21:12 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mfkkx2kpyu2g</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mfa4klvejz2h</link><description>It&#39;s been incredible seeing what the scientific community has done in just one week with AutoDiscovery, our new tool that autonomously surfaces hypotheses you might never think to test.&#xA;&#xA;Researchers have run 10,000+ experiments so far. Tell us what it&#39;s uncovering for you. 🧵</description><pubDate>19 Feb 2026 17:28 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mfa4klvejz2h</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mf5pybcx3v2w</link><description>We&#39;ve released a Chrome extension for Asta—a faster way to go from finding a paper to asking questions about it while you read. 🧵</description><pubDate>18 Feb 2026 18:37 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mf5pybcx3v2w</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3meqwroxnas2g</link><description>Data mixing – determining how much web text, code, math, etc., you need for LM development – is a first-order lever on model quality. Introducing Olmix: a framework for configuring mixing methods at the start of dev &amp; efficiently updating as data changes throughout. 🧵</description><pubDate>13 Feb 2026 16:34 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3meqwroxnas2g</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3meoepd2xaw2b</link><description>Knowing which questions to ask is often the hardest part of science. Today we&#39;re releasing AutoDiscovery in AstaLabs, an AI system that starts with your data and generates its own hypotheses. 🧪</description><pubDate>12 Feb 2026 16:06 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3meoepd2xaw2b</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3memamxbby62e</link><description>Introducing MolmoSpaces, a large-scale, fully open platform + benchmark for embodied AI research. 🤖&#xA;&#xA;230k+ indoor scenes, 130k+ object models, &amp; 42M annotated robotic grasps—all in one ecosystem.</description><pubDate>11 Feb 2026 19:47 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3memamxbby62e</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mejggcvxan2s</link><description>LLMs often generate step-by-step instructions, from real-world tasks (how do I file taxes?) to plans for AI agents. Improving this is hard: outputs can sound fluent for steps that don&#39;t work, and current datasets cover few domains.&#xA;&#xA;How2Everything evals/trains for this at scale. 🧵</description><pubDate>10 Feb 2026 16:53 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mejggcvxan2s</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3megulqnpqv26</link><description>New: A web demo to make using DR Tulu even simpler, built by our collaborators at MIT &amp; the University of Washington.&#xA;Ask a question and watch DR Tulu plan, search, &amp; synthesize a citation-grounded report you can share. 🔎</description><pubDate>09 Feb 2026 16:29 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3megulqnpqv26</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mdxvqabzuj2e</link><description>Since launching Open Coding Agents, it&#39;s been exciting to see how quickly the community has adopted them. Today we&#39;re releasing SERA-14B – a new 14B-parameter coding model – plus a major refresh of our open training datasets. 🧵</description><pubDate>03 Feb 2026 17:39 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mdxvqabzuj2e</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mdiw5hi3up26</link><description>Introducing Theorizer: Turning thousands of papers into scientific laws 📚➡️📜&#xA;&#xA;Most automated discovery systems focus on experimentation. Theorizer tackles the other half of science: theory building—compressing scattered findings into structured, testable claims. 🧵</description><pubDate>28 Jan 2026 18:37 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mdiw5hi3up26</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mdg5munm4r2e</link><description>Introducing Ai2 Open Coding Agents—starting with SERA, our first-ever coding models. Fast, accessible agents (8B–32B) that adapt to any repo, including private codebases. Train a powerful specialized agent for as little as ~$400, &amp; it works with Claude Code out of the box. 🧵</description><pubDate>27 Jan 2026 16:12 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mdg5munm4r2e</guid></item><item><link>https://bsky.app/profile/ai2.bsky.social/post/3mddqp3v35j2g</link><description>Molmo 2 (8B) is now available via @hf.co Inference Providers, courtesy of Public AI.&#xA;&#xA;State-of-the-art video understanding with pointing, counting, &amp; multi-frame reasoning. Track objects through scenes and identify where + when events occur. 🧵</description><pubDate>26 Jan 2026 17:16 +0000</pubDate><guid isPermaLink="false">at://did:plc:i4kytxgsu3yfsrt2ml3o7tgq/app.bsky.feed.post/3mddqp3v35j2g</guid></item></channel></rss>