2026 β Unified Models & Robotics
11 June 2026
Anthropic faces backlash over silent capability restrictions on national-security grounds
Fortune reported that Anthropic drew criticism from users and observers after rolling out restrictions β described as silent downgrades β that limited certain model outputs without prior notice to paying customers, citing national-security considerations. The company responded publicly to the backlash. The episode raised questions about what AI labs owe paid users when they adjust model behaviour for policy reasons, and about how transparent safety-driven capability changes should be.Fortune β Anthropic Fable 5 silent downgrade backlash
10 June 2026
OpenAI publishes its plan for AGI, access, and shared benefit
OpenAI released a public document outlining how it plans to ensure AGI benefits humanity broadly β covering safety commitments, access policy, governance structure, and the rationale for its scale. The document coincided with the company's confidential S-1 filing, positioning its commercial trajectory as aligned with public-benefit goals. It is a statement of intent rather than binding policy, but frames OpenAI's public argument for continued growth in the lead-up to a potential IPO.OpenAI β Built to benefit everyone
10 June 2026
Dario Amodei calls for governance to match the pace of AI acceleration
Anthropic CEO Dario Amodei published an essay arguing that AI is advancing faster than governance frameworks can track, and calling for stronger policy responses to model-risk and acceleration. The essay is notable because it comes from the head of a frontier lab rather than a regulator or critic, and because Amodei explicitly frames oversight as necessary rather than optional β a position with practical implications for how Anthropic positions itself ahead of expected regulation.Dario Amodei β Policy on the AI exponential
4 June 2026
Anthropic reports Claude wrote over 80% of its production code in May 2026
Anthropic published a research post titled 'When AI builds itself', disclosing that Claude had authored more than 80% of the code merged into Anthropic's production codebase in May 2026 β a significant milestone in AI self-development. The post also called for the creation of a global AI pause mechanism, suggesting that as AI systems become more capable of self-improvement, the ability to halt development rapidly could become a critical safety requirement.Anthropic β 'When AI builds itself' VentureBeat β Anthropic Claude writes own code Tom's Hardware β Claude 80% code milestone
Jun 4, 2026
NVIDIA releases Nemotron 3 Ultra, the strongest US open-weights model
NVIDIA released Nemotron 3 Ultra, a 550-billion-parameter open-weights model (55B active) built on a hybrid Mamba-Transformer mixture-of-experts architecture with a one-million-token context and post-trained for long-running agents. Shipped with open weights, training data and recipes, it scored 48 on the Artificial Analysis Intelligence Index β the highest of any US open-weight model at release.
Jun 2, 2026
Microsoft launches MAI-Thinking-1, its first flagship reasoning model
At Build 2026, Microsoft released MAI-Thinking-1, its first in-house flagship reasoning model β a sparse mixture-of-experts system with a 256K-token context that the company said matched Anthropic's Claude Opus 4.6 on the SWE-Bench Pro coding benchmark. It headlined a launch of seven new MAI models spanning text, image, voice and transcription, marking Microsoft's deepest move yet into building its own frontier stack.
May 31, 2026
NVIDIA RTX Spark runs a 120B model locally on a laptop
NVIDIA and Microsoft unveiled RTX Spark, a unified superchip combining a Grace CPU and a Blackwell GPU with up to 128GB of unified memory and around one petaflop of AI performance. NVIDIA said it could run a 120-billion-parameter model with a one-million-token context entirely on-device, bringing frontier-scale inference to laptops and compact desktops with no cloud round-trip.
May 26, 2026 Microsoft AI
Microsoft's MAI-Image-2.5 debuts at No. 3 on the image Arena
Microsoft AI launched MAI-Image-2.5, an update to its in-house image model that entered the public Arena leaderboard at No. 3, with stronger text rendering and a focus on commercial, product and branding imagery. It was Microsoft's strongest showing yet with a self-developed image model rather than a partner system.
The result signalled Microsoft's growing independence from third-party image models and intensified competition at the top of the text-to-image field, where the gap between leading systems continued to narrow.Microsoft AI announcement
May 13, 2026 Thinking Machines
Thinking Machines releases Interaction Models for real-time voice conversation
Thinking Machines published a demonstration of its Interaction Models, an AI system designed for natural, real-time voice conversation. The model can wait silently when it should, interrupt politely when it should, count discrete items in view, observe a user's posture from camera input, and redirect requests it judges to be unsafe.
The release is the lab's first major public capability demonstration and adds a third credible entrant alongside OpenAI and Google in the real-time multimodal interaction race.Thinking Machines blog
May 13, 2026 Google DeepMind
DeepMind unveils AI Pointer β a context-aware cursor for editing by voice
Google DeepMind introduced AI Pointer, a cursor that interprets on-screen selection plus spoken instruction as a single intent. Users can highlight content β text, image regions, list items β and tell the system what to do with it, removing the need to switch between chat sidebars and direct manipulation.
The approach reframes how operating systems expose AI affordances: rather than a separate assistant window, AI becomes part of the cursor itself.DeepMind blog
May 13, 2026 World Labs
World Labs ships open-source image-to-3D with physics and audio
World Labs released an open-source tool that converts a single photograph into a navigable 3D environment, complete with physics, movable objects, lighting and ambient audio. The release significantly lowers the barrier for hobbyists and indie developers to produce playable 3D scenes from reference photography.
Single-image-to-scene with physics and audio is a key step toward AI-generated game and simulation content, and the open-source release accelerates downstream tooling that previously required full studio pipelines.World Labs
Apr 16, 2026
Robot maps and recovers artefacts from France's deepest shipwreck
The French Navy and underwater-archaeology unit DRASSM used the ROV C 4000 to survey Camarat 4, a 16th-century merchant wreck found at 2.5 km in the Mediterranean. The tethered robot captured roughly 86,000 images at up to eight per second, and lifted ceramic jugs without disturbing the surrounding debris field β among the deepest objects ever raised from a French wreck.
The imagery feeds a 3D model of a vessel type that is poorly documented in surviving 16th-century texts and signals a wider shift toward non-invasive deep-sea archaeology. At 1.5 miles down the ROV operates under nearly 150 atmospheres, where conventional human dive equipment fails entirely.CBS News β
Apr 8, 2026
Princeton soft origami robot moves without motor or gears
Princeton engineers built a soft-rigid hybrid robot that moves without a motor, gearbox or pneumatic line by combining a printable liquid-crystal-elastomer polymer with flexible electronics and origami folding. A demonstration crane-shaped robot flaps its wings on electric current alone, with targeted heating in the polymer doing the work an actuator would normally do.
The paper β Bershadsky, Davidson, Paulino and Zhao, "Digital Actuation Control of Soft Robotic Origami With Self-Folding Liquid Crystal Elastomer Hinges" β appeared online in Advanced Functional Materials on 21 March 2026. Removing the motor cuts part count and failure modes, opening applications in medical devices, search-and-rescue and inspection robotics.Princeton Engineering β
Apr 2026
Coinbase launches Agentic Wallets and x402 β AI agents get their own money
Coinbase released Agentic Wallets alongside the x402 payment protocol, giving AI agents the ability to hold, send, and receive cryptocurrency autonomously without human approval for each transaction. For the first time, an AI agent could pay for API calls, purchase compute, or settle invoices as part of a workflow β creating the financial infrastructure layer that autonomous agents need to act independently in economic contexts.
Mar 26, 2026 RAI Institute
RAI Institute unveils Roadrunner bipedal-wheeled robot
The Robotics & AI Institute, led by Boston Dynamics founder Marc Raibert, revealed Roadrunner β a 15 kg bipedal robot whose feet double as wheels, switching between side-by-side and in-line skating modes plus stepping on the same hardware. A single learned control policy handles every locomotion mode, and behaviours such as standing up from the ground and balancing on one wheel were deployed zero-shot.
Roadrunner is positioned for logistics and warehouse use where wheels save energy on flat ground and legs handle obstacles. The release continues a 2026 trend of multi-modal locomotion β wheels plus legs β replacing pure bipedal designs in commercial robotics research.RAI Institute β
Mar 12, 2026
LTX 2.3 generates synchronised video and audio in a single pass
Lightricks released LTX 2.3, a 22-billion-parameter diffusion transformer model that generates synchronised video and audio in a single forward pass. The model supports resolutions up to 4K at 50 frames per second, marking a significant leap in real-time media generation quality.
Mar 5, 2026
GPT-5.4 launches with 1 million token context window
OpenAI released GPT-5.4, its most capable frontier model, available in Standard, Thinking, and Pro variants. The model features a context window of up to 1 million tokens (the largest from OpenAI), a reported 33% reduction in factual errors compared to GPT-5.2, and improved capabilities across coding, reasoning, and agentic workflows.
2025 β Robotics & Reasoning
December 11, 2025
LEAP 71 hot-fires two AI-designed, 3D-printed methalox rocket engines
Following the 2024 University of Sheffield test fire of what was described as the world's first AI-designed rocket engine, LEAP 71 reported hot-firing two 20 kN orbital-class methalox engines β a conventional bell nozzle and a full-scale aerospike β going from specification to first flame in under three weeks. The engines were generated autonomously by Noyron, the company's Large Computational Engineering Model, and 3D-printed in copper. LEAP 71 described the tested engines as roughly a tenth of the thrust class it plans to hot-fire in 2026, with manufacturing validation underway on 200 kN and 2,000 kN designs.Sheffield β
Aug 20, 2025
Atlas Humanoid with Neural Large Behavior Models
Boston Dynamics demonstrated a newly redesigned Atlas humanoid robot powered by neural Large Behavior Models from Toyota Research Institute. The robot performed complex multi-task sequences with self-correction, learning control policies without hand-coded routines. Boston Dynamics has deployed over 500 robots with revenue exceeding $130 million.
Jan 20, 2025
DeepSeek R1 Open-Source Reasoning Model
DeepSeek released R1, an open-source reasoning model that demonstrates competitive performance with proprietary frontier models. The release includes both full weights and distilled versions, making advanced reasoning capabilities accessible to the open-source community.
2024 β Agent & Vision Capabilities
Oct 29, 2024
Claude 3.5 Sonnet with Computer Use
Anthropic released Claude 3.5 Sonnet with native computer interaction capabilities, allowing the model to see, understand, and control a computer screen. This enables autonomous execution of multi-step digital workflows without relying on separate tool APIs.
Sep 12, 2024
OpenAI o1 Reasoning Model Launch
OpenAI introduced o1, a model trained to spend more time thinking through problems before responding. It achieves state-of-the-art performance on mathematical, coding, and scientific reasoning tasks by using reinforcement learning to develop internal reasoning processes.
Jul 23, 2024
Meta Llama 3.1 405B Open-Source Release
Meta released Llama 3.1 405B, a 405-billion parameter open-source model that rivals closed proprietary models on performance benchmarks. The full weights were made freely available for research and commercial use.
May 13, 2024
GPT-4o Multimodal Model
OpenAI released GPT-4o, a model optimized to handle text, vision, and audio seamlessly in a unified way. The model shows significant performance improvements over GPT-4 and can process audio and images natively without intermediate conversions.
May 8, 2024
AlphaFold 3 Predicts Protein-Ligand Complexes
DeepMind released AlphaFold 3, expanding beyond protein structure prediction to accurately predict protein-DNA, protein-RNA, and protein-ligand interactions. The model achieved 50% accuracy improvement over AlphaFold 2 and contributed to structural understanding underlying the 2024 Nobel Prize in Chemistry.
2023 β Context & Reasoning
Feb 15, 2024
Google Gemini 1.5 Pro with 1M Token Context
Google introduced Gemini 1.5 Pro, a model capable of processing a context window of up to 1 million tokens. This enables the model to work with entire books, lengthy video transcripts, and massive code repositories in a single prompt.
Dec 11, 2023
Mixtral 8x7B Mixture-of-Experts Model
Mistral AI released Mixtral 8x7B, a sparse mixture-of-experts model that achieves performance comparable to much larger models while maintaining efficiency. The model uses 8 expert networks, activating only 2 per token for computational efficiency.
Nov 14, 2023
GraphCast Achieves Superior Weather Forecasting
DeepMind released GraphCast, a graph neural network model that predicts weather globally at 0.25-degree resolution in under 1 minute. The model outperformed the European Centre for Medium-Range Weather Forecasts (ECMWF) on 90% of evaluated meteorological variables, producing predictions that took traditional systems 10 minutes to calculate.
Nov 6, 2023
GPT-4 Turbo with 128K Context
OpenAI released GPT-4 Turbo with a 128,000 token context window, 4x the original GPT-4 context. The model also features reduced hallucination rates and lower API costs compared to previous versions.
Sep 6, 2023
Technology Innovation Institute Releases Falcon 180B
The Technology Innovation Institute (TII) released Falcon 180B, a 180-billion parameter open-source language model trained on 3.5 trillion tokens. At release, it was the largest openly available language model, surpassing Llama 2 on multiple benchmarks including MMLU, LAMBADA, and HellaSwag.
Jul 18, 2023
Meta Llama 2 Open-Source Release
Meta released Llama 2, a family of open-source language models ranging from 7B to 70B parameters. Made freely available for research and commercial use, with models trained on 2 trillion tokens of public data.
Jul 11, 2023
Claude 2 Language Model Release
Anthropic released Claude 2, a significantly improved version with longer context (100K tokens), better performance on complex reasoning tasks, and improved safety properties. The model set new benchmarks for instruction following and factuality.
2022 β Multimodal & Generative
Mar 14, 2023
GPT-4 Launch
OpenAI released GPT-4, a multimodal model accepting both text and image inputs. It demonstrated significant improvements in reasoning, safety, and reliability compared to GPT-3.5, with performance surpassing human experts on many professional benchmarks.
Nov 30, 2022
ChatGPT Public Launch
OpenAI released ChatGPT to the public, a conversational interface powered by GPT-3.5. It reached 1 million users in 5 days and 100 million in 2 months, becoming the fastest-growing application in history.
Sep 21, 2022
OpenAI Open-Sources Whisper Speech Recognition
OpenAI released Whisper, an open-source automatic speech recognition (ASR) model trained on 680,000 hours of multilingual and multitask supervised data collected from the web. The model achieves 50% fewer errors than current specialized models and supports transcription in 96 languages.
Aug 22, 2022
Stable Diffusion Public Release
Stability AI released Stable Diffusion, an open-source text-to-image generation model. Available under an open license, it could run on consumer hardware and sparked a wave of creative applications and fine-tuned variants.
Jul 11, 2022
BigScience Releases BLOOM 176B Multilingual Model
The BigScience collaborative initiative released BLOOM, a 176-billion parameter open-source language model trained across 46 natural languages and 13 programming languages. At the time of release, it was the largest openly available language model in existence.
Apr 6, 2022
DALL-E 2 Image Generation Model
OpenAI released DALL-E 2, a significantly improved text-to-image model with better understanding of natural language prompts and higher image quality. The model demonstrated zero-shot generalization to novel concepts and creative variations.
2021 β Protein & Scale
Dec 1, 2020
AlphaFold 2 Solves Protein Folding
DeepMind's AlphaFold 2 solved the protein folding problem, predicting 3D protein structures to near-experimental accuracy at the CASP14 competition. The breakthrough came from combining attention mechanisms with evolutionary biology insights. The achievement later contributed to the 2024 Nobel Prize in Chemistry awarded to David Baker, Demis Hassabis, and John Jumper.
2020 β Scaling & Few-Shot
Jun 11, 2020
GPT-3 Language Model Breakthrough
OpenAI published GPT-3, a 175-billion parameter language model demonstrating few-shot learning across diverse tasks without task-specific fine-tuning. The model showed emergent abilities like chain-of-thought reasoning and simple code generation.
2017 β Transformers
Jun 12, 2017
"Attention Is All You Need" Transformer Paper
Google researchers published "Attention Is All You Need," introducing the Transformer architecture built entirely on attention mechanisms. This paper became one of the most cited in machine learning, fundamentally changing how neural networks are designed.
2016 β Deep Reasoning
Mar 9, 2016
AlphaGo Defeats Lee Sedol
DeepMind's AlphaGo defeated world champion Lee Sedol in a 5-game match of Go, winning 4-1. Using deep neural networks combined with tree search, AlphaGo exhibited intuitive play and strategic understanding previously thought impossible for machines.
2014 β Generative Models
Jun 10, 2014
Generative Adversarial Networks (GANs) Introduced
Ian Goodfellow and collaborators introduced Generative Adversarial Networks, a framework where two neural networks competeβone generating data and one discriminating real from fake. This sparked a revolution in generative modeling and unsupervised learning.
2012 β Deep Learning Revolution
Sep 30, 2012
AlexNet Wins ImageNet Competition
A deep convolutional neural network called AlexNet won the ImageNet Large Scale Visual Recognition Challenge with a top-5 error rate of 15.4%, far exceeding traditional computer vision approaches at 26.2%. The win sparked the deep learning revolution in vision.
2026 β Unified Models & Robotics
June 17, 2026 OpenAI
AI chemist improves a real medicinal-chemistry reaction
OpenAI reported that GPT-5.4, connected to Molecule.one's Maria AI and an automated lab platform, identified an additive that raised Chan-Lam coupling yields for more than 80% of the substrates tested. The system reasoned over the chemistry, proposed the change and validated it experimentally rather than only suggesting candidates on paper. It is one of the clearer demonstrations of a language model driving a measurable improvement in a real laboratory reaction.OpenAI β
June 18, 2026 OpenAI
AI reanalysis surfaces diagnostic leads for unsolved rare childhood diseases
Clinicians used an OpenAI reasoning model to re-examine 376 previously unsolved rare genetic disease cases in children and surfaced candidate leads for 18 diagnoses. The model worked through tangled genetic and clinical evidence that had defeated earlier analysis, pointing specialists toward specific avenues to confirm. Each lead still needs clinical validation, but the exercise shows reasoning models helping with diagnostic dead-ends that affect real families.OpenAI β
June 17, 2026 OpenAI
OpenAI introduces LifeSciBench for real-world life-science tasks
OpenAI introduced LifeSciBench, an expert-written and expert-reviewed benchmark for messy, real-world life-science research work. It tests whether AI systems can interpret evidence, reconcile conflicting results, design experiments and weigh translational risk rather than answer tidy exam questions. The benchmark gives the field a way to measure progress on the kind of research that actually moves biology forward.OpenAI β
June 19, 2026 Anthropic
AlphaFold co-creator John Jumper leaves DeepMind for Anthropic
Reuters reported that John Jumper, co-creator of AlphaFold and a 2024 Nobel laureate, is leaving Google DeepMind to join Anthropic. The move shifts one of the most prominent figures in AI for science between two frontier labs. It is a notable signal in the contest among leading labs to attract the researchers driving scientific applications of AI.Reuters β