AI News 2026-02-28

General

Research Insights

Learning to Continually Learn via Meta-learning Agentic Memory Designs. Rather than prescribing a memory architecture, a meta-agent automatically designs it.
Goodfire: Features as Rewards: Using Interpretability to Reduce Hallucinations. This RLFR paradigm uses interpretability to extract model beliefs, providing a feedback signal for RL.
Replicating Human Motivated Reasoning Studies with LLMs. Current LLMs do not appear to engage in motivated reasoning, which limits their usefulness as models of human behavior.

LLM

Anthropic releases Claude Opus 4.6. State-of-the-art on ARC-AGI.
OpenAI releases GPT-5.3-Codex.
OpenAI releases GPT-5.3-Codex-Spark, optimized for real-time coding.
Google releases Gemini 3 Deep Think, intended for science and engineering. (Impressive benchmarks.)
Google releases Gemini 3.1 Pro, which builds on the core intelligence improvements of Gemini 3 Deep Think. On the Pareto frontier for ARC-AGI.
MiniMax M2.5 is an open-source model (230B parameters) optimized for coding and agentic tasks.
Anthropic reveals Claude Sonnet 4.6, with a 1M-token context window.

AI Agents

AI Safety

The Hot Mess of AI: How Does Misalignment Scale with Model Intelligence and Task Complexity? (preprint)
- Models become more incoherent over longer reasoning traces.
- Smarter models can nevertheless become more incoherent on hard tasks.
- With respect to safety, the real risk may be “hot mess” behavior on the correct goal, rather than optimization of the wrong goal.

Image Synthesis

Seedream 5.0 available.
Google releases Nano Banana 2, a faster and cheaper version that still delivers high quality.

Audio

Video

Science

Cars

The Waymo World Model: A New Frontier For Autonomous Driving Simulation. Built on Google DeepMind Genie 3.
Tesla releases safety data. Driving with Full Self-Driving (Supervised) engaged yields a lower collision rate than driving without it, and the gap is larger still when compared to the US driver average.

Robots

Epoch AI analyzes the state of robotics: Where Autonomy Works: Evaluating Robot Capabilities in 2026. They conclude: “navigation is deployed at scale, manipulation mostly is not”.