AI News 2025-10-02

General

Research Insights

Google DeepMind: Video models are zero-shot learners and reasoners. They show that Veo 3 gains a remarkable number of non-trained (emergent) capabilities that imply understanding.

LLM

OpenAI ChatGPT Pulse is a feature where the system proactively searches and curates information for the user.
OpenAI GDPval: Measuring the performance of our models on real-world tasks (paper). Claude Opus 4.1 is reaching near-parity with an industry expert on the selected tasks.

Anthropic releases Claude Sonnet 4.5.
OpenAI adds Instant Checkout in ChatGPT and Agentic Commerce Protocol.
- C.f. Google’s recent Agent Payments Protocol (AP2), and paper on virtual agent economies.
- Clearly, the AI labs want to streamline the ability of agents to engage in economic activity on behalf of their user.
Dreamer 4: Training Agents Inside of Scalable World Models (paper). The ability to transfer learning from an internal neural world model, to the actual task, unlocks improved learning.
Nvidia announce: RLP: Reinforcement as a Pretraining Objective (paper). They apply RL in the pre-training phase (instead of only post‑training), treating chain-of-thought as actions which can be rewarded by information gain.

Audio

Video

OpenAI announce Sora 2 (system card). More realistic, includes sound, ability to add a specific person to a scene, multiple aesthetics. The app is iOS only (for now) and emphasizes social aspects (friend invites, etc.).

Science

Hardware

European AI chip startup Euclyd aims to deliver large-area chips optimized for inference.