AI News 2025-12-25

Research Insights

LLM

  • OpenAI update: GPT-5.2-Codex.
  • METR reports a record-setting time on their task length benchmark: Opus 4.5 reaches almost 5 hours (though sparsity in the available evaluations at this time horizon make data increasingly unrealiable).
    • From this, we can update our predictions. It appears that progress continues along the established exponential, with capabilities doubling every 4-5 months.

AI Agents

Safety

Posted in AI, News | Tagged , , | Leave a comment

AI News 2025-12-18

General

Research Insights

  • A new method, Poetiq, reaches new record (54%) on ARC-AGI-2. The method layers a “meta system” over generic LLMs, optimizing their usage to the task.

LLM

AI Agents

  • Google DeepMind: Towards a Science of Scaling Agent Systems. If individual agent performance is too low, multi-agent systems tend to do even worse. Independent agents lead to error amplification, while central coordination can reduce this effect.

Image Synthesis

Audio

Video

Posted in AI, News | Tagged , , , , , | Leave a comment

AI News 2025-12-11

General

Research Insights

LLM

Safety

AI Agents

Video

  • Runway ML announces several updates:
    • Gen 4.5 video model, improved model with native audio.
    • Edit audio of existing videos, and multi-shot editing.
    • GWM-1 (General World Model), allowing predictions of future states (e.g. for robotics). Three variants: GWM Worlds for explorable environments, GWM Avatars for conversational characters, and GWM Robotics for robotic manipulation.
Posted in AI, News | Tagged , , , , | Leave a comment

AI News 2025-12-04

LLM

AI Agents

Data

Image Synthesis

Video

3D

  • Hunyuan 3D Studio claims art-grade 3D generative model, Hunyuan 3D-PolyGen 1.5.

Robots

Cars

Science

Posted in AI, News | Tagged , , , , , , , , | Leave a comment

AI News 2025-11-27

General

Safety

LLM

  • Anthropic unveils Claude Opus 4.5. Beats Gemini 3 Pro on many (but not all) benchmarks, making it competitive with the state-of-the-art.

AI Agents

Image Synthesis

World Synthesis

Science

Robots

  • Phybot M1 can do backflips.
  • GigaAI (Huawai-backed) to launch a wheeled humanoid: Maker H01.
Posted in AI, News | Tagged , , , , , , , | Leave a comment

AI News 2025-11-20

General

Research Insights

LLM

  • Google announces availability of Gemini 3 Pro. Record-setting scores across many measures (including ARC-AGI).
    • Antigravity is their new agentic development platform, that uses Gemini 3 Pro. It plays the role of an IDE, but also allows one to manage agents.

Agents

Vision

Image Synthesis

Video

  • ElevenLabs is converging third-party image and video models with their own audio capabilities (voices, music, sound effects).

World Synthesis

Science

Robots

  • Shenzhen MindOn Robotics is testing their robot brain in the Unitree G1 body. If the claims are true that this motion is not teleoperated, it is indeed remarkably fluid and capable.
  • Agile Robotics (Germany) announces plans for Agile One humanoid for industrial context.
Posted in AI, News | Tagged , , , , , , , | Leave a comment

AI News 2025-11-13

General

Research Insights

LLM

Audio

World Synthesis

Science

Cars

Robots

Posted in AI, News | Tagged , , , , , , | Leave a comment

AI News 2025-11-06

Research Insights

Science

Robots

Posted in AI, News | Tagged | Leave a comment

AI News 2025-10-30

General

Research Insights

LLM

Video

Robots

  • 1X Neo Home Robot, now available for purchase (20k$), delivery in 2026.
Posted in AI, News | Tagged , , , | Leave a comment

AI News 2025-10-23

Research Insights

LLM

Video

Science

Robots

  • NOETIX Bumi is a very small humanoid, only $1,400.
Posted in AI, News | Tagged , , , , | Leave a comment