General
- Mira Murati’s Thinking Machines Lab raises $2B ($10B valuation).
- Scaling Test Time Compute to Multi-Agent Civilizations: Noam Brown.
- OpenAI schedules their next DevDay for October 6, 2025.
Research Insights
- Dense SAE Latents Are Features, Not Bugs. They find pairs of opposing features that fire very frequently. Far from being useless, they find these encode meaningful concepts.
- Sakana AI: Reinforcement Learning Teachers of Test Time Scaling (preprint). Rather than using to RL to improve solution, they focus on improving the ability for the model to teach other models. RL reward is based on how useful generated training examples are (to smaller learning models); rather than being rewarded on correctness of their final answer.
LLM
Agents
- Kimi-Researcher: End-to-End RL Training for Emerging Agentic Capabilities. Achieves 27% on Humanity’s Last Exam.
- Google introduce Gemini CLI, an open-source AI agent for coding in your terminal.
Audio
- ElevenLabs introduces 11ai, a voice conversational assistant; exploits MCP to enable connection to resources (calendar, etc.).
- ElevenLabs introduces Voice Design v3, an improvement to their text-to-voice system for designing a voice.
Image Synthesis
- Higgsfield Soul is a new high-aesthetics image model (examples).
Video
- ByteDance: InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions. Demonstrates the possibility of controlling multiple characters talking, matched to provided audio.
World Synthesis
- Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition (preprint, video example). Improved quality and consistency.
- Runway is experimenting with generative text adventures.
- PlayerOne: Egocentric World Simulator (preprint).
Science
- Google DeepMind releases AlphaGenome (including API capabilities); it takes base-pair sequences as input, and predicts genomic behavior outputs.
Robots
- Having humanoid robots walking around in the real world points towards improvements to robustness and reliability.
- Google release a VLA that allows robotic control on-device: Gemini Robotics On-Device brings AI to local robotic devices.
- See also tech report: Gemini Robotics: Bringing AI into the Physical World.