AI News 2025-06-05

General

Research Insights

LLM

  • LisanBench (github) is a new benchmark that evaluates long-term task coherence (“stamina”) through a game where the LLM must progressively alter a word (one character at a time), always yielding a valid English word, to build the longest possible chain. Although highly contrived, this does seem to test longer-range planning. The results broadly match the prevailing “vibes” about relative model intelligence.
  • Anthropic has launched Claude Explains, a blog of AI-generated posts (with human verification). The focus (currently) appears to be teaching simple coding concepts.
  • OpenAI announces updates to ChatGPT for business.
    • Deep research can now search across defined private data repositories (SharePoint, Google Drive, Dropbox, etc.).
    • Chat queries and data analysis requests can draw directly from connected data sources.
    • ChatGPT now supports custom connectors, based on MCP.
    • Being deployed for Teams, Enterprise, and Edu.
    • Record mode transcribes meetings, providing a summary document with pointers to the transcript/timecode.
  • Google updated Gemini 2.5 Pro.
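
To make the LisanBench game above concrete, here is a minimal sketch of how a word chain might be scored. This is not the benchmark's actual code: it assumes each step substitutes exactly one letter (keeping word length fixed), that repeated words are disallowed, and that the score is the number of words before the first violation. The names differs_by_one, valid_chain_length, and TOY_WORDS are hypothetical, and the tiny word set stands in for a real dictionary.

```python
def differs_by_one(a: str, b: str) -> bool:
    """True if a and b have equal length and differ in exactly one position."""
    return len(a) == len(b) and sum(x != y for x, y in zip(a, b)) == 1


def valid_chain_length(chain: list[str], dictionary: set[str]) -> int:
    """Count the words in the chain up to the first rule violation.

    Assumed rules (not taken from the benchmark's source): every word must be
    in the dictionary, no word may repeat, and each word must differ from the
    previous one by a single substituted letter.
    """
    seen: set[str] = set()
    count = 0
    for i, word in enumerate(chain):
        if word not in dictionary or word in seen:
            break
        if i > 0 and not differs_by_one(chain[i - 1], word):
            break
        seen.add(word)
        count += 1
    return count


# Toy stand-in for a real English word list.
TOY_WORDS = {"cat", "cot", "cog", "dog", "dot"}

if __name__ == "__main__":
    print(valid_chain_length(["cat", "cot", "cog", "dog"], TOY_WORDS))
```

Under these assumptions, a model's “stamina” is simply how long it can keep extending the chain before it repeats a word, invents a non-word, or changes more than one letter.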

Agents

Safety

  • Yoshua Bengio launches LawZero, a non-profit dedicated to advancing safe-by-design AI.

Audio

  • ElevenLabs introduces a multimodal assistant that can handle a mixture of voice and text input (at the same time, without toggling between modes). It does seem like a productive way to interact with an AI.
  • Play AI is open-sourcing PlayDiffusion (demo), a diffusion-LLM for speech, which allows for inpainting (example).
  • Bland announces an improvement in their text-to-speech model, with cloning of voice, accent, style, etc. They claim it is finally past the uncanny valley.

Image Synthesis

Video

  • AMC is integrating Runway ML genAI into its workflows (mostly for ideation, pre-vis, and promotional materials).
  • Luma introduces Modify Video, allowing style transfer or video generation conditioned on an input video.

Science
