AI News 2024-11-14

General

OpenAI’s data scraping wins big as Raw Story’s copyright lawsuit dismissed by NY court. The crux is that the plaintiffs could not demonstrate a concrete, actual harm from OpenAI’s actions.
An article on Reuters: OpenAI and others seek new path to smarter AI as current methods hit limitations. It repeats the assertions (disputed by many experts in the community) that next-generation models (under development) are under-performing, and that AI labs are hitting data walls. They also emphasize that the path forward involves more “inference-time compute” to unlock reasoning.
- It is interesting to see the article including a quote from Ilya Sutskever, who has been largely quiet in the public sphere, after his departure from OpenAI and founding of SSI.
The AI Semiconductor Landscape.

Lex Fridman interviews Anthropic: Dario Amodei (CEO), Amanda Askell (develops Claude’s personality), Chris Olah (works on mechanistic interpretability).

Research Insights

The Surprising Effectiveness of Test-Time Training for Abstract Reasoning (code). They implement temporary updates to weights at inference-time, using a loss and gradients in the usual (training) manner. They show strong performance on ARC tasks.
Mansi Sakarvadia’s thesis: Towards Interpreting Language Models: A Case Study in Multi-Hop Reasoning. Develops a system to allow the user to inject prompt-specific information into inference, which can improve multi-step reasoning. Also describes Attention Lens, to convert attention heads into interpretable tokens.
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding (code).

LLM

AI Agents

Video

World Synthesis

Science

Robots

New Deep Robotics video shows very good terrain navigation from a quadruped-with-wheels design.