AI News 2025-01-30

General

Research Insights

LLM

  • Release of Qwen2.5-1M model, with a 1 million token context (technical report).
  • Release of Qwen2.5-VL, a vision-language model.
  • DeepSeek releases Janus Pro 1B (includes image generation and chat with PDF). It can run local/in-browser via WebGPU (demo here).
  • Open Thoughts has launched as an effort to curate quality datasets for training reasoning models (e.g. validated synthetic reasoning traces). Initial dataset has 114k traces.
  • Open-R1 is an attempt to reproduce the DeepSeek-R1 model/result/method in a fully open manner.
  • OpenAI has added a “think” option to GPT-4o, allowing it to invoke some form of chain-of-thought.

Safety

AI Agents

Audio

Video

Science

Robots

This entry was posted in AI, News and tagged , , , , , . Bookmark the permalink.

Leave a Reply