The AI Timeline

Follow The Latest Cutting Edge AI Research in 5 minutes a week.

Connect

Featured Posts

Premium InsightsPremium Insights

Dec 11, 2025

Premium

Aug~Nov AI Research Trend Report

Basically recapping what I missed in the last 4 months

by cloud

Premium InsightsPremium Insights

Dec 04, 2025

Premium

The Only Perfect Score Paper at NeurIPS 2025

Breaking down "Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?"

by cloud, +1

Dec 02, 2025

DeepSeek-V3.2 Technical Report Is Pure Gold

FreeFlow, DeepSeekMath-V2, Soft Adaptive Policy Optimization, and more

by cloud

Nov 18, 2025

Depth Anything 3: Recovering the Visual Space from Any Views

LeJEPA, The Path Not Taken, and more

by cloud

Jun 03, 2025

A Shocking RLVR Revelation For LLM Just Dropped

Read about "Rethinking Training Signals in RLVR", why LLMs are headless chickens, and "Learning to Reason without External Rewards"

by cloud

Jan 28, 2025

DeepSeek-R1 Explained

Plus more about Transformer2 and Kimi k1.5

by cloud

On-Policy Delta Distillation

plus more about Concurrent Image Understanding and Generation, Latent and Explicit Reasoning with Looped Transformers and more

by cloud

Jul 14, 2026

Why Memorized Knowledge Fails to Generalize in LLM Finetuning

Plus more about Single Async Opt for Agentic RL, Remember When It Matters, and Sparse Delta Memory

by cloud

Jul 07, 2026

You Only Need 1 Layer for RLVR?

plus more about AdaJEPA, Program-as-Weights, The World Is In Your Mind, and Dual On-policy Distillation

by cloud

Jun 30, 2026

DeepSeek Just dropped a new speculative decoding method!

plus more about Tapered LMs, Improved LLDMs, AutoData, and You Don't Need To Run Every Eval

by cloud

Jun 23, 2026

What even is a >< former (yes >< former)

plus more about Looped World Models, Fixed-Point Reasoners, and ExpRL

by cloud

Jun 16, 2026

MiniMax M3's New Attention: MiniMax Sparse Attention

plus more about FlashMemory-DeepSeek-V4, Trajectory-Refined Distillation, Test-Time Gradient Guidance, and End-to-End Context Compression at Scale

by cloud

Jun 09, 2026

Microsoft just shared the frontier data engineering secrets

plus more about If LLMs Have Human-Like Attributes, Then So Does Age of Empires II, Cosmos 3, and Robots Need More than VLA and World Models

by cloud

Jun 02, 2026

DiffusionBlocks: Save 2-3x Training Memory!?

plus more about Bitter Lesson in Data Filtering, Do Language Models Need Sleep, and Neural Weight Norm.

by cloud

May 26, 2026

Generative Recursive Reasoning

plus more on the Benefits of Subword Tokenization, HRM-Text, Probabilistic Tiny Recursive Model, and Vector Policy Optimization

by cloud

May 19, 2026

Long Context Pre-Training w/ Lighthouse Attention

plus more about Self-distilled Agentic RL, Embedded Language Flows, and Negation Neglect

by cloud

May 12, 2026

Think In Diffusion: Continuous Latent Diffusion Language Model

plus more on Sparser, Faster, Lighter Transformer LMs, Manifold Steering, and Teaching Claude Why

by cloud

May 05, 2026

DeepSeek's Deleted Paper: Thinking With Visual Primitives

can't believe they removed this paper unknowningly

by cloud

First Back

1 2 3 4 5 6 7 8

Next Last

The AI Timeline

Featured Posts

Aug~Nov AI Research Trend Report

The Only Perfect Score Paper at NeurIPS 2025

DeepSeek-V3.2 Technical Report Is Pure Gold

Depth Anything 3: Recovering the Visual Space from Any Views

A Shocking RLVR Revelation For LLM Just Dropped

DeepSeek-R1 Explained

Archive

On-Policy Delta Distillation

Why Memorized Knowledge Fails to Generalize in LLM Finetuning

You Only Need 1 Layer for RLVR?

DeepSeek Just dropped a new speculative decoding method!

What even is a >< former (yes >< former)

MiniMax M3's New Attention: MiniMax Sparse Attention

Microsoft just shared the frontier data engineering secrets

DiffusionBlocks: Save 2-3x Training Memory!?

Generative Recursive Reasoning

Long Context Pre-Training w/ Lighthouse Attention

Think In Diffusion: Continuous Latent Diffusion Language Model

DeepSeek's Deleted Paper: Thinking With Visual Primitives