The AI Timeline

Follow The Latest Cutting-Edge AI Research in 5 minutes a week.


Featured Posts

Jan 28, 2025

DeepSeek-R1 Explained

Plus more about Transformer² and Kimi k1.5

by cloud

Nov 19, 2024

Top 3 Rated ICLR 2025 Papers - LoRA Done RITE, IC-Light, HyCoCLIP

#32 | Latest AI Research Explained Simply

by cloud

Premium Insights

May 19, 2025

How DeepSeek Made The Best Math Prover Ever (+500% vs prev. SoTA)

Premium Insights: A closer look into the DeepSeek Prover series

by cloud, +1

Jun 03, 2025

A Shocking RLVR Revelation For LLM Just Dropped

Read about "Rethinking Training Signals in RLVR", why LLMs are headless chickens, and "Learning to Reason without External Rewards"

by cloud

Premium Insights

Jun 06, 2025

May 2025 Research Trend Report

Premium Insights: A recap of popular AI research papers and research trends in May 2025

by cloud

Jun 17, 2025

LLM That Can Modify Itself?

Plus more about "The Diffusion Duality" and "Reinforcement Pre-Training"

by cloud

Archive

Language Models are Injective and Hence Invertible

and more on Kimi Linear, Looped Transformer, How FP16 fixes RL...

Naman, +1

Oct 29, 2025

How to Compress Long Text into Images To Reduce LLM Tokens

by cloud

Oct 21, 2025

Last Week's Trending Papers 📈

RAE, Reasoning with Sampling, and more

by cloud

Oct 14, 2025

Less is More: Recursive Reasoning with Tiny Networks

Plus more about Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences and LLM Fine-Tuning Beyond Reinforcement Learning

by cloud

Oct 07, 2025

Training Agents Inside of Scalable World Models

Plus more about Polychromic Objectives for Reinforcement Learning and Stochastic activations

by cloud

Sep 30, 2025

Video models are zero-shot learners and reasoners

Plus more about Thinking Augmented Pre-training and Reinforcement Learning on Pre-Training Data

by cloud

Sep 23, 2025

Pre-training under infinite compute

Plus more about Discovery of Unstable Singularities and AToken: A Unified Tokenizer for Vision

by cloud

Sep 16, 2025

Defeating Nondeterminism in LLM Inference

Plus more about Analog in-memory computing attention mechanism for fast and energy-efficient large language models, and the Majority is not always right: RL training for solution aggregation

Naman, +1

Sep 09, 2025

Why Do MLLMs Struggle with Spatial Understanding?

Plus more about Small Language Models are the Future of Agentic AI and Why Online Reinforcement Learning Forgets Less

Naman, +1

Sep 03, 2025

Prophesy in LLMs: DIFFUSION LANGUAGE MODELS KNOW THE ANSWER BEFORE DECODING

Plus more about StepWiser: Stepwise Generative Judges for Wiser Reasoning and Predicting the Order of Upcoming Tokens Improves Language Modeling

by cloud

Aug 26, 2025

Has GPT-5 Achieved Spatial Intelligence?

Plus more about Reinforcement Learning with Rubric Anchors and DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Naman, +1

Aug 19, 2025

How AI is Learning to Reason: RL Tricks, Policy Optimization, and the New WebWatcher Agent

In this article, we will analyze the use of Reinforcement Learning for LLM reasoning, a new policy optimization method for more concise outputs, and the groundbreaking WebWatcher vision-language research agent.

by cloud

