The AI Timeline
Follow The Latest Cutting Edge AI Research in 5 minutes a week.
Connect
FreeFlow, DeepSeekMath-V2, Soft Adaptive Policy Optimization, and more
Breaking down "Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?"
LeJEPA, The Path Not Taken, and more
Read about "Rethinking Training Signals in RLVR", why LLMs are headless chickens, and "Learning to Reason without External Rewards"
Premium Insights: A recap of popular AI research papers and research trends in May 2025
Plus more about Transformer2 and Kimi k1.5
PretrainZero, Stabilizing RL with LLMs and more
Plus more on Seer, Virtual Width Networks, SAM 3, and Evolution Strategies at the Hyperscale
From Memorization to Reasoning in the Spectrum of Loss Curvature and Introducing Nested Learning: A new ML paradigm for continual learning
and more on Kimi Linear, Looped Transformer, How FP16 fixes RL...
How to Compress Long Text into Images To Reduce LLM Tokens and more
RLM, RAE, Reasoning with Sampling, and more
Plus more about Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences and LLM Fine-Tuning Beyond Reinforcement Learning
Plus more about Polychromic Objectives for Reinforcement Learning and Stochastic activations
Plus more about Thinking Augmented Pre-training and Reinforcement Learning on Pre-Training Data