The AI Timeline
Follow the latest cutting-edge AI research in 5 minutes a week.
FreeFlow, DeepSeekMath-V2, Soft Adaptive Policy Optimization, and more
Breaking down "Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?"
LeJEPA, The Path Not Taken, and more
Read about "Rethinking Training Signals in RLVR", why LLMs are headless chickens, and "Learning to Reason without External Rewards"
Plus more about Transformer² and Kimi k1.5
Basically recapping what I missed in the last 4 months
plus more on Memorization Dynamics in Knowledge Distillation and Efficient Agents
plus more on DroPE: Dropping RoPE, STEM, and Dr. Zero
and more on Dead Salmons of AI Interp, GDPO, From Entropy to Epiplexity
And more about Recursive Language Models, LongCat ZigZag Attention, and LoRA RL
plus more on Self-Play SWE-RL, Step DeepResearch, and Attention Is Not What You Need
Next-Embedding Prediction Makes Strong Vision Learners, Let's (not) just put things in Context, Spherical Equivariant Graph Transformers, and more
Scaling Up Diffusion Language Models to 100B, Adding 1 Attention Layer & Make Visual Encoders Generate Images, LayerNorm Is Not Needed In Transformer, and more
PretrainZero, Stabilizing RL with LLMs and more
Plus more on Seer, Virtual Width Networks, SAM 3, and Evolution Strategies at the Hyperscale