plus more about V-JEPA 2.1, Mamba 3, and latent planning
and more about GLM-OCR, pre-pre-training on NCA, IndexCache, and neural thickets
and more about Speculative Speculative Decoding, SWE-CI, and Beyond Language Modeling
plus more on Learning Without Training and The Geometry of Noise
plus more about Experiential RL, GLM-5 Report, and Attention Matching
plus more on Evolving Agents via Recursive Skill-Augmented RL and Low Hanging Fruits in Vision Transformers
an insane big week in AI reseasrch
An early preview of Continual Learning in 2026
and more on Quantization-Aware Distillation for NVFP4, RL via Self-Distillation
plus more on Memorization Dynamics in Knowledge Distillation and Efficient Agents
plus more on DroPE: Dropping RoPE, STEM, and Dr. Zero
and more on Dead Salmons of AI Interp, GDPO, From Entropy to Epiplexity