AIR: Post-training Data Selection for Reasoning via Attention Head Influence Paper • 2512.13279 • Published Dec 15, 2025 • 2
LLMs are Also Effective Embedding Models: An In-depth Overview Paper • 2412.12591 • Published Dec 17, 2024 • 2
MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling Paper • 2606.13473 • Published 8 days ago • 89
view post Post 7132 MiniMax-M3 coming soon.https://github.com/MiniMax-AI/MiniMax-M3 See translation 🔥 34 34 🚀 4 4 😎 2 2 🧠2 2 🤗 1 1 + Reply
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment Paper • 2605.19577 • Published May 19 • 59
OProver: A Unified Framework for Agentic Formal Theorem Proving Paper • 2605.17283 • Published May 17 • 31
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid? Paper • 2605.06527 • Published May 7 • 46
Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context Paper • 2605.13831 • Published May 13 • 87