The Shape of Addition: Geometric Structures of Arithmetic in Large Language Models Paper • 2606.03645 • Published 11 days ago • 4
LongLive-RAG: A General Retrieval-Augmented Framework for Long Video Generation Paper • 2606.02553 • Published 8 days ago • 19
LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws Paper • 2605.23901 • Published 18 days ago • 13
BitCPM-CANN Collection Full-pipeline ternary quantized model trained on CANN. • 12 items • Updated 16 days ago • 27
Mix-Quant: Quantized Prefilling, Precise Decoding for Agentic LLMs Paper • 2605.20315 • Published 21 days ago • 28
Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos Paper • 2605.18233 • Published 22 days ago • 92
Post-Trained MoE Can Skip Half Experts via Self-Distillation Paper • 2605.18643 • Published 22 days ago • 30