Space • Running • Featured • 85 • 📝 Distilling 100B+ Models 40x Faster with TRL • TRL distillation for 100B+ teachers, 40x faster
Article • Fine-tuning LLMs to 1.58bit: extreme quantization made easy • medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf • Sep 18, 2024 • 280
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published Apr 25, 2025 • 53
Space • Running on Zero • Agents • 810 • 🏢 IndexTTS 2 Demo • Generate expressive speech from text with voice and emotion control
Space • Running on Zero • Agents • Featured • 489 • ✒ Qwen Image Edit Fast! • Fast 8-step inference of Qwen Image Edit
Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Paper • 2506.09250 • Published Jun 10, 2025 • 27 • 6