Running 109 Unlocking On-Policy Distillation for Any Model Family 📝 109 Visualize on-policy distillation for any model family
Running Agents 7 Dataset Length Profiler 👁 7 Estimate optimal max_length for SFT training with token analysis
Running 3.87k The Ultra-Scale Playbook 🌌 3.87k The ultimate guide to training LLM on large GPU Clusters
Running Agents 88 Large Reasoning Models Leaderboard 🐳 88 A leaderboard to rank large reasoning models
Running 600 Scaling test-time compute 📈 600 Boost LLM answers with flexible test‑time search strategies
Running Agents 430 Reward Bench Leaderboard 📐 430 Explore and compare model scores on RewardBench benchmarks
Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots