LLM-Perf Leaderboard 🏆 (588, Running, Featured): Compare LLM hardware performance and find the best model.
Big Code Models Leaderboard 📈 (1.51k, Running): Explore and compare code model performance on a leaderboard.
Chat with Gemma-2-9B-Chinese-Chat 💬 (18, Running on Zero): Chat with a Chinese language assistant.
Reward Bench Leaderboard 📐 (431, Running): Explore and compare model scores on RewardBench benchmarks.