Running 62 Stick To Your Role! Leaderboard 🎠62 Benchmarking LLMs on the stability of simulated populations
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • 109B • Updated May 22, 2025 • 452k • • 1.3k
view article Article Improving Prompt Consistency with Structured Generations +1 willkurt, remi, clefourrier • Apr 30, 2024 • 68
Running 600 Scaling test-time compute 📈 600 Boost LLM answers with flexible test‑time search strategies
Llama 3.3 (All Versions) Collection Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 2 days ago • 38
meta-llama/Llama-3.3-70B-Instruct Text Generation • 71B • Updated Dec 21, 2024 • 787k • • 2.81k
Running Agents 111 Judge Arena 💻 111 View and compare open‑source AI model rankings with ELO scores