Running 43 43 Stick To Your Role! Leaderboard 🎠Benchmarking LLMs on the stability of simulated populations
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • Updated 14 days ago • 737k • • 814
Running 548 548 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
Llama 3.3 (All Versions) Collection Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 2 days ago • 37