Article: Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios. By quotientai and 3 others • May 2
Open LLM Leaderboard 🏆: Track, rank and evaluate open LLMs and chatbots