strict accuracy
on IFEval (0-Shot)
Open LLM Leaderboard
82.920
normalized accuracy
on BBH (3-Shot)
Open LLM Leaderboard
48.050
exact match
on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard
54.230
acc_norm
on GPQA (0-shot)
Open LLM Leaderboard
12.300
acc_norm
on MuSR (0-shot)
Open LLM Leaderboard
13.150
accuracy
on MMLU-PRO (5-shot)
test set
Open LLM Leaderboard
44.650