Spaces:
Running
Running
Commit History
chore: fix model repo names
11fdbf3
verified
chore: fix model repo names
bf42c1e
verified
data: add u-math models
ed6309a
verified
fix: dedup o-mini
611ee4d
verified
update older model scores
746deed
verified
update reasoning models scores
05f7d31
verified
chore: update gemini thinking percent to rate
c160aaa
Konstantin Chernyshev
commited on
fix: rescale gemini2 flash-thinking u-math scores
c0d99c1
verified
data: add gemini-2.0-flash-thinking-exp-01-21 U-MATH scores
88c7969
verified
chore: update r1/o3-mini, fix buttons
aac20b5
Konstantin Chernyshev
commited on
data: commit mu-math numbers
1589444
verified
chore: add more u-math models
51510cb
Konstantin Chernyshev
commited on
fix: add charts
c933ce0
Konstantin Chernyshev
commited on
chore: add u-math results
4790bc5
Konstantin Chernyshev
commited on
feat: add mvp leaderboard
ff4f460
Konstantin Chernyshev
commited on