Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated fromย
benediktstroebl/hal
agent-evals
/
core_leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
766750f
core_leaderboard
/
evals_live
Ctrl+K
Ctrl+K
3 contributors
History:
2 commits
benediktstroebl
data reformatting for demo
766750f
9 months ago
swebench_lite_example_agent_1722587866.json
Safe
8.44 kB
big update with raw predictions section and dropdowns that dynamically parse agents of current leaderboard
9 months ago
swebench_lite_example_agent_17227906123.json
Safe
10.4 kB
big update with raw predictions section and dropdowns that dynamically parse agents of current leaderboard
9 months ago
swebench_lite_example_agent_1722790656.json
Safe
10.4 kB
big update with raw predictions section and dropdowns that dynamically parse agents of current leaderboard
9 months ago
usaco_USACO_Zero-shot_gpt-4o-mini-2024-07-18_1723149367.json
Safe
692 MB
LFS
big update with raw predictions section and dropdowns that dynamically parse agents of current leaderboard
9 months ago
usaco_usaco_example_agent_1722871.json
Safe
2.52 kB
data reformatting for demo
9 months ago
usaco_usaco_example_agent_1722871527.json
Safe
4.57 kB
big update with raw predictions section and dropdowns that dynamically parse agents of current leaderboard
9 months ago
usaco_usaco_test_172306727812321123.json
Safe
23.9 kB
data reformatting for demo
9 months ago