Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated fromย
benediktstroebl/hal
agent-evals
/
core_leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
6a40c60
core_leaderboard
/
evals
Ctrl+K
Ctrl+K
3 contributors
History:
10 commits
benediktstroebl
updated visibility feature
ad4ec76
9 months ago
usaco_traces
updated visibility feature
9 months ago
swebench_lite_example_agent_1722587866.json
Safe
8.44 kB
update
9 months ago
swebench_lite_example_agent_17227906123.json
Safe
10.4 kB
update
9 months ago
swebench_lite_example_agent_1722790656.json
Safe
10.4 kB
update
9 months ago
usaco_USACO_Zero-shot_gpt-4o-mini-2024-07-18_1723149367.json
Safe
692 MB
LFS
added test data
9 months ago
usaco_usaco_example_agent_1722871.json
Safe
2.52 kB
added usaco
9 months ago
usaco_usaco_example_agent_17228715212.json
Safe
2.52 kB
update
9 months ago
usaco_usaco_example_agent_1722871527.json
Safe
2.52 kB
updated agents
9 months ago
usaco_usaco_test_1723067278.json
Safe
16.9 kB
added test data
9 months ago
usaco_usaco_test_1723069675.json
Safe
15.4 kB
added test data
9 months ago