Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
benediktstroebl/hal
agent-evals
/
core_leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
de4df51
core_leaderboard
Ctrl+K
Ctrl+K
3 contributors
History:
117 commits
benediktstroebl
Upload preprocessed_traces.db
de4df51
verified
8 months ago
agent_monitor
Big update with SQL backend
9 months ago
evals_live
Upload swebench_verified_Agentless_gpt-4o-mini-2024-07-18_50_Instances_1723916965.json
9 months ago
evals_processed
init files to keep dirs open
9 months ago
evals_upload
init files to keep dirs open
9 months ago
utils
Upload viz.py
8 months ago
.gitattributes
Safe
2.05 kB
Upload preprocessed_traces.db
9 months ago
.gitignore
Safe
139 Bytes
Update .gitignore
8 months ago
README copy.md
Safe
14.7 kB
init
9 months ago
README.md
Safe
236 Bytes
initial commit
9 months ago
about.md
Safe
5.39 kB
Upload 3 files
8 months ago
agent_submission.md
Safe
1.28 kB
Upload 5 files
8 months ago
app.py
Safe
61.8 kB
Upload app.py
8 months ago
benchmark_submission.md
Safe
496 Bytes
Upload 3 files
8 months ago
config.py
Safe
1.62 kB
Upload config.py
8 months ago
css.css
Safe
997 Bytes
vis update
8 months ago
envs.py
Safe
191 Bytes
added auto update
9 months ago
hal.ico
Safe
15.4 kB
Upload 5 files
8 months ago
hal.png
Safe
1.03 kB
Upload 5 files
8 months ago
header.md
Safe
118 Bytes
vis update
8 months ago
preprocessed_traces.db
1.16 GB
LFS
Upload preprocessed_traces.db
8 months ago
requirements.txt
Safe
1.86 kB
Upload requirements.txt
8 months ago
scratch.py
Safe
1.61 kB
vis update
8 months ago
verified_agents.yaml
Safe
2.6 kB
Upload verified_agents.yaml
8 months ago