Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
benediktstroebl/hal
agent-evals
/
core_leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
fda0adb
core_leaderboard
Ctrl+K
Ctrl+K
3 contributors
History:
102 commits
benediktstroebl
Upload viz.py
fda0adb
verified
8 months ago
agent_monitor
Big update with SQL backend
9 months ago
evals_live
Upload swebench_verified_Agentless_gpt-4o-mini-2024-07-18_50_Instances_1723916965.json
9 months ago
evals_processed
init files to keep dirs open
9 months ago
evals_upload
init files to keep dirs open
9 months ago
utils
Upload viz.py
8 months ago
.gitattributes
Safe
2.05 kB
Upload preprocessed_traces.db
9 months ago
.gitignore
Safe
139 Bytes
Update .gitignore
8 months ago
README copy.md
Safe
14.7 kB
init
9 months ago
README.md
Safe
236 Bytes
initial commit
9 months ago
about.md
Safe
7.17 kB
modified heading and added about tab text
8 months ago
app.py
Safe
52.2 kB
Upload app.py
8 months ago
config.py
Safe
1.6 kB
Update
9 months ago
css.css
Safe
997 Bytes
vis update
8 months ago
envs.py
Safe
191 Bytes
added auto update
9 months ago
header.md
Safe
118 Bytes
vis update
8 months ago
preprocessed_traces.db
Safe
983 MB
LFS
Upload preprocessed_traces.db
8 months ago
requirements.txt
Safe
1.85 kB
update
9 months ago
scratch.py
Safe
1.61 kB
vis update
8 months ago
verified_agents.yaml
Safe
1.26 kB
added verified agents management and column and fixed widths
8 months ago