Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated fromย
benediktstroebl/hal
agent-evals
/
core_leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
e802557
core_leaderboard
/
utils
Ctrl+K
Ctrl+K
3 contributors
History:
8 commits
benediktstroebl
changed pareto plot size
60d47bb
9 months ago
data.py
Safe
9.47 kB
format update and added monitor llm client backend
9 months ago
pareto.py
Safe
1.34 kB
big update with raw predictions section and dropdowns that dynamically parse agents of current leaderboard
9 months ago
processing.py
Safe
5.97 kB
added try catch loop to analze agent steps call
9 months ago
viz.py
Safe
8.58 kB
changed pareto plot size
9 months ago