Commit History

agent submission instructions
2c91b5e

Zachary Siegel commited on

verify the agents
abf78cc

Zachary Siegel commited on

add results to leaderboard
8de3f0a

Zachary Siegel commited on

remove agents
797d23f

Zachary Siegel commited on

added first agent to leaderboard
64319c0

Zachary Siegel commited on

scaffold for core bench
b335ab8

Zachary Siegel commited on

core bench outline
2faf3bd

Zachary Siegel commited on

Upload preprocessed_traces.db
de4df51
verified

benediktstroebl commited on

Upload verified_agents.yaml
e92240d
verified

benediktstroebl commited on

Upload requirements.txt
b56511a
verified

benediktstroebl commited on

Upload preprocessed_traces.db
7db4465
verified

benediktstroebl commited on

Upload preprocessed_traces.db
bce89cb
verified

benediktstroebl commited on

modified heading and added about tab text
c50a008

benediktstroebl commited on

added one line descriptions to each benchmark with acknowledgements and modified headline
4e68e9f

benediktstroebl commited on

Upload preprocessed_traces.db
c4276af
verified

benediktstroebl commited on

Delete preprocessed_traces.db
040eed7
verified

benediktstroebl commited on

added verified agents management and column and fixed widths
b7d1f08

benediktstroebl commited on

Merge branch 'main' of https://huggingface.co/spaces/agent-evals/leaderboard
9d2915b

benediktstroebl commited on

Upload preprocessed_traces.db
338177f
verified

benediktstroebl commited on

Upload preprocessed_traces.db
77c3be7
verified

benediktstroebl commited on

Upload swebench_verified_Agentless_gpt-4o-mini-2024-07-18_50_Instances_1723916965.json
01fb261
verified

benediktstroebl commited on

Delete evals_live/swebench_verified_Agentless_gpt-4o-2024-07-18_50_Instances_1723916965.json
e23eddc
verified

benediktstroebl commited on

Upload swebench_verified_Agentless_gpt-4o-2024-07-18_50_Instances_1723916965.json
a2d5cb2
verified

benediktstroebl commited on