{
  "config": {
    "agent_name": "example_agent_1",
    "benchmark_name": "swebench_lite",
    "date": "2021-10-01"
  },
  "results": {
    "accuracy": 12,
    "total_cost": 34
  }
}