Auto-GPT.git
An experimental open-source attempt to make GPT-4 fully autonomous.
Torantulino
path: root/benchmark/agbenchmark/__main__.py
| Age | Commit message | Author | Files | Lines |
|---|---|---|---|---|
| 2024-01-22 | feat(benchmark): Add `-N`, `--attempts` option for multiple attempts per chal... | Reinier van der Leer | 1 | -0/+6 |
| 2024-01-16 | refactor(benchmark): Disable Helicone integrations | Reinier van der Leer | 1 | -13/+13 |
| 2024-01-02 | AGBenchmark codebase clean-up (#6650) | Reinier van der Leer | 1 | -209/+132 |
| 2023-10-02 | Fix benchmark ci (#5478) | merwanehamadi | 1 | -1/+2 |
| 2023-10-02 | add load_dotenv (#5474) | merwanehamadi | 1 | -0/+2 |
| 2023-09-20 | Make agbenchmark a proxy of the evaluated agent (#5279) | merwanehamadi | 1 | -3/+0 |
| 2023-09-18 | Implement old polling mechanism (#5248) | merwanehamadi | 1 | -49/+23 |
| 2023-09-17 | Refactor benchmark (#5247) | merwanehamadi | 1 | -1/+6 |
| 2023-09-16 | Remove start from agbenchmark (#5241) | merwanehamadi | 1 | -7/+2 |
| 2023-09-16 | Add ability to run multiple tests (#5233) | merwanehamadi | 1 | -3/+2 |
| 2023-09-15 | Ability to run by categories (#5229) | merwanehamadi | 1 | -2/+11 |
| 2023-09-15 | add benchmark endpoints mock (#5221) | merwanehamadi | 1 | -0/+7 |
| 2023-09-13 | Support agent protocol in benchmark (#5213) | merwanehamadi | 1 | -1/+1 |
| 2023-09-13 | Fix API Mode (#5209) | merwanehamadi | 1 | -3/+2 |
| 2023-09-13 | fixed multiple report folder bug | SwiftyOS | 1 | -1/+51 |
| 2023-09-13 | Added ability to keep answers | SwiftyOS | 1 | -2/+9 |
| 2023-09-12 | Benchmark changes | Merwane Hamadi | 1 | -0/+254 |