Auto-GPT.git
An experimental open-source attempt to make GPT-4 fully autonomous.
Torantulino
| Date | Commit message | Author | Files | Lines |
|---|---|---|---|---|
| 2024-02-21 | Update frontend build based on commit e44ca4185a0d56273b7e2cd68fc5ca42a4f9b73e [frontend-build/master] | Pwuts | 3 | -11046/+11057 |
| 2024-02-21 | fix(frontend): Unbreak `ChatInputField` | Reinier van der Leer | 1 | -2/+1 |
| 2024-02-21 | fix(ci/frontend): Add trigger on `push` including workflow file | Reinier van der Leer | 1 | -0/+1 |
| 2024-02-21 | fix(ci/frontend): Add and fix trigger on workflow file | Reinier van der Leer | 1 | -1/+1 |
| 2024-02-21 | ci: Revise Frontend CI | Reinier van der Leer | 2 | -46/+59 |
| 2024-02-20 | chore(agent/llm): Update model alias `gpt-3.5-turbo` -> `gpt-3.5-turbo-0125` | Reinier van der Leer | 1 | -1/+2 |
| 2024-02-20 | fix(ci/benchmark): Install benchmark dependencies | Reinier van der Leer | 1 | -0/+2 |
| 2024-02-20 | fix(benchmark/reports): Make format.py executable | Reinier van der Leer | 1 | -0/+2 |
| 2024-02-20 | fix(agent/browser): Print descriptive error if ChromeDriver install fails | Reinier van der Leer | 1 | -7/+14 |
| 2024-02-20 | fix(agent/llm): Include `id` in tool_calls in prompt | Reinier van der Leer | 2 | -2/+4 |
| 2024-02-20 | fix(autogpt/llm): Omit `AssistantChatMessage.tool_calls` if no tool calls are... | Reinier van der Leer | 2 | -3/+6 |
| 2024-02-20 | fix(ci/benchmark): Specify poetry env path for report conversion step | Reinier van der Leer | 1 | -1/+1 |
| 2024-02-20 | fix(benchmark/challenges): Improve spec and eval of TicTacToe challenge | Albert Örwall | 2 | -2/+2 |
| 2024-02-20 | fix(agent/setup): Fix revising constraints and best practices (#6777) | Thunder Drag | 1 | -3/+19 |
| 2024-02-20 | feat(frontend): Allow sending a message with the enter key (#6378) | Ethan Presberg | 1 | -0/+6 |
| 2024-02-20 | fix(ci/benchmark): Unbreak "Push reports to data branch" step | Reinier van der Leer | 1 | -1/+2 |
| 2024-02-19 | feat(ci/benchmark): Generate step summary from benchmark report | Reinier van der Leer | 1 | -0/+12 |
| 2024-02-19 | feat(benchmark): Add reports/format.py script to convert report.json to markdown | Reinier van der Leer | 1 | -0/+136 |
| 2024-02-19 | chore: Update `agbenchmark` dependency for agent and forge | Reinier van der Leer | 2 | -2/+2 |
| 2024-02-19 | feat(benchmark): Include Steps in Report | Reinier van der Leer | 4 | -1/+16 |
| 2024-02-18 | chore: Update `agbenchmark` dependency for agent and forge | Reinier van der Leer | 2 | -2/+2 |
| 2024-02-18 | debug(benchmark): Improve `TestResult` validation error output format | Reinier van der Leer | 1 | -5/+8 |
| 2024-02-17 | fix(ci/benchmark): Mitigate VCS conflicts with files in data branch | Reinier van der Leer | 1 | -0/+3 |
| 2024-02-17 | fix(ci/benchmark): Add `set +e` because we expect (some) challenges to fail | Reinier van der Leer | 1 | -0/+2 |
| 2024-02-17 | chore: Update `agbenchmark` dependency for agent and forge | Reinier van der Leer | 2 | -2/+2 |
| 2024-02-17 | debug(benchmark): Add more debug code to pinpoint cause of rare crash | Reinier van der Leer | 2 | -15/+23 |
| 2024-02-17 | ci: Allow telemetry for non-push events, as long as it's on `master` | Reinier van der Leer | 5 | -8/+3 |
| 2024-02-17 | ci: Fix setting/passing `TELEMETRY_*` environment variables | Reinier van der Leer | 4 | -17/+11 |
| 2024-02-17 | chore: Update `agbenchmark` dependency for agent and forge | Reinier van der Leer | 2 | -3/+3 |
| 2024-02-17 | ci: Update actions to newest versions | Reinier van der Leer | 10 | -36/+38 |
| 2024-02-17 | debug(benchmark): Make sure `TestResult` validator error output is sufficient... | Reinier van der Leer | 1 | -1/+1 |
| 2024-02-17 | debug(benchmark): Add log statement to validator on `TestResult` | Reinier van der Leer | 1 | -0/+8 |
| 2024-02-17 | fix(ci/benchmark): Allow workflow to continue regardless of challenge outcomes | Reinier van der Leer | 1 | -0/+7 |
| 2024-02-16 | chore: Update agbenchmark dependency for agent and forge | Reinier van der Leer | 2 | -2/+2 |
| 2024-02-16 | fix(benchmark): Fix `TestResult.fail_reason` assignment condition | Reinier van der Leer | 1 | -1/+1 |
| 2024-02-16 | chore: Update `agbenchmark` dependency for agent and forge | Reinier van der Leer | 2 | -2/+2 |
| 2024-02-16 | fix(benchmark): Unbreak `-N`/`--attempts` option | Reinier van der Leer | 3 | -4/+4 |
| 2024-02-16 | Rename autogpts-benchmark-nightly.yml to autogpts-benchmark.yml | Reinier van der Leer | 1 | -0/+0 |
| 2024-02-16 | feat(agent/serve): Report task cost through `Step.additional_output` | Reinier van der Leer | 3 | -10/+21 |
| 2024-02-16 | feat(benchmark): Get agent task cost from `Step.additional_output` | Reinier van der Leer | 3 | -0/+18 |
| 2024-02-16 | feat(benchmark/report): Add and record `TestResult.n_steps` | Reinier van der Leer | 4 | -0/+9 |
| 2024-02-16 | ci(benchmark): Add nightly benchmark workflow | Reinier van der Leer | 1 | -0/+71 |
| 2024-02-16 | lint(benchmark): Remove unnecessary `pass` statement in __main__.py | Reinier van der Leer | 1 | -1/+0 |
| 2024-02-16 | chore: Update `agbenchmark` dependency for agent and forge | Reinier van der Leer | 2 | -2/+32 |
| 2024-02-16 | fix(benchmark): Include `WebArenaSiteInfo.additional_info` (e.g. credentials)... | Reinier van der Leer | 1 | -7/+19 |
| 2024-02-16 | feat(benchmark/cli): Add `challenge list`, `challenge info` subcommands | Reinier van der Leer | 5 | -6/+219 |
| 2024-02-16 | refactor(benchmark): `load_webarena_challenges` | Reinier van der Leer | 2 | -22/+43 |
| 2024-02-15 | chore: Update `agbenchmark` dependency for agent and forge | Reinier van der Leer | 2 | -3/+3 |
| 2024-02-15 | feat(benchmark): Make report output folder configurable | Reinier van der Leer | 6 | -9/+16 |
| 2024-02-15 | feat(agent/telemetry): Distinguish between `production` and `dev` environment... | Reinier van der Leer | 2 | -2/+55 |