aboutsummaryrefslogtreecommitdiff
AgeCommit message (Expand)AuthorFilesLines
2024-03-01fix(ci/arena): Fix error accessing `context` & improve log output readabilityGravatar Reinier van der Leer 1-20/+20
2024-03-01fix(ci/arena): Fix syntax & formatting errorsGravatar Reinier van der Leer 1-6/+6
2024-03-01feat(ci/arena): Add logging and debug output to workflow scriptGravatar Reinier van der Leer 1-0/+26
2024-03-01ci(arena): Fix `arena-intake` workflowGravatar Reinier van der Leer 1-4/+4
2024-03-01ci(arena): Fix `arena-intake` workflowGravatar Reinier van der Leer 1-15/+22
2024-03-01ci: Add 'Arena intake' workflow to automatically check 'entering the arena' PRsGravatar Reinier van der Leer 1-0/+133
2024-02-29ci: Auto-label PRs based on the scope of their diffGravatar Reinier van der Leer 2-0/+34
2024-02-29chore: Change `agbenchmark` to directory dependency in `autogpt` and `forge` ...Gravatar Reinier van der Leer 4-17/+11
2024-02-29fix(benchmark/reports): Resolve error in format.py on `attempt.cost` is `None`Gravatar Reinier van der Leer 1-1/+2
2024-02-29feat(agent): Catch & disallow duplicate commands in LLM response parser (#6937)Gravatar Krzysztof Czerwinski 2-0/+18
2024-02-29lint(agent): Fix linting error in api_manager.pyGravatar Reinier van der Leer 1-1/+1
2024-02-29fix(agent/llm): Fix support for AzureOpenAI (#6927)Gravatar edwardsp 2-5/+15
2024-02-28fix(agent/security): Make CORS more restrictive and configurableGravatar Reinier van der Leer 2-5/+10
2024-02-28feat(agent): Gracefully handle failure to load non-existing agent (#6938)Gravatar Krzysztof Czerwinski 1-1/+6
2024-02-28fix(agent/execute_code): Disable code execution commands when Docker is unava...Gravatar Krzysztof Czerwinski 1-9/+30
2024-02-27fix(agent/security): Mitigate shell injection vulnerabilities (#6903)Gravatar Elias Hohl 2-17/+33
2024-02-22Update CODEOWNERSGravatar Reinier van der Leer 1-5/+5
2024-02-21fix(frontend): Unbreak `ChatInputField`Gravatar Reinier van der Leer 1-2/+1
2024-02-21fix(ci/frontend): Add trigger on `push` including workflow fileGravatar Reinier van der Leer 1-0/+1
2024-02-21fix(ci/frontend): Add and fix trigger on workflow fileGravatar Reinier van der Leer 1-1/+1
2024-02-21ci: Revise Frontend CIGravatar Reinier van der Leer 2-46/+59
2024-02-20chore(agent/llm): Update model alias `gpt-3.5-turbo` -> `gpt-3.5-turbo-0125`Gravatar Reinier van der Leer 1-1/+2
2024-02-20fix(ci/benchmark): Install benchmark dependenciesGravatar Reinier van der Leer 1-0/+2
2024-02-20fix(benchmark/reports): Make format.py executableGravatar Reinier van der Leer 1-0/+2
2024-02-20fix(agent/browser): Print descriptive error if ChromeDriver install failsGravatar Reinier van der Leer 1-7/+14
2024-02-20fix(agent/llm): Include `id` in tool_calls in promptGravatar Reinier van der Leer 2-2/+4
2024-02-20fix(autogpt/llm): Omit `AssistantChatMessage.tool_calls` if no tool calls are...Gravatar Reinier van der Leer 2-3/+6
2024-02-20fix(ci/benchmark): Specify poetry env path for report conversion stepGravatar Reinier van der Leer 1-1/+1
2024-02-20fix(benchmark/challenges): Improve spec and eval of TicTacToe challengeGravatar Albert Örwall 2-2/+2
2024-02-20fix(agent/setup): Fix revising constraints and best practices (#6777)Gravatar Thunder Drag 1-3/+19
2024-02-20feat(frontend): Allow sending a message with the enter key (#6378)Gravatar Ethan Presberg 1-0/+6
2024-02-20fix(ci/benchmark): Unbreak "Push reports to data branch" stepGravatar Reinier van der Leer 1-1/+2
2024-02-19feat(ci/benchmark): Generate step summary from benchmark reportGravatar Reinier van der Leer 1-0/+12
2024-02-19feat(benchmark): Add reports/format.py script to convert report.json to markdownGravatar Reinier van der Leer 1-0/+136
2024-02-19chore: Update `agbenchmark` dependency for agent and forgeGravatar Reinier van der Leer 2-2/+2
2024-02-19feat(benchmark): Include Steps in ReportGravatar Reinier van der Leer 4-1/+16
2024-02-18chore: Update `agbenchmark` dependency for agent and forgeGravatar Reinier van der Leer 2-2/+2
2024-02-18debug(benchmark): Improve `TestResult` validation error output formatGravatar Reinier van der Leer 1-5/+8
2024-02-17fix(ci/benchmark): Mitigate VCS conflicts with files in data branchGravatar Reinier van der Leer 1-0/+3
2024-02-17fix(ci/benchmark): Add `set +e` because we expect (some) challenges to failGravatar Reinier van der Leer 1-0/+2
2024-02-17chore: Update `agbenchmark` dependency for agent and forgeGravatar Reinier van der Leer 2-2/+2
2024-02-17debug(benchmark): Add more debug code to pinpoint cause of rare crashGravatar Reinier van der Leer 2-15/+23
2024-02-17ci: Allow telemetry for non-push events, as long as it's on `master`Gravatar Reinier van der Leer 5-8/+3
2024-02-17ci: Fix setting/passing `TELEMETRY_*` environment variablesGravatar Reinier van der Leer 4-17/+11
2024-02-17chore: Update `agbenchmark` dependency for agent and forgeGravatar Reinier van der Leer 2-3/+3
2024-02-17ci: Update actions to newest versionsGravatar Reinier van der Leer 10-36/+38
2024-02-17debug(benchmark): Make sure `TestResult` validator error output is sufficient...Gravatar Reinier van der Leer 1-1/+1
2024-02-17debug(benchmark): Add log statement to validator on `TestResult`Gravatar Reinier van der Leer 1-0/+8
2024-02-17fix(ci/benchmark): Allow workflow to continue regardless of challenge outcomesGravatar Reinier van der Leer 1-0/+7
2024-02-16chore: Update agbenchmark dependency for agent and forgeGravatar Reinier van der Leer 2-2/+2