Add command for directly executing python code #4581

erik-megarad · 2023-06-05T00:46:58Z

Background

When running Auto-GPT it very, very frequently wants to run python code. To do it is usually 3 steps (execute file [that probably does not exist], write file, execute file). Not only is this quite costly, but if there are errors in the code (there usually are), the iteration cycle to fix it is slow and extremely error prone.

This change adds a new command, execute_python_code, which will execute straight python code. It does this by writing the code to a file in the workspace directory, executing it using the existing execute_python_file command, and then returning the results. The file is kept around for later reference.

I know that we're limiting the number of new commands we have, but this one is a 200%+ increase in efficiency for the majority of tasks that Auto-GPT decides to do.

Changes

Add a new command, execute_python_code

Documentation

In-code comments

Test Plan

Wrote a new unit test
Ran a few manual tests, output of one example below:

Input:what is the square root of 213949
SYSTEM:  Human feedback: what is the square root of 213949
MATHGPT THOUGHTS:  I will use Python to calculate the square root of 213949.
REASONING:  Python has a built-in math library that can be used to calculate the square root of a number.
PLAN:
-  Use Python to calculate the square root of 213949
-  Provide the result to the user
CRITICISM:  I need to ensure that the result is accurate and that I am using the correct syntax to calculate the square root.
SPEAK:  I will use Python to calculate the square root of 213949.
NEXT ACTION:  COMMAND = execute_python_code ARGUMENTS = {'code': 'import math\nresult = math.sqrt(213949)\nprint(result)'}
  Enter 'y' to authorise command, 'y -N' to run N continuous commands, 's' to run self-feedback commands, 'n' to exit program, or enter feedback for MathGPT...
  Asking user via keyboard...
Input:y
-=-=-=-=-=-=-= COMMAND AUTHORISED BY USER -=-=-=-=-=-=-=
  Executing python file '/app/autogpt/auto_gpt_workspace/temp_code.py' in working directory '/app/autogpt/auto_gpt_workspace'
SYSTEM:  Command execute_python_code returned: 462.5462139073241

PR Quality Checklist

My pull request is atomic and focuses on a single change.
I have thoroughly tested my changes with multiple different prompts.
I have considered potential risks and mitigations for my changes.
I have documented my changes clearly and comprehensively.
I have not snuck in any "extra" small tweaks changes.

I have run the following commands against my code to ensure it passes our linters:

black .
isort .
mypy
autoflake --remove-all-unused-imports --recursive --ignore-init-module-imports autogpt tests --in-place

vercel · 2023-06-05T00:47:01Z

Deployment failed with the following error:

Resource is limited - try again in 25 minutes (more than 100, code: "api-deployments-free-per-day").

vercel · 2023-06-05T00:52:44Z

Deployment failed with the following error:

Resource is limited - try again in 20 minutes (more than 100, code: "api-deployments-free-per-day").

erik-megarad · 2023-06-05T00:53:15Z

Since this is a new command that will be highly used I've opened a discussion in discord. Will update this PR with any outcome. Also welcome comments directly on the PR

Auto-GPT-Bot · 2023-06-05T02:36:18Z

You changed AutoGPT's behaviour. The cassettes have been updated and will be merged to the submodule when this Pull Request gets merged.

codecov · 2023-06-05T02:41:09Z

Codecov Report

Patch coverage: 84.61% and project coverage change: +0.11 🎉

Comparison is base (3b0d49a) 69.67% compared to head (855edac) 69.79%.

❗ Current head 855edac differs from pull request most recent head 754b993. Consider uploading reports for the commit 754b993 to get more accurate results

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #4581      +/-   ##
==========================================
+ Coverage   69.67%   69.79%   +0.11%     
==========================================
  Files          72       72              
  Lines        3558     3575      +17     
  Branches      569      570       +1     
==========================================
+ Hits         2479     2495      +16     
- Misses        890      891       +1     
  Partials      189      189

Impacted Files	Coverage Δ
autogpt/commands/execute_code.py	`68.93% <84.61%> (+3.37%)`	⬆️

... and 2 files with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

vercel · 2023-06-05T02:42:50Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Ignored Deployment

Name	Status	Preview	Comments	Updated (UTC)
docs	⬜️ Ignored (Inspect)			Jun 9, 2023 5:34pm

waynehamadi · 2023-06-05T02:47:57Z

@erik-megarad we skipped the debug code challenge, we need to unskip it so we can tie the challenges to this command.
This is not a blocker to merge the PR but it would be nice. I will do it when I am done with my current tasks.

Auto-GPT-Bot · 2023-06-05T02:55:35Z

You changed AutoGPT's behaviour. The cassettes have been updated and will be merged to the submodule when this Pull Request gets merged.

Boostrix · 2023-06-05T11:46:17Z

Requesting review by @Pwuts since he's commented on this idea on discord and in related RFEs, and indicated being supportive of this.

I really believe we should strive to get this reviewed and integrated, we have a number of requests related to creating commands (#4549) and tooling (#4214) "on the fly", being able to execute python code directly (including potentially unit tests), could be a first step towards this.

This would also help implement other features, and make a few similar PRs obsolete:

We would only need a way to register certain code as [new] commands.

autogpt/commands/execute_code.py

vercel · 2023-06-05T19:18:22Z

Deployment failed with the following error:

Resource is limited - try again in 3 hours (more than 100, code: "api-deployments-free-per-day").

vercel · 2023-06-05T19:30:50Z

Deployment failed with the following error:

Resource is limited - try again in 2 hours (more than 100, code: "api-deployments-free-per-day").

autogpt/commands/execute_code.py

Auto-GPT-Bot · 2023-06-06T15:11:05Z

You changed AutoGPT's behaviour. The cassettes have been updated and will be merged to the submodule when this Pull Request gets merged.

autogpt/commands/execute_code.py

github-actions · 2023-06-07T01:11:20Z

This pull request has conflicts with the base branch, please resolve those so we can evaluate the pull request.

vercel · 2023-06-07T02:19:49Z

Deployment failed with the following error:

Resource is limited - try again in 38 minutes (more than 100, code: "api-deployments-free-per-day").

Auto-GPT-Bot · 2023-06-07T02:22:51Z

You changed AutoGPT's behaviour. The cassettes have been updated and will be merged to the submodule when this Pull Request gets merged.

vercel · 2023-06-07T03:19:20Z

Deployment failed with the following error:

Resource is limited - try again in 13 minutes (more than 100, code: "api-deployments-free-per-day").

vercel · 2023-06-07T08:02:56Z

Deployment failed with the following error:

Resource is limited - try again in 6 hours (more than 100, code: "api-deployments-free-per-day").

github-actions · 2023-06-07T08:03:08Z

Conflicts have been resolved! 🎉 A maintainer will review the pull request shortly.

Auto-GPT-Bot · 2023-06-07T19:01:06Z

You changed AutoGPT's behaviour. The cassettes have been updated and will be merged to the submodule when this Pull Request gets merged.

Auto-GPT-Bot · 2023-06-09T17:39:26Z

You changed AutoGPT's behaviour. The cassettes have been updated and will be merged to the submodule when this Pull Request gets merged.

Pwuts · 2023-06-09T17:48:00Z

Why did you go with basename rather than filename?

erik-megarad · 2023-06-09T18:12:55Z

I explained it in a previous comment. Using filename caused the AI to always give an absolute path, no matter what I did. Since we’re placing it specifically in another location it was necessary to change the argument name.

…

On Fri, Jun 9, 2023 at 10:48 AM Reinier van der Leer < ***@***.***> wrote: Why did you do with basename rather than filename? — Reply to this email directly, view it on GitHub <#4581 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAABQXHCPMNPYMWGBBZQJN3XKNOVXANCNFSM6AAAAAAY2I2NEM> . You are receiving this because you were mentioned.Message ID: ***@***.***>

github-actions bot added the size/m label Jun 5, 2023

erik-megarad force-pushed the me/work branch from b7c9de2 to 7452920 Compare June 5, 2023 00:52

Auto-GPT-Bot added the behaviour change label Jun 5, 2023

waynehamadi force-pushed the me/work branch from 5651e2e to 1c32ebe Compare June 5, 2023 02:45

Boostrix requested a review from Pwuts June 5, 2023 11:49

Boostrix assigned erik-megarad Jun 5, 2023

Boostrix added command system function: run code labels Jun 5, 2023

Boostrix reviewed Jun 5, 2023

View reviewed changes

autogpt/commands/execute_code.py Outdated Show resolved Hide resolved

Boostrix reviewed Jun 5, 2023

View reviewed changes

autogpt/commands/execute_code.py Show resolved Hide resolved

erik-megarad force-pushed the me/work branch from bb900c7 to 29df5bf Compare June 5, 2023 19:18

github-actions bot added size/l and removed size/m labels Jun 5, 2023

erik-megarad force-pushed the me/work branch from 29df5bf to 1d2576d Compare June 5, 2023 19:30

waynehamadi suggested changes Jun 5, 2023

View reviewed changes

autogpt/commands/execute_code.py Outdated Show resolved Hide resolved

erik-megarad force-pushed the me/work branch 2 times, most recently from d6bae54 to dd466b6 Compare June 6, 2023 15:07

Boostrix reviewed Jun 6, 2023

View reviewed changes

autogpt/commands/execute_code.py Show resolved Hide resolved

erik-megarad force-pushed the me/work branch from 1b43f96 to fd08d49 Compare June 6, 2023 19:35

Pwuts requested changes Jun 6, 2023

View reviewed changes

autogpt/commands/execute_code.py Outdated Show resolved Hide resolved

autogpt/commands/execute_code.py Outdated Show resolved Hide resolved

Add command for directly executing python code

d07026c

erik-megarad force-pushed the me/work branch from fd08d49 to d07026c Compare June 6, 2023 20:44

erik-megarad added 2 commits June 6, 2023 14:23

Merge branch 'master' into me/work

9678bb4

Fix docstring

ac5f161

github-actions bot added the conflicts Automatically applied to PRs with merge conflicts label Jun 7, 2023

github-actions bot added size/m and removed size/l labels Jun 7, 2023

Merge branch 'master' into me/work

855edac

erik-megarad force-pushed the me/work branch from 9d3b60f to 855edac Compare June 7, 2023 08:02

github-actions bot removed the conflicts Automatically applied to PRs with merge conflicts label Jun 7, 2023

Pwuts previously approved these changes Jun 7, 2023

View reviewed changes

Pwuts added this to the v0.4.1 Release milestone Jun 7, 2023

erik-megarad added 2 commits June 7, 2023 10:44

Merge branch 'master' into me/work

6760e1d

Clarify / update filename references

3f6d420

erik-megarad dismissed Pwuts’s stale review via 3f6d420 June 7, 2023 18:58

waynehamadi approved these changes Jun 9, 2023

View reviewed changes

Merge branch 'master' into me/work

754b993

waynehamadi merged commit 94280b2 into Significant-Gravitas:master Jun 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add command for directly executing python code #4581

Add command for directly executing python code #4581

erik-megarad commented Jun 5, 2023 •

edited

Loading

vercel bot commented Jun 5, 2023

vercel bot commented Jun 5, 2023

erik-megarad commented Jun 5, 2023

Auto-GPT-Bot commented Jun 5, 2023

codecov bot commented Jun 5, 2023 •

edited

Loading

vercel bot commented Jun 5, 2023 •

edited

Loading

waynehamadi commented Jun 5, 2023

Auto-GPT-Bot commented Jun 5, 2023

Boostrix commented Jun 5, 2023 •

edited

Loading

vercel bot commented Jun 5, 2023

vercel bot commented Jun 5, 2023

Auto-GPT-Bot commented Jun 6, 2023

github-actions bot commented Jun 7, 2023

vercel bot commented Jun 7, 2023

Auto-GPT-Bot commented Jun 7, 2023

vercel bot commented Jun 7, 2023

vercel bot commented Jun 7, 2023

github-actions bot commented Jun 7, 2023

Auto-GPT-Bot commented Jun 7, 2023

Auto-GPT-Bot commented Jun 9, 2023

Pwuts commented Jun 9, 2023 •

edited

Loading

erik-megarad commented Jun 9, 2023 via email

Add command for directly executing python code #4581

Add command for directly executing python code #4581

Conversation

erik-megarad commented Jun 5, 2023 • edited Loading

Background

Changes

Documentation

Test Plan

PR Quality Checklist

vercel bot commented Jun 5, 2023

vercel bot commented Jun 5, 2023

erik-megarad commented Jun 5, 2023

Auto-GPT-Bot commented Jun 5, 2023

codecov bot commented Jun 5, 2023 • edited Loading

Codecov Report

vercel bot commented Jun 5, 2023 • edited Loading

waynehamadi commented Jun 5, 2023

Auto-GPT-Bot commented Jun 5, 2023

Boostrix commented Jun 5, 2023 • edited Loading

vercel bot commented Jun 5, 2023

vercel bot commented Jun 5, 2023

Auto-GPT-Bot commented Jun 6, 2023

github-actions bot commented Jun 7, 2023

vercel bot commented Jun 7, 2023

Auto-GPT-Bot commented Jun 7, 2023

vercel bot commented Jun 7, 2023

vercel bot commented Jun 7, 2023

github-actions bot commented Jun 7, 2023

Auto-GPT-Bot commented Jun 7, 2023

Auto-GPT-Bot commented Jun 9, 2023

Pwuts commented Jun 9, 2023 • edited Loading

erik-megarad commented Jun 9, 2023 via email

erik-megarad commented Jun 5, 2023 •

edited

Loading

codecov bot commented Jun 5, 2023 •

edited

Loading

vercel bot commented Jun 5, 2023 •

edited

Loading

Boostrix commented Jun 5, 2023 •

edited

Loading

Pwuts commented Jun 9, 2023 •

edited

Loading