Added constraints and evaluations #1874

javableu · 2023-04-16T12:39:24Z

We may be able to improve the AI's return message by giving it a list of straightforward instructions that can be passed to GPT-4. Text generated by a "stupid" AI will be of lower quality than text generated by a "smart" AI. However, even a "stupid" AI can potentially produce results that match those of a "smart" AI: it just needs more iterations, more focus, and a list of simple instructions to follow. It's important to have the AI think before it writes anything to AutoGPT (a critique-plan process applied to a single sentence).

1 Choose a sentence to work on.
2 Compose an impactful sentence.
3 Critique the sentence for clarity, coherence, grammar, and overall effectiveness.
4 Create a plan for revising the sentence, considering the critique.
5 Evaluate the plan to ensure it addresses the issues identified in the critique.
6 Refine the plan as needed.
7 Rephrase the sentence according to the refined plan.
8 Repeat steps 2-7 for additional sentences.
9 Merge the revised sentences.
10 Showcase the integrated text
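
The ten steps above can be sketched as a small loop. This is only an illustration and not the code from this PR: the `llm` callable, the function names `refine_sentence` and `write_text`, the `rounds` parameter, and the exact prompt wording are all assumptions made for the example.

```python
from typing import Callable, List

# Hypothetical type: any function that takes a prompt and returns the model's reply.
LLM = Callable[[str], str]


def refine_sentence(llm: LLM, topic: str, rounds: int = 1) -> str:
    """Steps 2-7: compose, critique, plan, evaluate/refine the plan, rephrase."""
    # Step 2: compose a first draft of the sentence.
    sentence = llm(f"Compose an impactful sentence about: {topic}")
    for _ in range(rounds):
        # Step 3: critique the current draft.
        critique = llm(
            "Critique this sentence for clarity, coherence, grammar, "
            f"and overall effectiveness: {sentence}"
        )
        # Step 4: plan the revision based on the critique.
        plan = llm(f"Create a plan for revising the sentence, considering this critique: {critique}")
        # Steps 5-6: check the plan against the critique and refine it.
        plan = llm(
            "Evaluate this plan against the critique and refine it as needed.\n"
            f"Critique: {critique}\nPlan: {plan}"
        )
        # Step 7: rewrite the sentence following the refined plan.
        sentence = llm(
            "Rephrase the sentence according to the plan.\n"
            f"Sentence: {sentence}\nPlan: {plan}"
        )
    return sentence


def write_text(llm: LLM, topics: List[str], rounds: int = 1) -> str:
    """Steps 1 and 8-10: take each sentence in turn, refine it, merge the results."""
    return " ".join(refine_sentence(llm, topic, rounds) for topic in topics)
```

With `rounds=1` each sentence costs five model calls, so the extra iterations trade token usage for quality; that cost is one reason a benchmark would be useful here.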

For the performance evaluation, the AI tends to get distracted, so adding refocusing criteria might help.

Background

Changes

Documentation

Test Plan

PR Quality Checklist

My pull request is atomic and focuses on a single change.
I have thoroughly tested my changes with multiple different prompts.
I have considered potential risks and mitigations for my changes.
I have documented my changes clearly and comprehensively.
I have not snuck in any "extra" small tweaks changes


javableu · 2023-04-16T17:53:40Z

Since it impacts the main agent's behavior, it is really hard to understand the full implications of this update. It would be nice to have user feedback on this update to see if people have noticed an improvement

nponeccop · 2023-04-16T18:18:14Z

Yes, the label "Needs benchmark" is exactly for that

github-actions · 2023-04-19T23:57:24Z

This pull request has conflicts with the base branch, please resolve those so we can evaluate the pull request.

p-i- · 2023-05-05T00:57:51Z

This is a mass message from the AutoGPT core team.
Our apologies for the ongoing delay in processing PRs.
This is because we are re-architecting the AutoGPT core!

For more details (and for info on joining our Discord), please refer to:
https://github.com/Significant-Gravitas/Auto-GPT/wiki/Architecting

waynehamadi · 2023-05-05T16:20:24Z

@javableu we're currently building challenges and I would like to discuss this with you.

Please join us on Discord through this link https://discord.gg/autogpt (if you haven't already)

DM me on the Auto-GPT discord channel (my discord is merwanehamadi).

Boostrix · 2023-05-06T17:52:35Z

Since it impacts the main agent's behavior, it is really hard to understand the full implications of this update. It would be nice to have user feedback on this update to see if people have noticed an improvement

Given that these PRs are so common but, due to the lack of testing data, also so controversial, how about adding support for "prompt profiles": a shared folder with sub-folders that contain profiles for different prompts, so that people can tinker with different profiles without stepping on anyone's toes?

Having this sort of infrastructure in place would also provide a good baseline for regression testing and benchmarking.
It would not directly be the solution proposed in #3858, but probably would be the only solution to have our cake and eat it, while also growing test data:

Prompt Profiles #3954
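
To make the "prompt profiles" idea concrete, here is a minimal sketch of how such a folder could be read. It is not taken from #3954 or from the Auto-GPT code base: the directory layout (one sub-folder per profile, one `.txt` file per prompt section, one entry per line) and the function names `list_profiles` and `load_profile` are assumptions for illustration only.

```python
from pathlib import Path
from typing import Dict, List


def list_profiles(root: Path) -> List[str]:
    """Return the names of all profile sub-folders under the shared folder."""
    return sorted(p.name for p in root.iterdir() if p.is_dir())


def load_profile(root: Path, name: str) -> Dict[str, List[str]]:
    """Load one profile: each *.txt file becomes a section of non-empty lines.

    Assumed (hypothetical) layout: <root>/<profile>/constraints.txt,
    <root>/<profile>/evaluations.txt, and so on.
    """
    profile: Dict[str, List[str]] = {}
    for path in sorted((root / name).glob("*.txt")):
        lines = path.read_text(encoding="utf-8").splitlines()
        # Keep one entry per non-blank line, stripped of surrounding whitespace.
        profile[path.stem] = [line.strip() for line in lines if line.strip()]
    return profile
```

Because every profile is plain data on disk, the same loader could feed a regression test that runs a fixed set of tasks against each profile and compares the results.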

anonhostpi · 2023-05-07T06:58:33Z

Constraints awareness issue - #3466

nponeccop previously approved these changes Apr 16, 2023


nponeccop added the Needs Benchmark (This change is hard to test and requires a benchmark) and B7 labels Apr 16, 2023

Update prompt.py

8854f8e

javableu dismissed nponeccop’s stale review via 8854f8e April 16, 2023 16:17

nponeccop mentioned this pull request Apr 17, 2023

PR Batch 7 #2256

Closed

1 task

github-actions bot added the conflicts (Automatically applied to PRs with merge conflicts) label Apr 19, 2023

This was referenced May 6, 2023

Input failure loop fix #298

Closed

Full prompt test cases #1354

Closed

This was referenced May 7, 2023

Improve website summary quality via browse prompt change #3551

Closed

Supporting different languages #1563

Closed

Centralize all of our prompts in one place so people can change it easily #3858

Closed

javableu closed this by deleting the head repository May 13, 2023
