Auto-switch to gpt-35-turbo, gpt-4 and gpt-4-32k when number of tokens exceeded by query #3367
Comments
I strongly support this proposal. It should be easy to implement and would definitely help make this tool actually useful.
You could probably use some sort of preprocessor/preparation stage prior to passing such contexts to the LLM.
Yep, I could do that, and I do. I end up splitting my text assignments into three separate runs of AutoGPT just to avoid getting an error... This, however, is time-consuming and impractical. The ability of the tool to dynamically call the larger LLM model when applicable, combined with better chunking, would definitely reduce the number of fatal errors.
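A minimal sketch of such a chunking preprocessor, assuming tiktoken for token counting (the function name chunk_by_tokens is invented here for illustration):

```python
import tiktoken

def chunk_by_tokens(text: str, max_tokens: int = 3000,
                    model: str = "gpt-3.5-turbo") -> list[str]:
    """Split text into pieces that each fit within max_tokens."""
    encoding = tiktoken.encoding_for_model(model)
    tokens = encoding.encode(text)
    return [
        encoding.decode(tokens[i:i + max_tokens])
        for i in range(0, len(tokens), max_tokens)
    ]
```

Each chunk can then be summarized separately instead of failing the whole run.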
Has there been any workaround for this? I thought Auto-GPT used gpt-4, which has a greater token limit than 3.5, but I'm still hitting the 4097 max-token limit.
It depends on the level of OpenAI API access you've got.
I'd like to say that there should be a switching mechanism that switches between all of the supported APIs, not just OpenAI's models/APIs. @p-i- perhaps if and when the repository gets around to implementing the APIs as plugins, add a plugin object that reports the rate limit associated with that API, so that AutoGPT can switch plugins entirely, not just models.
Which seems to be a work in progress: #2158
👍 The basic idea is this: #3466
Love ya boostrix, which one are you on the Discord server?
That's a form of feature scaling (see #3466 and #528), but agreed: if one model fails, there should be an option to try another one, even if that's not the preferred one.
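A sketch of what such a limit-reporting plugin object might look like; the interface and every name in it are hypothetical, since the plugin API was still in progress (#2158) when this was discussed:

```python
from abc import ABC, abstractmethod

class ModelProviderPlugin(ABC):
    """Hypothetical plugin interface that reports its own limits."""

    @abstractmethod
    def max_context_tokens(self) -> int:
        """Largest context window this provider's models support."""

    @abstractmethod
    def requests_per_minute(self) -> int:
        """Rate limit associated with this provider's API."""

def pick_provider(plugins: list[ModelProviderPlugin],
                  needed_tokens: int) -> ModelProviderPlugin:
    """Pick the smallest-context provider that can still fit the request."""
    for plugin in sorted(plugins, key=lambda p: p.max_context_tokens()):
        if plugin.max_context_tokens() >= needed_tokens:
            return plugin
    raise ValueError("no provider can handle a request of this size")
```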
To fix this issue, the batch summarization approach introduced by PR #4652 could also be applied to the summarize_text function in text.py.
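PR #4652's actual changes aren't reproduced here; the following is a generic map-reduce sketch of batch summarization, reusing chunk_by_tokens from the sketch above, with summarize_chunk standing in for a hypothetical single-chunk LLM call:

```python
def summarize_text_batched(text: str, max_chunk_tokens: int = 3000) -> str:
    """Summarize text that may exceed the model's context window."""
    chunks = chunk_by_tokens(text, max_tokens=max_chunk_tokens)
    if len(chunks) == 1:
        return summarize_chunk(chunks[0])  # hypothetical single-call helper
    partial_summaries = [summarize_chunk(chunk) for chunk in chunks]
    # If the combined summaries are still too long, recurse on them.
    return summarize_text_batched("\n".join(partial_summaries), max_chunk_tokens)
```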
gpt-3.5-turbo-16k is here.
This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days.
This issue was closed automatically because it has been stale for 10 days with no activity.
Duplicates
Summary 💡
There are a few bug reports close to this, but would it not make sense to get rid of the error
SYSTEM: Command get_text_summary returned: Error: This model's maximum context length is 4097 tokens. However, your messages resulted in 5113 tokens. Please reduce the length of the messages.
by simply swapping the model, just for that query, when the length of the query is below the limit of another model?
gpt-35-turbo's limit is 4096 tokens, whereas the token limits for gpt-4 and gpt-4-32k are 8192 and 32768 respectively. This could be implemented easily.
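A minimal sketch of that fallback, assuming the pre-1.0 openai Python client and tiktoken for counting; the limits table mirrors the numbers above, and the function and parameter names are invented here for illustration:

```python
import openai
import tiktoken

# Models ordered by context size; limits as cited in this issue.
MODEL_LIMITS = [
    ("gpt-3.5-turbo", 4096),
    ("gpt-4", 8192),
    ("gpt-4-32k", 32768),
]

def chat_with_fallback(messages: list[dict], reply_tokens: int = 512):
    """Use the smallest model whose context fits the prompt plus the reply."""
    encoding = tiktoken.get_encoding("cl100k_base")
    # Rough count: ignores the few extra tokens of per-message overhead.
    prompt_tokens = sum(len(encoding.encode(m["content"])) for m in messages)
    for model, limit in MODEL_LIMITS:
        if prompt_tokens + reply_tokens <= limit:
            return openai.ChatCompletion.create(model=model, messages=messages)
    raise ValueError(f"prompt of {prompt_tokens} tokens exceeds every model's limit")
```

Only the per-query model choice changes; the rest of the pipeline stays untouched.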
Examples 🌈
No response
Motivation 🔦
Everything that pulls a website page fails, as the webpages are generally too big. However, some are only slightly too big and could be run through a different model to downsize them first.