Fix Exception #21

yiranwu0 · 2024-09-02T02:11:10Z

Why are these changes needed?

Fix exceptions of non-OpenAI generations:

Remove meaningless exceptions that just catches the error and throw it. This also adds confusion to debugging when an error happens.
Remove retries: Some retries are also useless. If we catch any errors and throw it, the retry is not needed. If no error is caught, we don't need retry.

Fix Gemini retry:

Remove retry in GeminiClient. It should be in OpenAIWrapper.
OpenAIWrapper only catches OpenAI rate limits. This PR adds catches of google's rate limits so that different keys can be tried.

Future todo:

Add exception catches of non-openai models to OpenAIWrapper. Currently only gemini is added.
(Optional Feature): Add a wait time option for OpenAIWrapper to wait out the rate limits.

Related issue number

Checks

I've included any doc changes needed for https://autogen-ai.github.io/autogen/. See https://autogen-ai.github.io/autogen/docs/Contribute#documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

Hk669 · 2024-09-02T11:10:00Z

thanks @yiranwu0 for the changes. can you confirm if the updated code is working as expected with all the clients. have you tested it? if not @marklysze and i can give it a try.

cc @marklysze

marklysze · 2024-09-05T22:44:30Z

LGTM. @marklysze could you take a look too?

Yep, I'll have a look and test now.

marklysze · 2024-09-05T23:57:28Z

Thanks for raising this PR @yiranwu0, better bubbling of exceptions and centralised retries is definitely needed with all the client classes in place.

From what I understand from the code changes, the client-specific exceptions are added to client.py and then handled in the try/except around the client.create(params). Then it's logged and raised if it's the last client.

I was testing with Bedrock and checked out the exceptions that could be raised, and there's quite a lot of possible exceptions (boto and aws exceptions and click Click to see a full list of static exceptions). So, I was wondering whether, rather than trying to add each client's full list of exceptions into client.py and catch them, would it be possible to create either a generic ClientException class or an exception class in each client class (e.g. BedrockClientException, GeminiClientException) and then raise that, if we want to handle client-specific exceptions. Other options are to create our own set of exceptions for specific common scenarios, like ClientRateLimitException, and then we raise those specific ones from the client class and any others can be generic, like ClientException.

Amazon Bedrock is probably going to be the one with the most exceptions, though Anthropic with Bedrock may also have a few. Additionally, some exceptions may have the same name, so if we continue with the proposed approach then perhaps we should prefix them, e.g. InternalServerError becomes GeminiInternalServerError.

Let me know what you think...

yiranwu0 · 2024-09-07T18:07:48Z

Hello @marklysze, thank you for testing it!
I don't think we need to attend to all exceptions from different clients, we only want to catch the few "time rate limits" exception and handle that in client.py because we can try different configs. For other exceptions, I think we should raise the original exception, so that user can debug from that, or search online.

However, I think it is good if we can have all the rate limit exceptions wrapped in an exception of our own, so that we only need to catch one exception. But that needs more thinking and could be done in the next PR.

marklysze · 2024-09-08T08:51:29Z

Thanks @yiranwu0, sounds like a plan! I'll look at the rest of the client classes and test :)

…here stop error, removed try/except for Ollama

marklysze · 2024-09-09T00:22:44Z

Hey @yiranwu0, I've updated the code as follows:

Removed try/except from Ollama client
Added main exceptions for non-OpenAI clients into client.py
Prefixed exceptions with the client name, e.g. gemini_InternalServerError and anthorpic_InternalServerError, to avoid conflicts
Changed variables for exceptions in ImportError to be of type Exception instead of None as None can't be used in an except clause. Also moved the except clause to the end of the three exception blocks.
Tested non-function and function workflows with the following (to ensure they are still working): Anthorpic, Cohere, Groq, Mistral, Ollama, Together.AI. (I haven't tested Gemini).
Added a note for users to also install fix-busted-json if trying to use the Ollama client.

It caught exceptions when they arose. Though I can't test all exceptions.

yiranwu0 · 2024-09-10T01:36:13Z

Hello @marklysze, your change looks good! I already tested Gemini. If there are good, we can merge it!

marklysze · 2024-09-10T06:07:18Z

Hello @marklysze, your change looks good! I already tested Gemini. If there are good, we can merge it!

Okay, great... I'm good with it!

Hk669

looks good to me. Thanks @yiranwu0 and @marklysze

update

213c18d

yiranwu0 had a problem deploying to openai1 September 2, 2024 02:11 — with GitHub Actions Failure

update

35d06e6

yiranwu0 had a problem deploying to openai1 September 2, 2024 02:23 — with GitHub Actions Error

yiranwu0 had a problem deploying to openai1 September 2, 2024 02:23 — with GitHub Actions Failure

yiranwu0 had a problem deploying to openai1 September 2, 2024 02:23 — with GitHub Actions Error

yiranwu0 had a problem deploying to openai1 September 2, 2024 02:23 — with GitHub Actions Failure

yiranwu0 temporarily deployed to openai1 September 2, 2024 02:23 — with GitHub Actions Inactive

yiranwu0 had a problem deploying to openai1 September 2, 2024 02:23 — with GitHub Actions Failure

Hk669 requested review from marklysze and Hk669 September 2, 2024 11:10

sonichi requested a review from BeibinLi September 4, 2024 15:58

qingyun-wu had a problem deploying to openai1 September 5, 2024 21:39 — with GitHub Actions Failure

Merge branch 'main' into fixexception

ec662a2

yiranwu0 had a problem deploying to openai1 September 7, 2024 18:08 — with GitHub Actions Failure

Added exceptions for non-openai clients into clients.py, corrected co…

bc9dc3d

…here stop error, removed try/except for Ollama

marklysze had a problem deploying to openai1 September 9, 2024 00:18 — with GitHub Actions Failure

marklysze approved these changes Sep 11, 2024

View reviewed changes

Hk669 approved these changes Sep 11, 2024

View reviewed changes

sonichi merged commit 6712b64 into main Sep 13, 2024
146 of 153 checks passed

sonichi deleted the fixexception branch September 13, 2024 00:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Exception #21

Fix Exception #21

yiranwu0 commented Sep 2, 2024 •

edited

Loading

Hk669 commented Sep 2, 2024

marklysze commented Sep 5, 2024

marklysze commented Sep 5, 2024 •

edited

Loading

yiranwu0 commented Sep 7, 2024

marklysze commented Sep 8, 2024

marklysze commented Sep 9, 2024

yiranwu0 commented Sep 10, 2024

marklysze commented Sep 10, 2024

Hk669 left a comment

Fix Exception #21

Fix Exception #21

Conversation

yiranwu0 commented Sep 2, 2024 • edited Loading

Why are these changes needed?

Related issue number

Checks

Hk669 commented Sep 2, 2024

marklysze commented Sep 5, 2024

marklysze commented Sep 5, 2024 • edited Loading

yiranwu0 commented Sep 7, 2024

marklysze commented Sep 8, 2024

marklysze commented Sep 9, 2024

yiranwu0 commented Sep 10, 2024

marklysze commented Sep 10, 2024

Hk669 left a comment

Choose a reason for hiding this comment

yiranwu0 commented Sep 2, 2024 •

edited

Loading

marklysze commented Sep 5, 2024 •

edited

Loading