Auto Recovery Logic for chat completion clients on different types of server errors #3632

ekzhu · 2024-10-03T22:20:53Z

What feature would you like to be added?

It would be nice instead if we can support a model client that auto-recovers from various host-related errors with a configurable logic such as retries.

Here is a good example of an apparently transient error:

openai.APIStatusError: Error code: 424 - {'error': {'message': 'Error occurred while processing image(s).', 'type': 'failed_dependency', 'param': None, 'code': None}}

Moreover, even for rate limit errors like below, it doesn't always retry, there is some inconsistency happening

openai.RateLimitError: Error code: 429 - {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 1 seconds.'}}

Why is this needed?

Currently, the implementation of the chat completion client for OpenAI fails to recover and crashes for a host of errors (except for retries that are rate-limit related where it auto-retries), however, a host of other errors randomly occur and that forces the applications built on top of the client to have their own logic for auto-recovery.

The text was updated successfully, but these errors were encountered:

ekzhu · 2024-10-03T22:21:23Z

@husseinmozannar Moved your issue here.

ekzhu · 2025-01-28T01:11:12Z

Just encountered the rate limiting error again, I think it would be good to have a retry logic built into the client itself. it's a pain to always have to write the code around the client.

ekzhu added enhancement proj-core labels Oct 3, 2024

github-actions bot added the needs-triage label Oct 3, 2024

ekzhu removed the needs-triage label Oct 3, 2024

ekzhu added the proj-extensions label Oct 3, 2024

jackgerrits mentioned this issue Oct 7, 2024

[Feature Request]: Retry llm requests on server failures #3609

Closed

rysweet added this to the future milestone Oct 22, 2024

jackgerrits modified the milestones: future, 0.4 Oct 22, 2024

fniedtner removed the feature label Oct 24, 2024

jackgerrits added proj-extensions size-small takes 1-2 days size-medium takes up to a week and removed proj-core proj-extensions size-small takes 1-2 days labels Oct 24, 2024

fniedtner modified the milestones: 0.4, 0.4.1 Oct 24, 2024

jackgerrits modified the milestones: 0.4.1, 0.4.x Jan 13, 2025

ekzhu added the help wanted Extra attention is needed label Jan 28, 2025

ekzhu mentioned this issue Feb 18, 2025

When using the "AzureOpenAIChatCompletionClient", a user triggering a content violation policy causes an exception within Autogen 0.4.6 that cannot be caught #5569

Open

ekzhu changed the title ~~Auto Recovery Logic for chat completion clients~~ Auto Recovery Logic for chat completion clients on different types of server errors Feb 18, 2025

ekzhu linked a pull request Feb 21, 2025 that will close this issue

[DRAFT] Add OpenAI Client Error Handler #5615

Draft

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto Recovery Logic for chat completion clients on different types of server errors #3632

Auto Recovery Logic for chat completion clients on different types of server errors #3632

ekzhu commented Oct 3, 2024 •

edited

Loading

ekzhu commented Oct 3, 2024

ekzhu commented Jan 28, 2025

Auto Recovery Logic for chat completion clients on different types of server errors #3632

Auto Recovery Logic for chat completion clients on different types of server errors #3632

Comments

ekzhu commented Oct 3, 2024 • edited Loading

What feature would you like to be added?

Why is this needed?

ekzhu commented Oct 3, 2024

ekzhu commented Jan 28, 2025

ekzhu commented Oct 3, 2024 •

edited

Loading