fix: rate limit retry with exponential backoff #86
Conversation
Summary by GPT-4
I have created a new utility function called `_retry_with_exponential_backoff` in a new file named `utils.py`. This function takes the current retry count and an optional `retry_after` parameter. It calculates the wait time using exponential backoff and sleeps for that duration before retrying.
Here's the code for the new `utils.py` file:
"""Utility functions"""
import logging
import time
from typing import Optional
import gpt_review.constants as C
def _retry_with_exponential_backoff(current_retry: int, retry_after: Optional[str]) -> None:
"""
Use exponential backoff to retry a request after specific time while staying under the retry count
Args:
current_retry (int): The current retry count.
retry_after (Optional[str]): The time to wait before retrying.
"""
logging.warning("Call to GPT failed due to rate limit, retry attempt %s of %s", current_retry, C.MAX_RETRIES)
multiplication_factor = 2 * (1 + current_retry / C.MAX_RETRIES)
wait_time = int(retry_after) * multiplication_factor if retry_after else current_retry * multiplication_factor
logging.warning("Waiting for %s seconds before retrying.", wait_time)
time.sleep(wait_time)
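For a rough sense of the delays this produces, here is a small illustration. The `MAX_RETRIES` value of 3 and the `Retry-After` value of `"5"` are assumptions for the example only; the real constant comes from `gpt_review.constants`:

```python
# Illustration only: MAX_RETRIES = 3 and Retry-After = "5" are assumed values.
MAX_RETRIES = 3

for current_retry in range(MAX_RETRIES):
    multiplication_factor = 2 * (1 + current_retry / MAX_RETRIES)
    wait_time = int("5") * multiplication_factor
    print(f"retry {current_retry}: wait ~{wait_time:.2f}s")

# retry 0: wait ~10.00s
# retry 1: wait ~13.33s
# retry 2: wait ~16.67s
```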
And here's how I modified the `_call_gpt` function in `_openai.py`:
```python
from gpt_review.utils import _retry_with_exponential_backoff


def _call_gpt(
    # ...
):
    try:
        # ...
        return completion.choices[0].message.content  # type: ignore
    except RateLimitError as error:
        if retry < C.MAX_RETRIES:
            _retry_with_exponential_backoff(retry, error.headers["Retry-After"])
            return _call_gpt(prompt, temperature, max_tokens, top_p, frequency_penalty, presence_penalty, retry + 1)
        raise RateLimitError("Retry limit exceeded") from error
```
This change should make it easier to manage retries with exponential backoff.
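For reference, here is a minimal caller-side sketch of what exhausting the retries looks like. The `openai.error` import path and the argument values are assumptions for the example, not part of this PR:

```python
from openai.error import RateLimitError  # assumed import path (openai < 1.0 style)

try:
    review = _call_gpt(
        prompt="Review this diff",
        temperature=0.0,
        max_tokens=500,
        top_p=1.0,
        frequency_penalty=0.0,
        presence_penalty=0.0,
    )
except RateLimitError:
    # Only raised after C.MAX_RETRIES attempts have been exhausted.
    review = "GPT review unavailable: rate limit exceeded."
```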
Suggestions
The changes in this PR look good overall. However, I have a few suggestions to improve the code:
- In the `_retry_with_exponential_backoff` function, it would be better to use `float` instead of `int` for the `wait_time`. This is because the multiplication factor can result in non-integer values, and using `int` might truncate the value (see the short illustration after this list).
- Add a docstring for the `_call_gpt` function to provide more context about its purpose and usage.
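To make the truncation point concrete, here are assumed numbers with `MAX_RETRIES` taken as 3 and a `Retry-After` of 5 seconds:

```python
multiplication_factor = 2 * (1 + 1 / 3)  # current_retry = 1, MAX_RETRIES assumed to be 3

wait_as_float = 5 * multiplication_factor      # 13.333... seconds
wait_as_int = int(5 * multiplication_factor)   # 13 seconds -- the fractional part is truncated
```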
Here's an updated version of the code with these suggestions:
```python
def _retry_with_exponential_backoff(current_retry: int, retry_after: Optional[str]) -> None:
    """
    Use exponential backoff to retry a request after specific time while staying under the retry count

    Args:
        current_retry (int): The current retry count.
        retry_after (Optional[str]): The time to wait before retrying.
    """
    logging.warning("Call to GPT failed due to rate limit, retry attempt %s of %s", current_retry, C.MAX_RETRIES)
    multiplication_factor = 2 * (1 + current_retry / C.MAX_RETRIES)
    wait_time = float(retry_after) * multiplication_factor if retry_after else current_retry * multiplication_factor
    logging.warning("Waiting for %s seconds before retrying.", wait_time)
    time.sleep(wait_time)


def _call_gpt(
    prompt: str,
    temperature: float,
    max_tokens: int,
    top_p: float,
    frequency_penalty: float,
    presence_penalty: float,
    retry: int = 0,
) -> str:
    """
    Call GPT with given parameters and handle rate limit errors with retries.

    Args:
        prompt (str): The input prompt for GPT.
        temperature (float): Sampling temperature.
        max_tokens (int): Maximum number of tokens in response.
        top_p (float): Nucleus sampling parameter.
        frequency_penalty (float): Penalty for frequent tokens.
        presence_penalty (float): Penalty for repetitive tokens.
        retry (int, optional): Current number of retries. Defaults to 0.

    Returns:
        str: The generated text from GPT.

    Raises:
        RateLimitError: If the maximum number of retries is exceeded.
    """
    # Rest of the function implementation
```
Codecov Report
```diff
@@           Coverage Diff            @@
##             main       #86   +/-   ##
=========================================
  Coverage   100.00%   100.00%
=========================================
  Files           13        14    +1
  Lines          462       469    +7
  Branches        67        67
=========================================
+ Hits           462       469    +7
```
Flags with carried forward coverage won't be shown.
Description
Testing
Additional context