Add AI Model Fallback Mechanism for Improved Reliability #901

jelveh · 2024-11-22T01:51:23Z

Well, we just got banned by Anthropic without any warning, which highlights how important it is to be resilient when relying on third-party services.

Currently, when an AI model request fails (e.g., due to provider errors), the entire request fails. We should implement an automatic fallback mechanism to improve reliability and ensure requests succeed whenever possible.

Current Behavior

- When a model provider (e.g., OpenAI) returns an error, the request fails completely
- There is no automatic fallback to alternative models
- Users experience complete failure rather than degraded service

Proposed Solution
Implement an automatic fallback chain for AI model requests:

1. Try the requested model first (e.g., claude-3-5-sonnet)
2. If that fails, fall back to gpt-4o-mini
3. If gpt-4o-mini fails, fall back to llama
4. Only fail completely if all fallback options are exhausted
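
To make the proposed chain concrete, here is a minimal sketch of how it could work. This is not the actual Puter implementation: `chatWithFallback`, `callModel`, and `FALLBACK_MODELS` are hypothetical names, the model IDs in the list are placeholders, and throwing an `AggregateError` when the whole chain fails is just one possible choice.

```javascript
// Hypothetical fallback order; the exact model IDs are an assumption.
const FALLBACK_MODELS = ["gpt-4o-mini", "llama"];

// Try the requested model, then each fallback in order.
// `callModel(model, prompt)` stands in for the real provider call and
// must return a promise that resolves to the response text.
async function chatWithFallback(callModel, prompt, options = {}) {
  const { model, disableFallback = false } = options;

  // With fallback disabled, only the requested model is attempted.
  const candidates = disableFallback
    ? [model]
    : [model, ...FALLBACK_MODELS.filter((m) => m !== model)];

  const errors = [];
  for (const candidate of candidates) {
    try {
      const text = await callModel(candidate, prompt);
      // Report which model actually handled the request.
      return { text, usedModel: candidate };
    } catch (error) {
      errors.push(error);
    }
  }

  // A single attempt rethrows the original provider error unchanged.
  if (errors.length === 1) throw errors[0];
  // Otherwise every model in the chain failed; keep all errors for debugging.
  throw new AggregateError(errors, "All models in the fallback chain failed");
}
```

Passing the provider call in as a function keeps the chain logic independent of any one provider SDK, which also makes it easy to test with a stub that fails on purpose.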

Implementation Details

Response Enhancement:
- Add a new response attribute (e.g., usedModel) indicating which model actually handled the request
- This allows clients to know if a fallback was used

Configuration Option:

Add a new option to disable fallback behavior: disableFallback (default: false)
Example usage:

puter.ai.chat("Hello!", {
  model: "claude-3-5-sonnet",
  disableFallback: true  // Disable automatic fallback
})

Fallback Chain:
- Define clear fallback hierarchy
- Consider timeout/retry logic before moving to next fallback
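
As a rough illustration of the timeout/retry point, each step in the chain could be wrapped like this before moving on to the next model. The helper names (`withTimeout`, `attemptWithRetry`) and the default values are assumptions for the sake of the example, not part of any existing API.

```javascript
// Reject if `promise` does not settle within `ms` milliseconds.
// Note: this only stops waiting; it does not cancel the underlying request.
function withTimeout(promise, ms) {
  let timer;
  const timeout = new Promise((_, reject) => {
    timer = setTimeout(() => reject(new Error(`Timed out after ${ms} ms`)), ms);
  });
  // Clear the timer either way so it does not keep the process alive.
  return Promise.race([promise, timeout]).finally(() => clearTimeout(timer));
}

// Call `fn` up to `attempts` times, bounding each call by `timeoutMs`.
// If every attempt fails, the last error is thrown so the caller can
// move on to the next model in the chain.
async function attemptWithRetry(fn, { attempts = 2, timeoutMs = 10000 } = {}) {
  let lastError;
  for (let i = 0; i < attempts; i++) {
    try {
      return await withTimeout(fn(), timeoutMs);
    } catch (error) {
      lastError = error;
    }
  }
  throw lastError;
}
```

A real implementation would probably want to retry only on transient errors (timeouts, rate limits) and skip straight to the next model on permanent ones, such as an account ban.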

Expected Behavior

// Example 1: With fallback enabled (default)
const response = await puter.ai.chat("Hello!", { model: "claude-3-5-sonnet" });
console.log(response.text);         // The response text
console.log(response.usedModel);    // e.g., "gpt-4o-mini" if Claude failed

// Example 2: With fallback disabled
try {
  const response = await puter.ai.chat("Hello!", {
    model: "claude-3-5-sonnet",
    disableFallback: true
  });
} catch (error) {
  // Original error from Claude is thrown
}

jelveh added the enhancement, good first issue, and help wanted labels Nov 22, 2024

jelveh assigned KernelDeimos Nov 22, 2024

jelveh removed the good first issue and help wanted labels Nov 22, 2024
