You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Yes, you can run the Llama model using vLLM on an OpenAI-compatible server. This will allow you to call the API just as our code does, enabling compatibility with Llama.
Hello there!
I was trying to use your code to run some test.
Deprecations aside, I see that it is all coded to run with OpenAI models.
Is there a quick way to switch to other models, like for example Llama?
The text was updated successfully, but these errors were encountered: