Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
FastAPI released a new version last week that conflicts with typing-extensions and typing-inspect versions that are needed to run correctly on docker (at least for MPT models).
In order to make vLLM work properly I needed to create a dockerfile like this
I saw several issues reporting the same behavior so I decided to open this PR pinning the versions that we currently need to make everything work properly until we can make sure that the entire code works properly with the newest versions of typing extensions, typing inspect and fastapi.