[CI/Build] Avoid downloading all HF files in RemoteOpenAIServer #7836
Conversation
👋 Hi! Thank you for contributing to the vLLM project. Once the PR is approved and ready to go, please make sure to run full CI, as it is required to merge (or just use auto-merge).
Nice fix
tests/utils.py (Outdated)
```python
self.host = str(args.host or 'localhost')
self.port = int(args.port)

# download the model before starting the server to avoid timeout
engine_args = AsyncEngineArgs.from_cli_args(args)
engine_config = engine_args.create_engine_config()
```
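For context, a rough sketch of what "download the model without pulling every repo file" can look like, continuing from the snippet above. The `allow_patterns` values and the `engine_config.model_config.model` attribute path are assumptions for illustration, not necessarily what this PR ends up using:

```python
# Hypothetical sketch: pre-fetch only the files the server needs,
# rather than the entire HF repo. Pattern list is illustrative.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id=engine_config.model_config.model,  # assumed attribute path
    allow_patterns=["*.json", "*.txt", "*.model", "*.safetensors"],
)
```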
Can we directly create a load config object? It should be simple, in my opinion: just use load format auto.
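A minimal sketch of what that suggestion might look like, assuming vLLM's `LoadConfig` class with a `load_format` field (the exact constructor arguments are an assumption):

```python
# Hypothetical sketch of the reviewer's suggestion: build a LoadConfig
# directly instead of deriving a full engine config just for the download.
from vllm.config import LoadConfig

load_config = LoadConfig(load_format="auto")
```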
Let's fix the tests first before simplifying this.
I'm not that familiar with the model loading code, so I may need some help with fixing the tests.
tests/utils.py (Outdated)
```diff
@@ -60,36 +61,40 @@ class RemoteOpenAIServer:

     def __init__(self,
                  model: str,
-                 cli_args: List[str],
+                 serve_args: List[str],
```
Why rename the arg, btw?
I meant to associate them with `vllm serve` specifically, not just any CLI. Perhaps it could be clearer.

Edit: Updated the name to `vllm_serve_args`.
```python
parser = FlexibleArgumentParser(
    description="vLLM's remote OpenAI server.")
parser = make_arg_parser(parser)
args = parser.parse_args(cli_args)
```
I think you just need `args = parser.parse_args(["--model", model, *cli_args])`
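Put together, the parsing then looks roughly like this. This is a sketch assuming vLLM's `FlexibleArgumentParser` and `make_arg_parser` as used in tests/utils.py at the time; `parse_serve_args` is a hypothetical helper name, since in the PR this logic lives directly in `RemoteOpenAIServer.__init__`:

```python
from typing import List

from vllm.entrypoints.openai.cli_args import make_arg_parser
from vllm.utils import FlexibleArgumentParser


def parse_serve_args(model: str, vllm_serve_args: List[str]):
    # Hypothetical helper mirroring RemoteOpenAIServer.__init__.
    parser = FlexibleArgumentParser(
        description="vLLM's remote OpenAI server.")
    parser = make_arg_parser(parser)
    # Prepend "--model" so callers only pass the remaining serve flags.
    return parser.parse_args(["--model", model, *vllm_serve_args])
```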
Thanks for the fix!
[CI/Build] Avoid downloading all HF files in RemoteOpenAIServer (vllm-project#7836)
Signed-off-by: Alvant <[email protected]>
Fixes #7834 (comment)