
[CI/Build] Split up models tests #10069

Merged: 13 commits from shard-models-tests into main on Nov 9, 2024
Conversation

@DarkLight1337 (Member) commented Nov 6, 2024

To reduce the impact of flakiness (e.g. connection failures), this PR splits up the model tests into groups.

I have also fixed VLM test failures that were introduced by #8346 and #9983.

github-actions bot commented Nov 6, 2024

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, they only run the fastcheck CI, which runs a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of it by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

  • Add the ready label to the PR
  • Enable auto-merge.

🚀

@mergify bot added the ci/build label Nov 6, 2024
@DarkLight1337 (Member, Author) commented:

It looks like sharding doesn't work over parameter combinations.
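For context, here is a minimal, hypothetical illustration of the issue (the model names and parameters are placeholders, not taken from this PR): pytest.mark.parametrize multiplies one test function into many cases at collection time, yet they all live in one file, so a sharder that splits work at the file level cannot spread those cases across shards.

```python
# Hypothetical illustration only -- not code from this PR.
import pytest

@pytest.mark.parametrize("model", ["model-a", "model-b", "model-c"])
@pytest.mark.parametrize("dtype", ["half", "float"])
def test_model(model: str, dtype: str) -> None:
    # 3 models x 2 dtypes = 6 collected test cases, but a sharder that
    # splits by file still places all 6 on the same shard.
    assert model and dtype
```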

@DarkLight1337 marked this pull request as draft November 6, 2024 08:04
@DarkLight1337 marked this pull request as ready for review November 9, 2024 08:17
@DarkLight1337 changed the title from [CI/Build] Shard models tests to [CI/Build] Split up models tests Nov 9, 2024
@DarkLight1337 added the ready label (ONLY add when PR is ready to merge/full CI is needed) Nov 9, 2024
@DarkLight1337 (Member, Author) commented:

It would be great if we could get this in before the next nightly!

mergify bot commented Nov 9, 2024

This pull request has merge conflicts that must be resolved before it can be merged. Please rebase the PR, @DarkLight1337.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify bot added the needs-rebase label Nov 9, 2024
@mergify bot removed the needs-rebase label Nov 9, 2024
@youkaichao (Member) left a comment:

Thanks for the fix!

@youkaichao merged commit 51c2e1f into main Nov 9, 2024
69 of 70 checks passed
@youkaichao deleted the shard-models-tests branch November 9, 2024 19:39
@youkaichao (Member) commented:

I merged it to fix the test on main ASAP. I also left some comments.

```python
self.language_model = PersimmonForCausalLM(config.text_config,
                                           cache_config=cache_config,
                                           quant_config=quant_config)
```
@youkaichao (Member) commented on this snippet:

It would be better if this could use init_vllm_registered_model.

I think this config.text_config is missing an architectures field. We can manually add the field and then call init_vllm_registered_model.

We also need to change the signature of init_vllm_registered_model in the future.
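For illustration, a rough sketch of this suggestion; the exact signature of init_vllm_registered_model is an assumption here, not something confirmed in this thread:

```python
# Hypothetical sketch; the signature of init_vllm_registered_model is assumed.
config.text_config.architectures = ["PersimmonForCausalLM"]  # add the missing field
self.language_model = init_vllm_registered_model(
    config.text_config,
    cache_config=cache_config,
    quant_config=quant_config,
)
```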

@DarkLight1337 (Member, Author) replied:

It has model_type defined, so it should still be possible to match it if we add this model to the registry.

Comment on lines +2082 to +2086
```python
def with_hf_config(self, hf_config: PretrainedConfig) -> "VllmConfig":
    model_config = copy.deepcopy(self.model_config)
    model_config.hf_config = hf_config

    return replace(self, model_config=model_config)
```
@youkaichao (Member) commented:

We should flatten VllmConfig in the future; hf_config can be at the same level as model_config, and we can have a top-level function vllm_config.replace_with(hf_config=hf_config).
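A hypothetical sketch of that flattened layout (not the current vLLM API):

```python
# Hypothetical sketch of the proposed flattening -- not current vLLM code.
from dataclasses import dataclass, replace
from typing import Any, Optional

@dataclass
class VllmConfig:
    model_config: Optional[Any] = None
    hf_config: Optional[Any] = None  # lifted to the same level as model_config

    def replace_with(self, **kwargs: Any) -> "VllmConfig":
        # Return an updated copy rather than mutating in place.
        return replace(self, **kwargs)

# Usage under that assumption:
# vllm_config = vllm_config.replace_with(hf_config=hf_config)
```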

@DarkLight1337 (Member, Author) replied:

Some values in model_config depend on hf_config, so it might be better to just add a convenience property to access hf_config directly from VllmConfig instead of changing the nested structure.
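For example, a minimal sketch of such a property (hypothetical, keeping the existing nesting):

```python
# Hypothetical convenience property on VllmConfig -- not code from this PR.
@property
def hf_config(self) -> PretrainedConfig:
    # Delegate to the nested model_config so callers don't need to know
    # about the nesting.
    return self.model_config.hf_config
```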

omer-dayan pushed a commit to omer-dayan/vllm that referenced this pull request Nov 10, 2024
JC1DA pushed a commit to JC1DA/vllm that referenced this pull request Nov 11, 2024
jeejeelee pushed a commit to jeejeelee/vllm that referenced this pull request Nov 11, 2024
rickyyx pushed a commit to rickyyx/vllm that referenced this pull request Nov 13, 2024
sumitd2 pushed a commit to sumitd2/vllm that referenced this pull request Nov 14, 2024