Add tests for Whisper static pipeline #1250
base: master
Conversation
Force-pushed from a1bab59 to 48cb438
With transformers version 4.46.3, the encoder model has a different dynamic shape for input_features; fix for StaticWhisperPipeline - PR #1293
Force-pushed from e697a90 to 3bab16c
@pytest.mark.parametrize("model_descr", get_whisper_models_list(tiny_only=True))
@pytest.mark.parametrize("test_sample",
    [
        # *get_samples_from_dataset(language="fr", length=2), # 1/2 failed
What failed? Do we have a ticket for this?
For one test (with Spanish language), there's a mismatch between the expected and actual output (it looks like the language is not detected correctly):
expected: Habritan aguas poco profundas y lo cosas.
actual_out: Habt ihr da noch was poco perfundes und lohosen?
For one test (with French language), there's an error:
RuntimeError: Check '*roi_end <= *max_dim' failed at src\inference\src\dev\make_tensor.cpp:34
I will create tickets for the found failures.
Created?
It's the same issue as in the ticket about the difference between NPU and CPU outputs. With fix #1469 it is not reproduced.
Force-pushed from 2fdac12 to 0e11d54
It seems like everything can be covered by test_static_whisper_generation_compare_with_cpu with different inputs (@pytest.mark.parametrize).
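The suggested consolidation relies on stacked `@pytest.mark.parametrize` decorators taking the cartesian product of their arguments. A minimal stdlib sketch of that expansion (the model and sample names below are placeholders, not the actual test-suite helpers):

```python
import itertools

# Stacked @pytest.mark.parametrize decorators generate one test case per
# (model, sample) combination, so a single comparison test parametrized
# over all language samples can replace the per-language test functions.
models = ["whisper-tiny"]                   # e.g. get_whisper_models_list(tiny_only=True)
samples = ["de_0", "de_1", "de_2", "fr_0"]  # e.g. samples for several languages

# Each pair becomes an independent test case with its own id.
cases = list(itertools.product(models, samples))
print(len(cases))  # 4 generated test cases
```

This keeps one test body while still exercising every language sample as a separate, individually reported case.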
    "test_sample", get_samples_from_dataset(language="de", length=3)
)
@pytest.mark.precommit
def test_static_whisper_language_de(model_descr, test_sample):
What does it actually check? How is it different from test_static_whisper_autodetect?
Here we explicitly set the language in the config; in test_static_whisper_autodetect, an additional infer request is first made to detect the language of the audio.
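The difference between the two test paths can be sketched schematically (a simplified model of the control flow, not the actual pipeline code; the function and call names are illustrative):

```python
# Schematic model (assumed, simplified): with an explicit language the
# pipeline goes straight to decoding, while autodetect first issues an
# extra infer request to identify the language of the audio.
def generate(sample, language=None):
    infer_calls = []
    if language is None:
        infer_calls.append("detect_language")  # the additional infer request
        language = "de"                        # pretend detection result
    infer_calls.append("decode")
    return language, infer_calls

# test_static_whisper_language_de path: language set explicitly in config
print(generate("sample.wav", language="de"))
# test_static_whisper_autodetect path: language detected first
print(generate("sample.wav"))
```

So the two tests cover different code paths even though the assertions afterwards look similar.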
    "test_sample", get_samples_from_dataset(language="fr", length=3)
)
@pytest.mark.precommit
def test_static_whisper_language_fr(model_descr, test_sample):
Same question: how is it different from test_static_whisper_autodetect?
Force-pushed from 3482704 to 09e1a99
Tested locally, all 17 tests passed with changes from #1469.
Agree, the tests' code is similar, but I decided to keep them separate as they cover different cases/functionality of the static Whisper pipeline.
No description provided.