StaticLLMPipeline: Update config #969

TolyaTalamanov · 2024-10-15T08:03:41Z

No description provided.

…genai into at/update-static-llm-pipeline-config

….com/TolyaTalamanov/openvino.genai into at/update-static-llm-pipeline-config

dmatveev

okay-ish for now but, you now, this never ends

dmatveev · 2024-10-15T17:24:50Z

src/cpp/src/llm_pipeline_static.cpp

    ov::AnyMap config = {
-        { "NPU_USE_NPUW", "YES" },
+        { "NPU_COMPILATION_MODE_PARAMS", "compute-layers-with-higher-precision=Sqrt,Power,ReduceMean,Add_RMSNorm" },


So do you remember this Add vs Add_RMSNorm problem?

Yes, it should be just Add perhaps

dmatveev · 2024-10-15T18:09:03Z

src/cpp/src/llm_pipeline_static.cpp

+                                     const std::optional<NPUDesc>& desc) {
+    auto config = get_baseline_common_config();
+    if (desc.has_value() && desc->support_max_mem_alloc_size) {
+        config.emplace("NPUW_PMM", "NO");


Disabled PMM can be in the base (common) config I believe

dmatveev · 2024-10-15T18:09:56Z

src/cpp/src/llm_pipeline_static.cpp

+ov::AnyMap get_default_common_config(const std::shared_ptr<ov::Model>& model,
+                                     const std::optional<NPUDesc>& desc) {
+    auto config = get_baseline_common_config();
+    if (desc.has_value() && desc->support_max_mem_alloc_size) {


Also, is this check enough? shouldn't you check for the actual value?

This part is removed, will be added in next PRs

dmatveev · 2024-10-15T18:11:11Z

src/cpp/src/llm_pipeline_static.cpp

+    auto config = get_baseline_common_config();
+    if (desc.has_value() && desc->support_max_mem_alloc_size) {
+        config.emplace("NPUW_PMM", "NO");
+        config.emplace("NPUW_FUNCALL_FOR_ALL", "YES");


Once FCFA is on, the PARALLEL_COMPILE can be ON for both models

Will take it into account for the next PR that enables FCFA, thanks!

dmatveev

So for it and merge., Tomorrow there will be another one. :D

This reverts commit 9c0fb7b.

TolyaTalamanov added 2 commits October 11, 2024 12:54

Refactor config creation

Unverified

This user has not yet uploaded their public signing key.

GPG key ID: 3FCC32880193C153

Learn about vigilant mode

44acd45

Update StaticLLMPipeline config

e5863f4

ilya-lavrenov added the category: LLM label Oct 15, 2024

ilya-lavrenov added this to the 2024.5 milestone Oct 15, 2024

ilya-lavrenov added the Code Freeze label Oct 15, 2024

TolyaTalamanov added 4 commits October 15, 2024 09:46

Merge branch 'master' of https://github.com/openvinotoolkit/openvino.…

e3d928e

…genai into at/update-static-llm-pipeline-config

Update llm_pipeline_static.cpp

63e9482

Optimize config based on driver provided

9c0fb7b

Merge branch 'at/update-static-llm-pipeline-config' of https://github…

52d77d9

….com/TolyaTalamanov/openvino.genai into at/update-static-llm-pipeline-config

TolyaTalamanov requested a review from dmatveev October 15, 2024 13:49

dmatveev reviewed Oct 15, 2024

View reviewed changes

dmatveev approved these changes Oct 15, 2024

View reviewed changes

TolyaTalamanov added 2 commits October 16, 2024 10:57

Revert "Optimize config based on driver provided"

4a58a61

This reverts commit 9c0fb7b.

Update config

3d56a96

TolyaTalamanov added this pull request to the merge queue Oct 16, 2024

Merged via the queue into openvinotoolkit:master with commit be23fc6 Oct 16, 2024
48 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

StaticLLMPipeline: Update config #969

StaticLLMPipeline: Update config #969

TolyaTalamanov commented Oct 15, 2024

dmatveev left a comment

dmatveev Oct 15, 2024

TolyaTalamanov Oct 16, 2024

dmatveev Oct 15, 2024

TolyaTalamanov Oct 16, 2024

dmatveev Oct 15, 2024

TolyaTalamanov Oct 16, 2024

dmatveev Oct 15, 2024

TolyaTalamanov Oct 16, 2024

dmatveev left a comment

StaticLLMPipeline: Update config #969

StaticLLMPipeline: Update config #969

Conversation

TolyaTalamanov commented Oct 15, 2024

dmatveev left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dmatveev left a comment

Choose a reason for hiding this comment