Skip to content

Commit

Permalink
Replace 'CACHE_DIR' with 'NPUW_CACHE_DIR' in StatefulLLMPipeline (#1489)
Browse files Browse the repository at this point in the history
Handle `CACHE_DIR` in `StatefulLLMPipeline` the same way as in
`StatelessLLMPipeline`
  • Loading branch information
AsyaPronina authored Jan 7, 2025
1 parent eb6e556 commit 73fd564
Showing 1 changed file with 4 additions and 1 deletion.
5 changes: 4 additions & 1 deletion src/cpp/src/llm_pipeline_static.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -739,7 +739,10 @@ std::shared_ptr<ov::CompiledModel> StatefulLLMPipeline::setupAndCompileModel(

rename_key(pipeline_config, "PREFILL_CONFIG", "NPUW_LLM_PREFILL_CONFIG");
rename_key(pipeline_config, "GENERATE_CONFIG", "NPUW_LLM_GENERATE_CONFIG");


// Replace CACHE_DIR option if NPUW is enabled
set_npuw_cache_dir(pipeline_config);

return std::make_shared<ov::CompiledModel>(genai::utils::singleton_core().compile_model(model, "NPU", pipeline_config));
}

Expand Down

0 comments on commit 73fd564

Please sign in to comment.