Log model properties #677

rasapala · 2024-07-24T15:16:10Z

Adding possibility to get and log compiled model properties.
Very useful for benchmarking and comparing settings on different hardware.

Example output:
Loaded model configuration:
AFFINITY: CORE
CPU_DENORMALS_OPTIMIZATION: NO
CPU_SPARSE_WEIGHTS_DECOMPRESSION_RATE: 1
DYNAMIC_QUANTIZATION_GROUP_SIZE: 32
ENABLE_CPU_PINNING: YES
ENABLE_HYPER_THREADING: NO
EXECUTION_DEVICES: CPU
EXECUTION_MODE_HINT: PERFORMANCE
INFERENCE_NUM_THREADS: 24
INFERENCE_PRECISION_HINT: f32
KV_CACHE_PRECISION: f16
LOG_LEVEL: LOG_NONE
MODEL_DISTRIBUTION_POLICY:
NETWORK_NAME: Model0
NUM_STREAMS: 1
OPTIMAL_NUMBER_OF_INFER_REQUESTS: 1
PERFORMANCE_HINT: LATENCY
PERFORMANCE_HINT_NUM_REQUESTS: 0
PERF_COUNT: YES
SCHEDULING_CORE_TYPE: ANY_CORE

src/cpp/src/debug_utils.hpp

src/cpp/src/continuous_batching_pipeline.cpp

ilya-lavrenov · 2024-08-02T11:52:29Z

src/cpp/include/openvino/genai/continuous_batching_pipeline.hpp

@@ -56,6 +56,10 @@ class OPENVINO_GENAI_EXPORTS ContinuousBatchingPipeline {

    PipelineMetrics get_metrics() const;

+    std::vector<std::string> get_model_configuration();
+
+    void print_model_configuration();


I think that instead of adding debug API to public API, we can either:

Enable compiled time cmake option to print some debug information.

Or do it via environment variable. In this case we need to ensure that debug info handling does not affect performance.

It can be useful for timers which are currently profile each iteration / step. I vote for cmake option with debug information.

I have moved the functionality to utils as we want to have it every time we use cb_pipeline.

ilya-lavrenov · 2024-08-19T15:47:54Z

src/cpp/include/openvino/genai/continuous_batching_pipeline.hpp

@@ -61,6 +61,8 @@ class OPENVINO_GENAI_EXPORTS ContinuousBatchingPipeline {
    GenerationHandle add_request(uint64_t request_id, const ov::Tensor& input_ids, const ov::genai::GenerationConfig& sampling_params);
    GenerationHandle add_request(uint64_t request_id, const std::string& prompt, const ov::genai::GenerationConfig& sampling_params);

+    std::string get_model_configuration_string();


I'm against of adding API method for debug purposes.

Can we enable it via env var?

Env var would be problematic. Maybe we can add a flag in the constructor that is set to false by default ?

why is it problematic? once model is compiled, we can dump this information based on env var.

Because we do not handle ENV variables in OVMS runtime, this is extra capability reserved only for CLOUD services for example secrets or endpoints. We can add a graph llm options parameter, then we can pass it to the constructor.

I think we should store loaded model configuration and provide getter for it.
In current shape it indeed looks like a debug method as the string it returns is not very useful except for printing.
I would change it to return a map with properties represented as {key, value}. This way user would be able to conveniently check model configuration and possibly make decisions in their own applications based on value of certain properties. It might also be useful for UX and monitoring purposes.
The full_log flag in pipeline constructor is unnecessary in my opinion. I think we should store configuration either way.

I would consider this method as informative so not only for debugging purposes. The output of the method in a form of vector of strings can be used for various purposes on the client application side just like such API is in OV Runtime.
Introducing extra env variable OV_CB_FULL_LOG to mute the output of the method is in my opinion totally useless. I suggest to keep the same output from get_model_configuration(). It will be the client application to use it or not.
It will be used in OVMS to track the configuration of the model for monitoring and benchmarking purposes.

andrei-kochin requested a review from Wovchena July 24, 2024 21:03

Wovchena requested changes Jul 25, 2024

View reviewed changes

src/cpp/src/debug_utils.hpp Outdated Show resolved Hide resolved

src/cpp/src/continuous_batching_pipeline.cpp Outdated Show resolved Hide resolved

src/cpp/src/continuous_batching_pipeline.cpp Outdated Show resolved Hide resolved

ilya-lavrenov self-assigned this Jul 31, 2024

ilya-lavrenov requested changes Aug 2, 2024

View reviewed changes

ilya-lavrenov marked this pull request as draft August 7, 2024 07:39

rasapala force-pushed the log_model_properties2 branch from 951d09e to 9a1b7a9 Compare August 19, 2024 12:34

rasapala marked this pull request as ready for review August 19, 2024 13:27

Wovchena approved these changes Aug 19, 2024

View reviewed changes

ilya-lavrenov reviewed Aug 19, 2024

View reviewed changes

rasapala force-pushed the log_model_properties2 branch from 05580ef to 608b361 Compare August 20, 2024 14:40

rasapala force-pushed the log_model_properties2 branch from 608b361 to 7eb692e Compare September 2, 2024 10:56

rasapala mentioned this pull request Sep 2, 2024

Log model settings openvinotoolkit/model_server#2660

Open

rasapala added 6 commits September 6, 2024 11:46

model properties

18b56c2

Working

a7f7468

Code review

e1686ed

Cleanup

814bd83

Full log flag

8fbb1c9

Base on env

f39478f

rasapala force-pushed the log_model_properties2 branch from 7eb692e to f39478f Compare September 6, 2024 09:46

Wovchena mentioned this pull request Sep 30, 2024

[Continuous Batching] Introduce echo parameter #900

Merged

ilya-lavrenov closed this Oct 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Log model properties #677

Log model properties #677

rasapala commented Jul 24, 2024

ilya-lavrenov Aug 2, 2024

rasapala Aug 19, 2024

ilya-lavrenov Aug 19, 2024

rasapala Aug 20, 2024

ilya-lavrenov Aug 20, 2024

rasapala Aug 20, 2024

mzegla Aug 21, 2024

dtrawins Sep 2, 2024

Log model properties #677

Log model properties #677

Conversation

rasapala commented Jul 24, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment