[TFLite] Custom attribute reading and While operation support #17932
Conversation
const auto& decoder = get_decoder(node);
int32_t cond_idx = decoder->get_attribute(&tflite::WhileOptions::cond_subgraph_index);
int32_t body_idx = decoder->get_attribute(&tflite::WhileOptions::body_subgraph_index);
The code is quite similar to the TF While translator. Can we create a translate_session class for storing cached, previously converted body graphs? I think it makes sense because then we can use the Frontend class implementation of the convert method, which contains the telemetry logic.
I am leaning towards aligning this code with the TF implementation.
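A rough sketch of the kind of caching translate_session this comment has in mind; the class and member names below are illustrative only, not the actual TF FE API:

#include <functional>
#include <map>
#include <memory>
#include <utility>

#include "openvino/core/model.hpp"

// Illustrative session that caches converted body/condition graphs, so a
// sub-graph referenced by several While operations is converted only once.
class TranslateSessionSketch {
public:
    using SubgraphTranslator = std::function<std::shared_ptr<ov::Model>(int32_t)>;

    explicit TranslateSessionSketch(SubgraphTranslator translator) : m_translator(std::move(translator)) {}

    std::shared_ptr<ov::Model> get_converted_model(int32_t subgraph_idx) {
        auto it = m_cache.find(subgraph_idx);
        if (it != m_cache.end())
            return it->second;                    // reuse a previously converted graph
        auto model = m_translator(subgraph_idx);  // convert on first request only
        m_cache.emplace(subgraph_idx, model);
        return model;
    }

private:
    SubgraphTranslator m_translator;
    std::map<int32_t, std::shared_ptr<ov::Model>> m_cache;
};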
I have considered it. However, upon a closer look into the TF FE translation session I encountered a lot of logic that is TF-specific and not needed from the TFL FE perspective. The remaining code is quite simple. This is why I kept the translation logic in InputModel and overloaded the public frontend::NodeContext API that we have right now. With all that, frontend::tensorflow_lite::NodeContext is able to trigger translation on demand.
With regards to the alignment: it would be great to pull all the similar code into a common space. Perhaps passing the condition / body as ov::Model would be a good start. Nonetheless, there are several places where the TF implementation has shape / type setting for better TensorList and DT_VARIANT support. These parts are not needed for TFL.
I made an attempt to unify this code, but it broke into several pieces, which made it less elegant. So I left them separate for now. I believe we will return to this unification question when unification across all the frontends comes, but for now I don't see the urgency.
Yes, there is some TF-specific stuff in TranslateSession, but you can easily skip it: design a common TranslateSession class and derive a TFL TranslateSession class with your specific methods.
My point is to have a common TranslateSession with a common get_converted_model in the future. For now you can overload this function since we don't have FW node support in TFL FE.
If we decline to use TranslateSession now, we create an additional gap on the way to the common Frontend, where we can place telemetry stuff, for example.
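Building on the cache sketch above, the proposed split into a common base class plus a TFL-specific subclass could look roughly like this (hypothetical names and signatures, not the real TF FE interface):

#include <memory>

#include "openvino/core/model.hpp"

// Hypothetical common base: shared caching, traversal, and telemetry hooks would live here.
class CommonTranslateSession {
public:
    virtual ~CommonTranslateSession() = default;
    // Common entry point; the base would consult its cache and fall back to translate_subgraph().
    virtual std::shared_ptr<ov::Model> get_converted_model(int32_t subgraph_idx) {
        return translate_subgraph(subgraph_idx);
    }

protected:
    virtual std::shared_ptr<ov::Model> translate_subgraph(int32_t) { return nullptr; }
};

// Hypothetical TFL-specific session: overrides only what differs, e.g. the
// behavior when framework (FW) node fallback is not supported in TFL FE.
class TFLiteTranslateSession : public CommonTranslateSession {
protected:
    std::shared_ptr<ov::Model> translate_subgraph(int32_t subgraph_idx) override {
        // TFL-specific translation of the sub-graph would go here.
        return CommonTranslateSession::translate_subgraph(subgraph_idx);
    }
};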
src/frontends/tensorflow_lite/include/openvino/frontend/tensorflow_lite/node_context.hpp
const tflite::Operator* m_node_def;
std::string m_type, m_name;
std::map<size_t, ov::frontend::tensorflow_lite::TensorInfo> m_input_info, m_output_info;
};

class DecoderFlatBufferTensors : public DecoderFlatBuffer {
Please add a comment about why we need this class.
I could handle this in the next PR.
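For reference, a comment along these lines could work; this is inferred from how the class is used later in this conversation (graph input/output tensors get their own decoders) and is an assumption about intent rather than the author's wording:

// Assumed purpose (to be confirmed by the author): unlike DecoderFlatBuffer,
// which decodes tflite::Operator nodes, this decoder wraps a graph
// input/output tensor so that model inputs and outputs can be traversed
// through the same decoder interface as operations.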
auto subgraphs_as_input_models = model_lite->get_subgraphs();
auto input_to_ov_model = [&](const std::shared_ptr<ov::frontend::tensorflow_lite::InputModel>& in_model) {
    auto simple_lambda = [&]() -> std::shared_ptr<ov::Model> {
        std::shared_ptr<ov::Model> model;
        if (in_model)
            translate_graph(in_model, fail_fast, no_conversion, model);
        return model;
    };
    return simple_lambda;
};
std::vector<std::function<std::shared_ptr<ov::Model>()>> submodel_translation_functions;
submodel_translation_functions.reserve(subgraphs_as_input_models.size());
for (const auto& subgraph : subgraphs_as_input_models) {
    submodel_translation_functions.push_back(input_to_ov_model(subgraph));
}
Am I correct that you translate all sub-graphs even if it is not needed? For example, in the case of cutting, some sub-graphs can be redundant.
Is it possible to translate on demand inside the translator for the While operation and cache the result?
It is not a full translation, it is just the creation of InputModel. Here, for each sub-graph, I create translation functions from InputModel to ov::Model which are not triggered here; they are triggered in the While op translator. So a sub-graph only gets translated during the NodeContext::get_subgraph(int) call.
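A condensed sketch of the deferred mechanism described here, with hypothetical member names; the stored std::function is only invoked when a translator asks for the sub-graph:

#include <functional>
#include <memory>
#include <vector>

#include "openvino/core/model.hpp"

// Sketch: the frontend keeps one translation function per sub-graph;
// nothing is translated until get_subgraph() is called for that index.
struct NodeContextSketch {
    std::vector<std::function<std::shared_ptr<ov::Model>()>> submodel_translation_functions;

    std::shared_ptr<ov::Model> get_subgraph(int idx) const {
        // Translation of the requested sub-graph happens only here, on demand.
        return submodel_translation_functions.at(static_cast<size_t>(idx))();
    }
};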
continue;
auto any_item = m_nodes[node_index];
bool is_op = any_item.is<const tflite::Operator*>();
FRONT_END_GENERAL_CHECK(is_op || any_item.is<int32_t>());
Please add a message describing the error.
I will handle it in the next PR.
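For that follow-up, the check could carry a message along these lines (the message text is only a suggestion, assuming the macro accepts streamed message arguments like other OpenVINO checks):

FRONT_END_GENERAL_CHECK(is_op || any_item.is<int32_t>(),
                        "Unexpected node kind at index ",
                        node_index,
                        ": expected a tflite::Operator* or an int32_t tensor index");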
auto node = m_nodes[node_index].as<const tflite::Operator*>();
auto buffers = m_model->buffers();

std::map<size_t, TensorInfo> input_info = {}, output_info = {};
Suggested change:
- std::map<size_t, TensorInfo> input_info = {}, output_info = {};
+ std::unordered_map<size_t, TensorInfo> input_info = {}, output_info = {};
This isn't a change introduced by the current PR: getting a decoder for operations was supported before. So I will change it in the next PR, as the current code doesn't break the behavior.
for (auto input : *node->inputs()) {
    if (input == -1)
        continue;
    auto buffer = (*buffers)[(*tensors)[input]->buffer()];
Very complex construction :)
Can we have a check for each step: 1. input is in tensors; 2. (*tensors)[input]->buffer() is valid; 3. (*tensors)[input]->buffer() is in buffers?
All the asserts are already in place inside flatbuffers.
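If explicit, step-by-step checks were still wanted on top of the flatbuffers asserts, they might look roughly like this (a sketch only; the variable names follow the snippet above and the error messages are suggestions):

FRONT_END_GENERAL_CHECK(tensors != nullptr && static_cast<uint32_t>(input) < tensors->size(),
                        "Operator input index points outside of the tensor table");
const auto* tensor = (*tensors)[input];
FRONT_END_GENERAL_CHECK(tensor != nullptr, "Tensor entry is missing in the model");
const auto buffer_idx = tensor->buffer();
FRONT_END_GENERAL_CHECK(buffers != nullptr && buffer_idx < buffers->size(),
                        "Tensor buffer index points outside of the buffer table");
auto buffer = (*buffers)[buffer_idx];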
} else {
    type = tflite::EnumNamesBuiltinOperator()[operator_code->builtin_code()];
}
if (type == "CUSTOM") {
Please add a comment explaining what the CUSTOM type is.
I find it quite self-explanatory; however, if there is a need to explain it further, I will add a comment in the next PR.
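For context: CUSTOM is the builtin_code the TFLite schema assigns to operations that are not built-in, and the real operation name then comes from OperatorCode::custom_code. A comment in that spirit could read (a sketch, to be adapted to the actual code):

if (type == "CUSTOM") {
    // "CUSTOM" marks a non-builtin TFLite operation; its real name is stored
    // separately in OperatorCode::custom_code rather than in builtin_code.
    type = operator_code->custom_code()->str();
}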
} else {
    type = tflite::EnumNamesBuiltinOperator()[operator_code->builtin_code()];
output_it == outputs->end() ? -1 : static_cast<int64_t>(std::distance(outputs->begin(), output_it));
return std::make_shared<DecoderFlatBufferTensors>(info, input_idx, output_idx);
Shouldn't we check that input_idx / output_idx is not -1?
Here we handle input and/or output tensors, and we explicitly trigger them only for these types of tensors, so having -1 is okay.
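A short clarifying comment at that spot might help future readers; something along these lines (a suggestion only):

// -1 is intentional here, not an error: it simply means the tensor is not
// among the graph outputs (or inputs). Decoders are created explicitly for
// input/output tensors, so the index is valid wherever it is actually used.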
/// \brief Returns an iterator for a subgraph, created on demand
/// If there is no query for a specific sub-graph, its iterator shouldn't be created
/// idx should be in the range 0..get_subgraph_size()-1
std::shared_ptr<GraphIteratorFlatBuffer> get_subgraph(const size_t& idx) const;
Do you have a common cache of already converted sub-graphs, accessible to all sub-graphs simultaneously?
No, I don't. There is no such use case in known TFL models.
size_t subgraph_size = m_graph_iterator->get_subgraph_size();
if (subgraph_size > 1) {
    m_subgraphs.reserve(subgraph_size);
    m_subgraphs.push_back(nullptr);  // no main graph
    for (size_t i = 1; i < subgraph_size; ++i) {
        m_subgraphs.push_back(
            std::make_shared<ov::frontend::tensorflow_lite::InputModel>(m_graph_iterator->get_subgraph(i),
                                                                        m_telemetry));
    }
}
I don't understand. On the one hand, it is said above that a sub-graph is created on demand. On the other hand, you create an InputModel for all sub-graphs.
I described the logic in a previous comment: only the InputModel objects are created here, while the actual translation to ov::Model is deferred.
@@ -24,6 +24,9 @@ class InputModel : public ov::frontend::InputModel {
    std::map<std::string, std::shared_ptr<ov::frontend::tensorflow_lite::TensorLitePlace>> get_tensor_places() const;
    std::map<std::string, Output<Node>> get_tensor_values() const;

    ////// Subgraph Handling /////
strange comment:)
I struggle to understand what action is required here. Could you make your comments more concrete to keep this review productive?
…notoolkit#17932)
* Custom attribute reading and While operation support
* Rearranges FLATBUFFERS_LOCALE_INDEPENDENT setting
* Style
* Make flatbuffers code as version independent as possible
* Comments addressed
Tickets: