
Is it possible to control scale and zero point of full integer quantized (INT8) tflite model during conversion? #709

Closed
deltacosos opened this issue Oct 11, 2024 · 5 comments


@deltacosos

Issue Type

Feature Request

OS

Linux

onnx2tf version number

1.20.0

onnx version number

1.16.2

onnxruntime version number

1.19.2

onnxsim (onnx_simplifier) version number

tensorflow version number

2.17.0

Download URL for ONNX

https://github.com/ultralytics/ultralytics/blob/main/docs/en/models/yolov8.md

SiLU activations replaced with ReLUs. Input resolution 320.

Parameter Replacement JSON

-

Description

  1. My purpose is to personally test how tflite conversion and quantization behave when no scale and offset correction has to be run on the CPU while the object detection model is running.
  2. Currently, when I quantize the model with a representative dataset, the scale and zero point of the input tensor are correct (0.003921569 and 0.0 respectively), but I cannot get the same values for the output tensor (currently they are 0.00605 and -7). The confidence part of the output quantizes correctly, but something goes wrong in the conversion of the coordinates coming from dist2bbox. (The resulting parameters can be read back with the snippet after this list.)
  3. I tried a multi-output model, placing the output post-processing layers in a different order, and changing the TensorFlow version, all without success.
  4. It would be hugely beneficial if this feature could be added, or if I could be shown how to apply it.
  5. As input I trained a yolov8 model with the ultralytics library, but replaced the SiLU activations with ReLUs.
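
For reference, a minimal way to read back the (scale, zero_point) pairs mentioned above via the standard tf.lite.Interpreter API (the model file name is illustrative):

```python
import tensorflow as tf

# Load the fully integer-quantized model (path is illustrative).
interpreter = tf.lite.Interpreter(model_path="yolov8_int8.tflite")

# Each details dict exposes a (scale, zero_point) tuple under "quantization".
input_details = interpreter.get_input_details()[0]
output_details = interpreter.get_output_details()[0]
print("input :", input_details["quantization"])   # e.g. (0.003921569, 0)
print("output:", output_details["quantization"])  # e.g. (0.00605, -7)
```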
@PINTO0309 added the Quantization label Oct 11, 2024
@PINTO0309 (Owner) commented Oct 11, 2024

@deltacosos (Author) commented Oct 11, 2024

Yes, I think it is very much related to issue #269. I have also normalized the coordinates to the range [0, 1], so they are in the same range as the confidence values. It is entirely possible that I should simply route them to separate outputs instead of using Concat, which might confuse TensorFlow quantization (see the sketch below). I was just wondering whether the output scale and zero_point could be frozen before quantization starts. Freezing activations and weights can be done with other quantization libraries such as tfmot and AIMET, but I am not sure about TFLiteConverter. Another possibility is to change them afterwards.
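
A minimal sketch of that separate-outputs idea (the wrapper and the channel split are hypothetical, assuming the usual (batch, 4 + num_classes, num_anchors) head layout):

```python
import torch

class SplitOutputs(torch.nn.Module):
    """Hypothetical wrapper: exporting boxes and scores as two separate
    outputs lets each tensor get its own (scale, zero_point) during
    post-training quantization, instead of a single Concat output whose
    one range must cover both."""

    def __init__(self, model, num_box_channels=4):
        super().__init__()
        self.model = model
        self.num_box_channels = num_box_channels

    def forward(self, x):
        y = self.model(x)                      # assumed (N, 4 + nc, anchors)
        boxes = y[:, :self.num_box_channels]   # dist2bbox coordinates in [0, 1]
        scores = y[:, self.num_box_channels:]  # per-class confidences in [0, 1]
        return boxes, scores
```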

@PINTO0309 (Owner) commented Oct 11, 2024

Just rewrite the flatbuffer in Python with the flatbuffers package, along the lines of:

def rewrite_tflite_inout_opname(
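
As an illustration of that approach, here is a minimal sketch that patches the (scale, zero_point) of the graph outputs in a .tflite file, following the unpack/pack pattern used in TensorFlow's own flatbuffer tooling. File names and target values are illustrative, and note that relabeling the parameters only changes how consumers interpret the stored int8 values; it does not recompute them:

```python
import flatbuffers
from tensorflow.lite.python import schema_py_generated as schema_fb

def set_output_quant_params(src_path, dst_path, scale, zero_point):
    with open(src_path, "rb") as f:
        buf = bytearray(f.read())

    # Unpack the flatbuffer into the mutable object-API tree.
    model = schema_fb.ModelT.InitFromObj(
        schema_fb.Model.GetRootAsModel(buf, 0))

    # Overwrite (scale, zero_point) on every graph output tensor.
    subgraph = model.subgraphs[0]
    for out_idx in subgraph.outputs:
        tensor = subgraph.tensors[out_idx]
        if tensor.quantization is not None:
            tensor.quantization.scale = [scale]
            tensor.quantization.zeroPoint = [zero_point]

    # Repack and write the model with the TFLite file identifier.
    builder = flatbuffers.Builder(1024)
    builder.Finish(model.Pack(builder), file_identifier=b"TFL3")
    with open(dst_path, "wb") as f:
        f.write(bytes(builder.Output()))

set_output_quant_params("model_int8.tflite", "model_patched.tflite",
                        scale=1.0 / 255.0, zero_point=0)
```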

@deltacosos (Author)

Okay, I will have a look at it and report back soon on how it worked :)

@github-actions bot commented

If there is no activity within the next two days, this issue will be closed automatically.

@github-actions bot closed this as not planned (stale) Oct 18, 2024