
Does mmdeploy support onnx model quantization (onnx model with fp16 mode)? #818

Closed
sanjaypavo opened this issue Jul 26, 2022 · 4 comments

@sanjaypavo
Contributor

Hi. I need to deploy my model (any object detection model) in ONNX format with fp16 mode. Is this possible with mmdeploy?

Thanks in advance.

@tpoisonooo
Collaborator

Strictly speaking, fp16 mode is not a kind of quantization; it just converts fp32 values to fp16 format.

The ONNX format with fp16 precision has not been tested yet.

mmdeploy uses ppq to quantize ncnn models to int8; please check that path.
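
For reference, if an fp16 ONNX model is needed anyway, a common workaround outside of mmdeploy is to convert the exported fp32 model with the onnxconverter-common package. A minimal sketch, assuming the exported model is named end2end.onnx (the file names are placeholders):

```python
# Sketch: convert an fp32 ONNX model exported by mmdeploy to fp16.
# Requires the onnxconverter-common package; file names are placeholders.
import onnx
from onnxconverter_common import float16

model_fp32 = onnx.load("end2end.onnx")                      # fp32 export
model_fp16 = float16.convert_float_to_float16(model_fp32)   # cast tensors to fp16
onnx.save(model_fp16, "end2end_fp16.onnx")
```

Note this only changes tensor data types; it is not int8 quantization, and accuracy should still be validated with onnxruntime.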

@tpoisonooo tpoisonooo self-assigned this Jul 27, 2022
@tpoisonooo
Collaborator

Looks like I have to write an English version of the quantization doc +_+

@tpoisonooo
Collaborator

#842

@JIAOJINYU

@tpoisonooo
I apologize for bothering you out of the blue.
I would like to ask two questions.

  1. Does mmdeploy only support fp16-level conversion for onnxruntime at the moment?
  2. I would like to quantize rtmpose to int8. I tried onnxruntime's static quantization, but the accuracy of the quantized model was zero (a sketch of that flow follows below). Can I modify pose-detection_onnxruntime-fp16_static.py into pose-detection_onnxruntime-int8_static.py myself to get int8 quantization through mmdeploy?
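
For context, onnxruntime static quantization looks roughly like the following. This is a minimal sketch, not mmdeploy's pipeline; the input name "input", the 256x192 input shape, and the file names are assumptions, and the random arrays must be replaced with real preprocessed calibration images (mismatched preprocessing or calibration data is a common cause of a zero-accuracy result):

```python
# Sketch: static int8 quantization of an exported ONNX model with onnxruntime.
# Input name, shape, and file names are assumptions; use real calibration data.
import numpy as np
from onnxruntime.quantization import CalibrationDataReader, QuantType, quantize_static

class PoseCalibrationReader(CalibrationDataReader):
    def __init__(self, samples):
        self._iter = iter(samples)  # samples: preprocessed NCHW float32 arrays

    def get_next(self):
        batch = next(self._iter, None)
        return None if batch is None else {"input": batch}

# Placeholder calibration data; replace with real preprocessed images.
samples = [np.random.rand(1, 3, 256, 192).astype(np.float32) for _ in range(16)]

quantize_static(
    "rtmpose.onnx",                  # fp32 model exported by mmdeploy (placeholder name)
    "rtmpose_int8.onnx",             # quantized output
    PoseCalibrationReader(samples),
    weight_type=QuantType.QInt8,
)
```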
