fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' #1959

VTrngNghia · 2024-07-16T02:55:35Z

What does this PR do?

Adds a keyword argument to allow passing extra_options to ORTQuantizer.quantize()

To fix RuntimeError:

Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0'. shape_inference failed to return a type probably this node is from a different domain or using an input produced by such an operator. This may happen if you quantize a model already quantized. You may use extra_options DefaultTensorType to indicate the default weight type, usually onnx.TensorProto.FLOAT.

Maybe it can be added to AutoQuantizationConfig, but there any many @staticmethod for that, so maybe this quick fix is simpler.

Who can review?

It's very simple. Anyone can review.

ONNX / ONNX Runtime : @fxmarty, @echarlaix, @JingyaHuang, @michaelbenayoun
ONNX Runtime Training: @JingyaHuang
BetterTransformer: @fxmarty
GPTQ, quantization: @fxmarty, @SunMarc
TFLite export: @michaelbenayoun

severinsimmler · 2024-10-30T14:56:22Z

+1

Thanks for fixing this @VTrngNghia

feat(quantization): add extra_options

4f9a167

VTrngNghia changed the title ~~feat(quantization): add extra_options~~ fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' Jul 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' #1959

fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' #1959

VTrngNghia commented Jul 16, 2024 •

edited

Loading

severinsimmler commented Oct 30, 2024

fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' #1959

Are you sure you want to change the base?

fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' #1959

Conversation

VTrngNghia commented Jul 16, 2024 • edited Loading

What does this PR do?

Who can review?

severinsimmler commented Oct 30, 2024

VTrngNghia commented Jul 16, 2024 •

edited

Loading