float_quantize at multi-gpu works wrong. #67

jinsol-neubla · 2023-03-16T08:32:44Z

When I run model after applying float_quantize to weight or activation with multi-GPU,
(huggingface opt-model with device_map='auto')
quantization of layers allocated to second or later gpu works wrong.
The output of quantization shows mostly 0-value.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

float_quantize at multi-gpu works wrong. #67

float_quantize at multi-gpu works wrong. #67

jinsol-neubla commented Mar 16, 2023

float_quantize at multi-gpu works wrong. #67

float_quantize at multi-gpu works wrong. #67

Comments

jinsol-neubla commented Mar 16, 2023