Commit

Reduce computation time for ASYM mode
l-bat committed Jun 11, 2024
1 parent 0c51517 commit d754a97
Showing 1 changed file with 2 additions and 0 deletions.
@@ -165,6 +165,8 @@ def apply(
 original_weight = fns.zeros_like(weight) + weight

 compressed_weights, scale, zp = do_integer_quantization(original_weight, reduction_axis, config)
+if zp is not None:
+    zp = zp.astype(scale.dtype)
 q_weights = do_dequantization(compressed_weights, scale, zp, reduction_axis)

 s = fns.unsqueeze(s, 0)
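
The change above casts the zero point to the scale's dtype once, right after quantization. A minimal NumPy sketch of the idea follows. It is not the project's code: `dequantize` is a hypothetical stand-in for `do_dequantization`, the sample weights are made up, and NumPy replaces the `fns` backend. It assumes dequantization computes `(q - zp) * scale`, so an integer `zp` would be promoted to floating point on every call, while a pre-cast `zp` would not.

```python
import numpy as np


def dequantize(q, scale, zp=None):
    """Hypothetical stand-in for do_dequantization: (q - zp) * scale."""
    q = q.astype(scale.dtype)
    if zp is not None:
        # If zp has an integer dtype, NumPy promotes it to scale.dtype here,
        # on every call. A pre-cast zp skips that conversion.
        q = q - zp
    return q * scale


# Made-up weights; each row spans zero so the zero point fits in uint8.
weight = np.array(
    [[-1.0, -0.5, 0.0, 0.5, 1.0, 2.0],
     [-3.0, -1.0, 0.0, 0.25, 0.5, 1.0]],
    dtype=np.float32,
)

# Asymmetric per-row uint8 quantization (illustrative only).
w_min = weight.min(axis=1, keepdims=True)
w_max = weight.max(axis=1, keepdims=True)
scale = ((w_max - w_min) / 255.0).astype(np.float32)
zp = np.clip(np.round(-w_min / scale), 0, 255).astype(np.uint8)
q = np.clip(np.round(weight / scale) + zp, 0, 255).astype(np.uint8)

# The commit's step: cast the zero point once, before any dequantization.
zp_cast = zp.astype(scale.dtype)

out_int_zp = dequantize(q, scale, zp)        # integer zp, promoted per call
out_cast_zp = dequantize(q, scale, zp_cast)  # pre-cast zp, no promotion
```

Both calls produce the same values, so under these assumptions the cast changes only where the dtype conversion happens. Any saving would come from code that dequantizes repeatedly with the same `zp`.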