From your paper and blog post, the quantization was tested using transformers. Is it possible to use the library for object detection models, and will it suffer performance degradation?
I have not tested this, but it should be possible for vision transformers. I already know from data that vision transformers behave the same way as language-model transformers in terms of outliers. As such, I think it will work.
However, it will not work out of the box for convolutional networks, since the current implementation only supports replacing torch.nn.Linear layers, not convolutional layers, with their 8-bit equivalents.
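A minimal sketch of why this matters, assuming a standard PyTorch module tree (the helper `quantizable_modules` is hypothetical, for illustration only): a Linear-only replacement pass fully covers a ViT-style MLP block, but skips the Conv2d layers that dominate a typical object-detection backbone.

```python
import torch.nn as nn

def quantizable_modules(model):
    """Split submodules into those a Linear-only replacement strategy
    would cover, and those it would skip (e.g. convolutions)."""
    covered, skipped = [], []
    for name, module in model.named_modules():
        if isinstance(module, nn.Linear):
            covered.append(name)
        elif isinstance(module, (nn.Conv1d, nn.Conv2d, nn.Conv3d)):
            skipped.append(name)
    return covered, skipped

# A ViT-style MLP block: entirely nn.Linear, so fully covered.
vit_mlp = nn.Sequential(
    nn.Linear(768, 3072),
    nn.GELU(),
    nn.Linear(3072, 768),
)

# A small conv net: its Conv2d layer would be left unquantized.
conv_net = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3),
    nn.ReLU(),
    nn.Flatten(),
    nn.Linear(16, 10),
)

print(quantizable_modules(vit_mlp))   # two covered, none skipped
print(quantizable_modules(conv_net))  # one covered, one Conv2d skipped
```

For a transformer-based detector (e.g. DETR-style heads built from Linear layers) this coverage is high, while a CNN backbone would mostly be skipped.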