Update on "Autoquant" · pytorch/ao@55cce68

Commit

Update on "Autoquant"

Summary: Adding autoquantization functionality, using hte do_quant api
we can test kernel speeds and pick the best quantization type (or no
quantization) for each layer.

Test Plan: python test/test.py -k "autoquant"

also tested on SAM and SDXL
pytorch-labs/segment-anything-fast#114
HDCharles/sdxl-fast@8d9942a

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D55103983](https://our.internmc.facebook.com/intern/diff/D55103983)

[ghstack-poisoned]

Loading branch information

HDCharles committed Mar 19, 2024

2 parents 8dcfafe + 7fd7aac commit 55cce68

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit

There are no files selected for viewing

0 comments on commit `55cce68`

Commit

There are no files selected for viewing

0 comments on commit 55cce68

0 comments on commit `55cce68`