Skip to content

Commit

Permalink
Update on "Autoquant"
Browse files Browse the repository at this point in the history
Summary: Adding autoquantization functionality, using hte do_quant api
we can test kernel speeds and pick the best quantization type (or no
quantization) for each layer.

Test Plan: python test/test.py -k "autoquant"

also tested on SAM and SDXL
pytorch-labs/segment-anything-fast#114
HDCharles/sdxl-fast@8d9942a

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D55103983](https://our.internmc.facebook.com/intern/diff/D55103983)

[ghstack-poisoned]
  • Loading branch information
HDCharles committed Mar 19, 2024
2 parents 8dcfafe + 7fd7aac commit 55cce68
Showing 0 changed files with 0 additions and 0 deletions.

0 comments on commit 55cce68

Please sign in to comment.