Better 1.5 bit quantization #5971

Merged

ikawrakow merged 15 commits into master from ik/iq1s_blocks16

Mar 11, 2024

+1,153 −394

Commits on Mar 11, 2024

Trying blocks of 16 for IQ1_S - seems slightly better

Kawrakow committed Mar 11, 2024
Commit c9e9acf

iq1s_blocks16: Adjust scale fudge factor to 1.125

Kawrakow committed Mar 11, 2024
Commit cd83a7d

iq1s_blocks16: going to blocks of 32
```
with 2048 lattice points, so same bpw.
This is even better than blocks of 16.
Should I try blocks of 64? But to keep the same
bpw, when I go to 4096 lattice points, I need to
remove blocks altogether and just have superblocks of
256 weights.
```
Kawrakow committed Mar 11, 2024
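
The bpw bookkeeping in the message above can be checked with a small sketch. It assumes, and the commit does not state this, that each lattice index encodes a group of 8 weights and that the per-block budget is 1.5 bits per weight (taken from the PR title), ignoring any per-superblock scale. `spare_bits` is a hypothetical helper, not a function from the repository.

```python
GROUP_SIZE = 8    # assumption: one lattice index encodes 8 weights
TARGET_BPW = 1.5  # assumption: per-block budget, from the PR title


def spare_bits(block_size: int, lattice_points: int) -> int:
    """Bits left in a block for scales etc. after storing the lattice indices."""
    # Bits per index; exact when lattice_points is a power of two.
    index_bits = lattice_points.bit_length() - 1
    groups = block_size // GROUP_SIZE
    budget = int(TARGET_BPW * block_size)
    return budget - groups * index_bits


if __name__ == "__main__":
    for block_size, points in [(32, 2048), (64, 4096)]:
        print(block_size, points, spare_bits(block_size, points))
```

Under these assumptions, blocks of 32 with 2048 lattice points leave 4 spare bits per block, while blocks of 64 with 4096 points leave none. That would be consistent with having to drop per-block data and keep only superblocks of 256 weights.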
Commit 4c4404a

iq1s_blocks16: Use 2*<x^2> as sigma2 in weight adjustment

Kawrakow committed Mar 11, 2024
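
As a rough illustration of what this title may refer to: a common pattern in importance-weighted quantization is to scale each importance value by the square root of `sigma2 + x_i^2`. The sketch below assumes that form, with `sigma2` set to twice the mean square of the block's values. The function name and the exact formula are assumptions for illustration and are not taken from the commit.

```python
import math


def adjusted_weights(x, importance):
    """Hypothetical weight adjustment using sigma2 = 2 * <x^2>."""
    # <x^2> is the mean of the squared values in the block.
    sigma2 = 2.0 * sum(v * v for v in x) / len(x)
    # Assumed form: each importance value is scaled by sqrt(sigma2 + x_i^2).
    return [w * math.sqrt(sigma2 + v * v) for v, w in zip(x, importance)]
```

A larger `sigma2` in this form flattens the adjustment, so small and large values within a block are weighted more evenly.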
Commit c55e66f

iq1s_blocks16: scalar and AVX2 dot products

Kawrakow committed Mar 11, 2024
Commit 864a5c2

iq1s_blocks16: CUDA dot product

Kawrakow committed Mar 11, 2024
Commit f092d04

iq1s_blocks16: Metal works, Neon does not
```
Metal works but TG is dog slow (35 t/s). PP is OKish (493 t/s).
Not seeing the bug in the Neon implementation for now.
```
Kawrakow committed Mar 11, 2024
Commit fbb001e

iq1s_blocks16: fixed Neon

Kawrakow committed Mar 11, 2024
Commit 15acc79

iq1s_blocks16: very slightly faster TG on Metal
```
Still pathetic at 37 t/s
```
Kawrakow committed Mar 11, 2024
Commit 8561139

iq1s_blocks16: speedup Metal by packing codebook into uint32_t's

Kawrakow committed Mar 11, 2024
Commit d3da9d1

Formatting

Kawrakow committed Mar 11, 2024
Commit 7545d69

iq1s_blocks16: uint32_t codebook is also better in CUDA
```
TG-128 is now 204 t/s up from 194 t/s.
PP-512 is 5890 t/s, so significantly better than other quants.
```
Kawrakow committed Mar 11, 2024
Commit 156220f

iq1s_blocks16: slightly faster Neon dot product

Kawrakow committed Mar 11, 2024
Commit 101b18d

iq1s_blocks16: faster AVX2 dot product

Kawrakow committed Mar 11, 2024
Commit 34bc21f

iq1s_blocks16: adjust to ggml-common.h

Kawrakow committed Mar 11, 2024
Commit 9d83171