Update state dict and model together #573
Conversation
approved
weight_scale=scales.float(),
weight_zero_point=0,
These two have to either both be tensors or both be scalars; I think it should work if you do scales.float().item().
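A minimal sketch of the constraint being described, using PyTorch's generic quantize helpers rather than the exact op from this PR: per-tensor ops take a scalar scale and zero_point, per-channel ops take tensors for both, and `scales.float().item()` only applies when `scales` holds a single element (the follow-up below points out that it does not here). The tensor shapes are made up for illustration.

```python
import torch

x = torch.randn(4, 8)

# Per-tensor quantization: scale and zero_point are both Python scalars.
q_tensor = torch.quantize_per_tensor(x, scale=0.1, zero_point=0, dtype=torch.qint8)

# Per-channel quantization: scales and zero_points are both tensors with one
# entry per channel along `axis`.
scales = torch.full((4,), 0.1)
zero_points = torch.zeros(4, dtype=torch.int64)
q_channel = torch.quantize_per_channel(x, scales, zero_points, axis=0, dtype=torch.qint8)

# Note: scales.float().item() only works when `scales` has exactly one
# element; it raises a RuntimeError for a multi-element tensor.
```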
Wait... This is a vector of 32000 elements
Oh, but _qdq_dynamic_quantized_linear only supports per-tensor quantization, so you may want to call a different op in that function.
To align the sizes you could do: weight_zero_point = torch.zeros(scales.shape)
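A rough sketch of the suggested shape alignment, assuming a per-channel path is used instead of the per-tensor one: the zero_point becomes a zeros tensor with the same shape as the scales, so both arguments are tensors. The weight shape (32000 output rows) is just the number mentioned in this thread, and the generic `torch.quantize_per_channel` stands in for whichever per-channel op the rewritten function would actually call.

```python
import torch

# Illustrative shapes from the thread (hypothetical, not taken from the PR's code):
# one scale per output channel of a 32000-row weight.
weight = torch.randn(32000, 4096)
scales = torch.rand(32000).clamp(min=1e-4)

# Suggested fix: give the zero_point the same shape as the scales so both
# are tensors, instead of passing the scalar 0.
weight_zero_point = torch.zeros(scales.shape, dtype=torch.int64)

# With matching shapes, a per-channel op can consume both arguments.
qweight = torch.quantize_per_channel(
    weight, scales.float(), weight_zero_point, axis=0, dtype=torch.qint8
)
print(qweight.shape)  # torch.Size([32000, 4096])
```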
* code beautification
* code beautification, move functions together
* rewrite model rewriter
* rewrite quantizers
* weights is none check
* typo
* not weight -> weight is not None
* fix dimensions for parallel prefill
* test
* typo
* bfloat16 on ARM with MacOS 14
* precision for a8w4
* sdpa_kv
* fixes
* inline qlq definition
* trial and error
* qdq not working
* ci
* not so fast with bf16=fast
* typo, and handle fast across maxcos version...
* typo
* type cast