ggml now supports Q1_O quantization, which has been reported to offer better inference quality for some models at the cost of slower execution. At the same time, Open Assistant has released newer weights for the Pythia-based model than the ones currently being pulled.

Perhaps it would be worth updating the model on Hugging Face using the new quantization method?

I would open a PR for this myself, but I don't have access to a GPU with enough RAM to quantize the 12B model.