Does LoKr support wrapping an LLM model? #220

Open
WhuanY opened this issue Oct 13, 2024 · 3 comments

Comments


WhuanY commented Oct 13, 2024

Thanks for your amazing work! As the title says, my task is to fine-tune a large language model via different LoRA-like adapters. Currently I am exploring fine-tuning LoKr on our customized model via different frameworks (e.g. torch, paddle).

Is there any documentation that shows how to apply a LoKr adapter to such an LLM? I used the LoKr API from Hugging Face recently, but it seems to have bugs, which I guess is because the HF developers didn't test LoKr on any LLMs (see here for why I think so). Thanks.
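For context, the Hugging Face route mentioned above would look roughly like the sketch below, assuming PEFT's LoKrConfig and get_peft_model; the model name, rank, and target module names are illustrative placeholders, not the exact setup that hit the bug:

```python
# Hypothetical sketch of applying PEFT's LoKr to a causal LM.
# Assumes peft.LoKrConfig and peft.get_peft_model; the model name and
# target module names are placeholders and vary per architecture.
from transformers import AutoModelForCausalLM
from peft import LoKrConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")

config = LoKrConfig(
    r=8,                                            # LoKr rank
    alpha=16,                                       # scaling factor
    target_modules=["q_proj", "k_proj", "v_proj"],  # layer names differ per model
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # only adapter weights should be trainable
```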

KohakuBlueleaf (Owner) commented:

  1. In HakuPhi I already show how to use LyCORIS on an LLM.
  2. LyCORIS is now designed to be a general PEFT library that can wrap ANY pytorch module, no matter what it is for. We just won't guarantee that performance will surpass or be on par with algorithms that LyCORIS didn't implement.
  3. Check the example: if your model is implemented in pytorch and you are using custom attention (instead of pytorch's MHA), it will definitely work. (A minimal sketch follows below.)
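For reference, wrapping an LLM with LyCORIS could look roughly like the sketch below. It assumes the create_lycoris / apply_to wrapper interface from the LyCORIS README; the model, preset regexes, and hyperparameters are illustrative assumptions, not the HakuPhi setup itself:

```python
# Minimal sketch: wrapping a pytorch causal LM with a LoKr adapter via
# LyCORIS's wrapper API (create_lycoris / apply_to, as in the README).
# Model name, preset regexes, and hyperparameters are illustrative.
import torch
from transformers import AutoModelForCausalLM
from lycoris import create_lycoris, LycorisNetwork

model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")

# Limit wrapping to modules whose names match these regexes.
LycorisNetwork.apply_preset({"target_name": [".*attn.*", ".*mlp.*"]})

lycoris_net = create_lycoris(
    model,
    1.0,               # multiplier
    linear_dim=16,
    linear_alpha=2.0,
    algo="lokr",
)
lycoris_net.apply_to()  # injects the adapter into the matched modules

# Train only the adapter parameters, leaving the base model frozen.
optimizer = torch.optim.AdamW(lycoris_net.parameters(), lr=1e-4)
```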


WhuanY commented Oct 14, 2024

  3. Check the example: if your model is implemented in pytorch and you are using custom attention (instead of pytorch's MHA), it will definitely work.

Thanks. By "pytorch MHA", may I confirm you are referring to torch.nn.MultiheadAttention?

Knowing that would be of great help. Thanks!

KohakuBlueleaf (Owner) commented:

Yes. pytorch's MHA fuses the q, k, v projections together (though not always), which makes things tricky.
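To illustrate the distinction (a minimal sketch, not from the thread): nn.MultiheadAttention stores q, k, v in a single fused in_proj_weight when all embedding dims match, so there is no separate Linear module per projection for a module-level wrapper to target, whereas custom attention exposes one.

```python
import torch.nn as nn

# nn.MultiheadAttention fuses the q/k/v projections into one weight
# (in_proj_weight) when query/key/value dims are equal, so there is no
# per-projection nn.Linear for a module-level adapter to wrap.
mha = nn.MultiheadAttention(embed_dim=512, num_heads=8)
print(mha.in_proj_weight.shape)  # torch.Size([1536, 512]), i.e. (3 * embed_dim, embed_dim)

# Custom attention with separate nn.Linear projections exposes one
# wrappable module per projection, which LyCORIS can target directly.
class CustomAttention(nn.Module):
    def __init__(self, dim: int) -> None:
        super().__init__()
        self.q_proj = nn.Linear(dim, dim)
        self.k_proj = nn.Linear(dim, dim)
        self.v_proj = nn.Linear(dim, dim)
        self.out_proj = nn.Linear(dim, dim)
```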
