feat: extract LoRA for arbitrary models #312
This is definitely interesting. I wonder whether gradient-based approaches, evolutionary algorithms, or plain old linear-algebra norms (spectral, etc.) and factorization would be ideal for solving this. A digression: I wonder whether there is a clear gradient path from the model weights to the LoRA weights. I am not fluent enough in calculus, but I assume that factorization might not be differentiable. What could be an alternative in that case, where instead of propagating gradients backwards, another signal is used to reach the LoRA weights from the non-LoRA weights?
I believe the problem should be differentiable; however, as you say, we do not need to rely on gradient-based methods. We should use whatever method for low-rank approximation is available and most effective.
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
Hmm, yeah, I would like this feature too.
I implemented something in this direction using singular value decomposition (SVD). I call it LoRD, for Low-Rank Decomposition.
It's straightforward: low-rank approximation via low-rank matrix factorization.
Given a model, its fine-tuned counterpart, and a target rank $k$, extract the "best" rank-$k$ approximation to each difference in the model weights, and export it as a LoRA.
The parameter $k$ can be a constant or can be unique to each matrix, e.g. $k_i$ for $W_i \in \Theta$.
To summarize, given $\Delta W = W' - W$, find $\hat{A}, \hat{B}$, each with $k$ rows, such that the norm $\|\hat{A}^T\hat{B} - \Delta W\|_2$ is minimized.
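For concreteness, here is a minimal sketch of that per-matrix truncated-SVD extraction in PyTorch. The function name `extract_lora` and the plain state-dict interface are assumptions for illustration, not the actual LoRD code; the returned factors `a` and `b` correspond to $\hat{A}^T$ and $\hat{B}$ above, so that `a @ b` approximates $\Delta W$.

```python
# Sketch only: assumes both checkpoints are plain PyTorch state dicts with
# matching keys; names and interface here are illustrative, not from any
# existing script.
import torch


def extract_lora(base_sd: dict, tuned_sd: dict, rank: int = 8):
    """Return {key: (a, b)} such that a @ b approximates W_tuned - W_base."""
    lora = {}
    for key, w_base in base_sd.items():
        w_tuned = tuned_sd.get(key)
        # Only 2D weight matrices (e.g. linear projections) are decomposed here.
        if w_tuned is None or w_base.dim() != 2:
            continue
        delta = (w_tuned - w_base).float()
        # Truncated SVD: keeping the top-`rank` singular triplets gives the best
        # rank-k approximation of delta in the spectral/Frobenius norm
        # (Eckart-Young theorem).
        u, s, vh = torch.linalg.svd(delta, full_matrices=False)
        u_k, s_k, vh_k = u[:, :rank], s[:rank], vh[:rank, :]
        # Split the singular values evenly between the two factors so that
        # a @ b reconstructs u_k @ diag(s_k) @ vh_k.
        a = u_k * s_k.sqrt()                 # shape (out_features, rank)
        b = s_k.sqrt().unsqueeze(1) * vh_k   # shape (rank, in_features)
        lora[key] = (a, b)
    return lora
```

A per-matrix rank $k_i$ could be supported by passing a mapping from key to rank instead of a single integer.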