Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Separate reset for KV state and LoRA state in LLMPipeline #1058

Merged

Conversation

slyalin
Copy link
Collaborator

@slyalin slyalin commented Oct 24, 2024

Fixing a bug when LoRA state is experienced reset each time when generate is invoked that brought unnecessary overhead in each generate call even if LoRA tensors/alphas are not changed.

…chat. Remove too early LoRA state setting in LLMPipeline ctror.
@github-actions github-actions bot added category: LLM LLM pipeline (stateful, static) category: sampling Sampling / Decoding algorithms category: GenAI C++ API Changes in GenAI C++ public headers labels Oct 24, 2024
@ilya-lavrenov ilya-lavrenov added this to the 2024.5 milestone Oct 24, 2024
@ilya-lavrenov ilya-lavrenov added bug Something isn't working and removed category: sampling Sampling / Decoding algorithms labels Oct 24, 2024
@ilya-lavrenov ilya-lavrenov self-assigned this Oct 24, 2024
@ilya-lavrenov ilya-lavrenov added the category: LoRA Low rank adapters label Oct 24, 2024
@ilya-lavrenov ilya-lavrenov added this pull request to the merge queue Oct 24, 2024
Merged via the queue into openvinotoolkit:master with commit 6a4ba7f Oct 24, 2024
49 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working category: GenAI C++ API Changes in GenAI C++ public headers category: LLM LLM pipeline (stateful, static) category: LoRA Low rank adapters Code Freeze
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants