Add StreamingLLM for llamacpp & llamacpp_HF (2nd attempt) #5669
Cleaned-up version of #4761. It seems to be working reliably for both llamacpp and llamacpp_HF now.
Description
When active, this prevents the prompt from being re-evaluated from scratch whenever an old chat message is removed (as happens once the context window fills up), allowing you to keep talking to the model indefinitely without long prompt-reprocessing pauses.
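A minimal sketch of the idea behind this kind of cache reuse (illustrative only, not the PR's actual code; the function and variable names are made up, and the real implementation manipulates the llama.cpp KV cache directly). When the oldest message is dropped, only the cached positions covering that message need to be evicted; the shared prefix (which also preserves StreamingLLM's "attention sink" tokens at the start) and the shared suffix can be kept and shifted into place:

```python
# Illustrative sketch only -- not the PR's implementation. We just compute
# which cached token positions can be reused when an old chat message is
# dropped from the prompt.

def find_removed_span(cached_tokens: list[int], new_tokens: list[int]):
    """Return (start, end) such that cached positions [start, end) cover
    the removed message, while [0, start) and [end, len(cached_tokens))
    still match the new prompt and can be kept."""
    limit = min(len(cached_tokens), len(new_tokens))

    # Longest shared prefix (attention sinks, system prompt, etc.).
    start = 0
    while start < limit and cached_tokens[start] == new_tokens[start]:
        start += 1

    # Longest shared suffix (the messages after the removed one).
    suffix = 0
    while (suffix < limit - start
           and cached_tokens[-1 - suffix] == new_tokens[-1 - suffix]):
        suffix += 1

    return start, len(cached_tokens) - suffix


# Example: dropping the oldest message (tokens 7, 8, 9) from the chat.
cached = [1, 2, 7, 8, 9, 4, 5]   # prompt as last evaluated
new    = [1, 2, 4, 5]            # same prompt with the old message removed
start, end = find_removed_span(cached, new)
assert (start, end) == (2, 5)
# Evict cache positions [start, end) and shift [end, ...) left by
# (end - start); only genuinely new tokens need a forward pass.
```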
Usage
Pass the `--streaming-llm` command-line flag, or check the streaming_llm box in the model loader settings before loading the model.
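For example (a hypothetical invocation; `server.py`, `--model`, and `--loader` are the web UI's usual entry point and flags, and the model filename is a placeholder):

```
python server.py --model mistral-7b-instruct.Q4_K_M.gguf --loader llama.cpp --streaming-llm
```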