
Bump llama-cpp-python to 0.2.36 #5397

Merged · 1 commit merged into dev on Jan 30, 2024
Conversation

oobabooga (Owner) commented Jan 30, 2024

I had to remove older Python/CUDA versions because GitHub was rate-limiting my GitHub Actions jobs.

oobabooga changed the base branch from main to dev on January 30, 2024 02:30
oobabooga marked this pull request as a draft on January 30, 2024 02:30
Ph0rk0z (Contributor) commented Jan 30, 2024

It needs a setting to split by rows or by layers, since splitting by layers is slower, even with 3090s.

abetlen/llama-cpp-python#1085
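
For reference, a minimal sketch of selecting the split mode through llama-cpp-python's `Llama` constructor, assuming a build that exposes `split_mode` (the setting requested in the linked issue). The model path and the two-GPU `tensor_split` are assumptions, and the `LLAMA_SPLIT_*` constant names are the 0.2.x spellings (later releases renamed them to `LLAMA_SPLIT_MODE_*`):

```python
import llama_cpp
from llama_cpp import Llama

# Load with all layers offloaded and tensors split by rows across two GPUs.
llm = Llama(
    model_path="./model.gguf",             # hypothetical local GGUF model
    n_gpu_layers=-1,                       # offload every layer to the GPUs
    split_mode=llama_cpp.LLAMA_SPLIT_ROW,  # row split; LLAMA_SPLIT_LAYER splits by layers
    tensor_split=[0.5, 0.5],               # assumed even split across two GPUs
)

print(llm("Q: What is 2 + 2? A:", max_tokens=8)["choices"][0]["text"])
```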

oobabooga marked this pull request as ready for review on January 30, 2024 16:15
oobabooga (Owner, Author) commented

Could you make a PR? I don't have 2 GPUs to properly test this.

oobabooga merged commit 89f6036 into dev on Jan 30, 2024
Ph0rk0z (Contributor) commented Jan 30, 2024

Sure, I can PR it later. It can just be a checkbox to uncheck.
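
A hypothetical sketch of what that toggle could look like in a Gradio UI such as text-generation-webui's; the names `row_split` and `split_mode_from_checkbox` are illustrative, not the project's actual API:

```python
import gradio as gr
import llama_cpp

def split_mode_from_checkbox(use_row_split: bool) -> int:
    # Map the checkbox state onto llama-cpp-python's split_mode argument.
    return llama_cpp.LLAMA_SPLIT_ROW if use_row_split else llama_cpp.LLAMA_SPLIT_LAYER

with gr.Blocks() as demo:
    row_split = gr.Checkbox(
        value=True,
        label="row_split",
        info="Split the model by rows across GPUs; uncheck to split by layers.",
    )
    mode = gr.Number(label="split_mode", precision=0)
    row_split.change(split_mode_from_checkbox, row_split, mode)

# demo.launch()
```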

oobabooga deleted the llamacpp_0.2.36 branch on February 4, 2024
PoetOnTheRun pushed a commit to PoetOnTheRun/text-generation-webui that referenced this pull request on Feb 22, 2024
2 participants