CUDA: faster Mixtral prompt processing for partial offloading #4553

Merged
JohannesGaessler merged 1 commit into ggerganov:master from JohannesGaessler:cuda-mixtral-partial-pp on Dec 21, 2023

Commits on Dec 21, 2023