Free up VRAM if not in use #1090

Closed
senscarlos opened this issue Dec 20, 2023 · 1 comment
Labels
duplicate This issue or pull request already exists

Comments

@senscarlos

It would be great to have the option to unload the model(s) from VRAM when Tabby has not been used for some time.

This would allow other things to run instead. For example, during part of the day I use Tabby, and at other times I use Ollama for a chat interface. There's no room (or reason) to have both loaded at the same time.
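
For illustration, a minimal sketch of the requested idle-unload behavior in Rust. This is not Tabby's actual API; `ModelHandle`, `load_model`, and `IdleUnloader` are hypothetical names. The idea: each request refreshes a timestamp and lazily reloads the model, while a background sweeper drops the handle (freeing VRAM) once the idle timeout elapses.

```rust
// Hypothetical sketch of idle-based unloading; names are illustrative, not Tabby's API.
use std::sync::{Arc, Mutex};
use std::thread;
use std::time::{Duration, Instant};

// Placeholder for whatever object actually owns the GPU memory.
struct ModelHandle;

fn load_model() -> ModelHandle {
    println!("loading model into VRAM");
    ModelHandle
}

struct State {
    model: Option<ModelHandle>,
    last_used: Instant,
}

struct IdleUnloader {
    state: Mutex<State>,
    idle_timeout: Duration,
}

impl IdleUnloader {
    fn new(idle_timeout: Duration) -> Self {
        Self {
            state: Mutex::new(State { model: None, last_used: Instant::now() }),
            idle_timeout,
        }
    }

    // Called on every completion request: reload lazily and refresh the timestamp.
    fn touch(&self) {
        let mut state = self.state.lock().unwrap();
        if state.model.is_none() {
            state.model = Some(load_model());
        }
        state.last_used = Instant::now();
    }

    // Periodic sweep: drop the handle once the idle timeout has elapsed.
    fn sweep(&self) {
        let mut state = self.state.lock().unwrap();
        if state.model.is_some() && state.last_used.elapsed() >= self.idle_timeout {
            state.model = None; // dropping the handle is what frees the VRAM
            println!("model unloaded after idle timeout");
        }
    }
}

fn main() {
    let unloader = Arc::new(IdleUnloader::new(Duration::from_secs(30 * 60)));
    let sweeper = Arc::clone(&unloader);
    thread::spawn(move || loop {
        sweeper.sweep();
        thread::sleep(Duration::from_secs(60));
    });
    unloader.touch(); // simulate a single incoming request
}
```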

@senscarlos senscarlos added the enhancement New feature or request label Dec 20, 2023
@wsxiaoys
Member

Related discussion and a solution in #624

@wsxiaoys wsxiaoys added duplicate This issue or pull request already exists and removed enhancement New feature or request labels Dec 21, 2023