Description
For those who don't mind trading some speed and VRAM for lower perplexity, this setting is great for squeezing that extra bit of quality out of Mixtral.
Additional Context
The setting can be found in "q_mlp.cu". It used to be hardcoded to 2, but is now a simple integer variable. I've been editing it manually, but it should be possible to pass a user-defined value from the webui.
This issue has been closed due to inactivity for 6 weeks. If you believe it is still relevant, please leave a comment below. You can tag a developer in your comment.