
Changes to lmstudio to fix JSON decode error #208

Merged · 5 commits into letta-ai:main · Oct 31, 2023

Conversation

raisindetre (Contributor):

Hi @cpacker. I've made some tweaks to the LM Studio settings.py and api.py code which (for me at least) have resolved the issues with using LM Studio as a back-end.

  • Changed the endpoint to use http://localhost:1234/v1/chat/completions (the resulting request shape is sketched below).
  • Set stream to false per the LM Studio curl API example, as I think it might provide a performance gain.
  • Rewrapped the prompt JSON object within a messages object for compatibility with the endpoint and updated the reference to the response text accordingly.
  • Set the context_overflow_policy to option 2 (rolling context window).

In local testing I'm getting the expected behavior in chats and much improved performance. Might be worth checking the code on Windows for compatibility (I don't have access at the moment).
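Putting those changes together, here is a minimal sketch of the resulting request, assuming nothing beyond what's described above; the helper name and prompt-wrapping details are illustrative, not the PR's exact code.

```python
import requests

# New OpenAI-compatible chat endpoint (was a completions-style endpoint).
LMSTUDIO_CHAT_URI = "http://localhost:1234/v1/chat/completions"

def get_lmstudio_chat_completion(prompt: str) -> str:
    payload = {
        # Prompt rewrapped inside a messages object for the chat endpoint.
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,  # per the LM Studio curl API example
        "max_tokens": 3072,
        # Option 2 = rolling context window (LM Studio-specific field).
        "lmstudio": {"context_overflow_policy": 2},
    }
    resp = requests.post(LMSTUDIO_CHAT_URI, json=payload)
    resp.raise_for_status()
    # The response text now lives at choices[0].message.content rather
    # than choices[0].text as in the completions-style API.
    return resp.json()["choices"][0]["message"]["content"]
```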

@@ -9,5 +9,8 @@
         # '\n#',
         # '\n\n\n',
     ],
-    "max_tokens": 500,
+    "max_tokens": 3072,
+    "lmstudio": {"context_overflow_policy": 2},
Collaborator:

Ideally we shouldn't really be using any context overflow policies, since MemGPT has its own way of handling this and this might conflict. But if this works as a bandaid fix for some other bug wrt open LLM integration I'm OK with leaving it for now.
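For concreteness, a minimal sketch of the framework-side alternative being described, where the backend reports overflow and the framework reacts to it; the ContextOverflowError name and the error-text matching are hypothetical, though the pattern mirrors the later commit that propagates a recognizable context-overflow error up the stack.

```python
import requests

class ContextOverflowError(Exception):
    """Recognizable error so the caller can run its own context management."""

def post_chat_completion(uri: str, payload: dict) -> dict:
    resp = requests.post(uri, json=payload)
    if resp.status_code != 200:
        # Surface overflow distinctly instead of letting the backend
        # silently roll or truncate the context window.
        if "context" in resp.text.lower():
            raise ContextOverflowError(resp.text)
        resp.raise_for_status()
    return resp.json()
```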

raisindetre (Contributor, author):

Ah ok. Well, it may work ok without it. As you say, I think it's the endpoint change that's doing the heavy lifting for... reasons lol.

properly handle context overflow error (propagate exception up the stack with recognizable error message) + add backwards compat option to use completions endpoint
cpacker merged commit a048a33 into letta-ai:main on Oct 31, 2023
1 check passed
mattzh72 pushed a commit that referenced this pull request on Oct 9, 2024
* Changes to lmstudio to fix JSON decode error

* black formatting

* properly handle context overflow error (propagate exception up the stack with recognizable error message) + add backwards compat option to use completions endpoint

* set max tokens to 8k, comment out the overflow policy (use memgpt's overflow policy)

* 8k not 3k

---------

Co-authored-by: Matt Poff <[email protected]>
Co-authored-by: cpacker <[email protected]>