Document how to use specific LLMs #417

rbren · 2024-03-30T23:38:29Z

What problem or use case are you trying to solve?
Lots of folks are struggling to get OpenDevin working with non-OpenAI models. Local ollama seems to be particularly hard

Describe the UX of the solution you'd like
We should have a doc that lists out 3-4 major providers, explains how to get an API key, and how to configure OpenDevin

Do you have thoughts on the technical implementation?
Just a Models.md that we can link to from README.md

gaborkukucska · 2024-03-31T06:48:56Z

+1 for ollama support and documentation.

I get ModuleNotFoundError: No module named 'llama_index.embeddings.ollama' when
I use LLM_EMBEDDING_MODEL="llama2" and ModuleNotFoundError: No module named 'llama_index.embeddings.huggingface' when I use LLM_EMBEDDING_MODEL="local"

enyst · 2024-04-02T05:03:34Z

Do you have thoughts on the technical implementation?
Just a Models.md that we can link to from README.md

I see github still has the wiki feature. I don't see it on opendevin, but IIRC that's because it would need enabled for the project. What if we use that for a few / several model-specific pages? I don't remember if access can be enabled widely but it's a wiki, I'd assume so.

rbren · 2024-04-02T12:34:47Z

That seems reasonable to me

DanRegalia · 2024-04-02T21:16:26Z

Can you update the supported tags for the .env with commands to connect to open llms, openAI/OllamaWebUI/Oogabooga/ETC?

Dampfinchen · 2024-04-02T21:16:32Z

IMO, Ollama can be a nuisance for people who already have gguf files because it requires GGUFs to be converted. It also has no GUI, in contrast to koboldcpp and Ooba.

I recommend using llama.cpp server, koboldcpp and Ooba instead. Those are really easy to use inference programs.

DGdev91 · 2024-04-02T22:54:39Z

Agreed. I also made a PR for setting LLM_BASE_URL with make setup-config.
Should be enough for any OpenAI-compatible #616

For example, it works with Koboldcpp by setting it like this:
LLM_API_KEY="."
LLM_MODEL="gpt-4-0125-preview"
LLM_BASE_URL="http://localhost:5001/v1/"
WORKSPACE_DIR="./workspace"

JayQuimby · 2024-04-03T02:26:52Z

I made a markdown file for Ollama #615.

feuler · 2024-04-03T21:04:46Z

I use oobabooga with the openai compatible api. And currently trying OpenCodeInterpreter-DS-33B (exl2) as model first. I'm not aware which non-openai/claude model works best yet...

config.toml:
WORKSPACE_DIR="./workspace"
LLM_API_KEY="sk-111111111111111111111111111111111111111111111111"
LLM_BASE_URL="http://127.0.0.1:5000"
LLM_EMBEDDING_MODEL="local"
LLM_MODEL="oobabooga/OpenCodeInterpreter-DS-33B-4.65bpw-h6-exl2"

After starting a development i currently get these kind of errors sometimes during the process:

ERROR:
'str' object has no attribute 'copy'
Traceback (most recent call last):
File "/opt/llm/OpenDevin/opendevin/controller/agent_controller.py", line 113, in step
action = self.agent.step(self.state)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/llm/OpenDevin/agenthub/monologue_agent/agent.py", line 166, in step
action = prompts.parse_action_response(action_resp)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/llm/OpenDevin/agenthub/monologue_agent/utils/prompts.py", line 135, in parse_action_response
return action_from_dict(action_dict)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/llm/OpenDevin/opendevin/action/init.py", line 24, in action_from_dict
action = action.copy()
^^^^^^^^^^^
AttributeError: 'str' object has no attribute 'copy'

###############
ERROR:
"'action' key is not found in action={}"
Traceback (most recent call last):
File "/opt/llm/OpenDevin/opendevin/controller/agent_controller.py", line 113, in step
action = self.agent.step(self.state)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/llm/OpenDevin/agenthub/monologue_agent/agent.py", line 166, in step
action = prompts.parse_action_response(action_resp)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/llm/OpenDevin/agenthub/monologue_agent/utils/prompts.py", line 135, in parse_action_response
return action_from_dict(action_dict)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/llm/OpenDevin/opendevin/action/init.py", line 26, in action_from_dict
raise KeyError(f"'action' key is not found in {action=}")
KeyError: "'action' key is not found in action={}"

stratte89 · 2024-04-03T21:59:14Z

Can you update the supported tags for the .env with commands to connect to open llms, openAI/OllamaWebUI/Oogabooga/ETC?

here is a guide for oobabooga webui
08a2dfb#commitcomment-140559598

rmortes · 2024-04-04T09:44:01Z

I use oobabooga with the openai compatible api. And currently trying OpenCodeInterpreter-DS-33B (exl2) as model first. I'm not aware which non-openai/claude model works best yet...

config.toml: WORKSPACE_DIR="./workspace" LLM_API_KEY="sk-111111111111111111111111111111111111111111111111" LLM_BASE_URL="http://127.0.0.1:5000" LLM_EMBEDDING_MODEL="local" LLM_MODEL="oobabooga/OpenCodeInterpreter-DS-33B-4.65bpw-h6-exl2"

After starting a development i currently get these kind of errors sometimes during the process:

ERROR: 'str' object has no attribute 'copy' Traceback (most recent call last): File "/opt/llm/OpenDevin/opendevin/controller/agent_controller.py", line 113, in step action = self.agent.step(self.state) ^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/llm/OpenDevin/agenthub/monologue_agent/agent.py", line 166, in step action = prompts.parse_action_response(action_resp) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/llm/OpenDevin/agenthub/monologue_agent/utils/prompts.py", line 135, in parse_action_response return action_from_dict(action_dict) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/llm/OpenDevin/opendevin/action/init.py", line 24, in action_from_dict action = action.copy() ^^^^^^^^^^^ AttributeError: 'str' object has no attribute 'copy'

############### ERROR: "'action' key is not found in action={}" Traceback (most recent call last): File "/opt/llm/OpenDevin/opendevin/controller/agent_controller.py", line 113, in step action = self.agent.step(self.state) ^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/llm/OpenDevin/agenthub/monologue_agent/agent.py", line 166, in step action = prompts.parse_action_response(action_resp) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/llm/OpenDevin/agenthub/monologue_agent/utils/prompts.py", line 135, in parse_action_response return action_from_dict(action_dict) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/llm/OpenDevin/opendevin/action/init.py", line 26, in action_from_dict raise KeyError(f"'action' key is not found in {action=}") KeyError: "'action' key is not found in action={}"

Getting a similar error using the following config in MacOS

config.toml

LLM_MODEL="ollama/deepseek-coder:instruct"
LLM_API_KEY="ollama"
LLM_EMBEDDING_MODEL="local"
LLM_BASE_URL="http://localhost:11434"
WORKSPACE_DIR="./workspace"

Oops. Something went wrong: 'str' object has no attribute 'copy'
Oops. Something went wrong: "'action['action']=None' is not defined. Available actions: dict_keys(['kill', 'run', 'browse', 'read', 'write', 'recall', 'think', 'finish', 'add_task', 'modify_task'])"

stratte89 · 2024-04-04T09:47:50Z

hey, use openai/modelname like

LLM_API_KEY="na"
LLM_BASE_URL="http://0.0.0.0:5000/v1"
LLM_MODEL="openai/alpindale_Mistral-7B-v0.2-hf"
LLM_EMBEDDING_MODEL="lokal"
WORKSPACE_DIR="./workspace"
MAX_ITERATIONS=20000

although i dont think that the model name really matter for oobabooga as long as its openai/ it can be anything, at least it works for me, i can switch model without changing the config and it's working.

rbren · 2024-04-04T13:36:32Z

@Rags100 this isn't really the right place for your question. Please follow the README, and search our issues for related problems (there's an existing one for uvloop--you'll need WSL to make it work).

If you continue to have trouble, feel free to file a new issue with the template filled out. Thanks!

rbren · 2024-04-09T16:59:58Z

Hey all--lots of unrelated comments in this thread. Please try to keep this about LLM Model documentation.

I'm going to delete the unrelated comments--feel free to open new issues if you're having trouble

enyst · 2024-04-12T03:23:55Z

Documentation for Azure: #1035

larkinwc · 2024-04-23T22:13:30Z

Added documentation for using Google's Gemini model through AI studio as well as VertexAI through GCP #1321

rbren · 2024-05-03T20:17:26Z

We've made a lot of progress on this one, so I'm going to close it. More docs welcome!

rbren added the enhancement New feature or request label Mar 30, 2024

This was referenced Mar 31, 2024

Google Gemini #461

Closed

Google Gemini #490

Closed

ollama: 'llama2' not found, try pulling it first #311

Closed

rbren mentioned this issue Apr 2, 2024

Support Open source LLMs #592

Closed

DGdev91 mentioned this issue Apr 2, 2024

Add the ability to set LLM_BASE_URL with make setup-config #616

Merged

All-Hands-AI deleted a comment from grey-demon Apr 9, 2024

All-Hands-AI deleted a comment from yashupadhyayy1 Apr 9, 2024

All-Hands-AI deleted a comment from DOLARIK Apr 9, 2024

All-Hands-AI deleted a comment from SNEHAL-VATS Apr 9, 2024

All-Hands-AI deleted a comment from Rags100 Apr 9, 2024

rbren changed the title ~~Model-specific documentation~~ Document how to use specific LLMs Apr 9, 2024

rbren added the severity:medium Affecting multiple users label Apr 9, 2024

rbren closed this as completed May 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Document how to use specific LLMs #417

Document how to use specific LLMs #417

rbren commented Mar 30, 2024

gaborkukucska commented Mar 31, 2024

enyst commented Apr 2, 2024

rbren commented Apr 2, 2024

DanRegalia commented Apr 2, 2024

Dampfinchen commented Apr 2, 2024 •

edited

Loading

DGdev91 commented Apr 2, 2024

JayQuimby commented Apr 3, 2024

feuler commented Apr 3, 2024 •

edited

Loading

stratte89 commented Apr 3, 2024

rmortes commented Apr 4, 2024

stratte89 commented Apr 4, 2024 •

edited

Loading

rbren commented Apr 4, 2024

rbren commented Apr 9, 2024

enyst commented Apr 12, 2024

larkinwc commented Apr 23, 2024

rbren commented May 3, 2024

Document how to use specific LLMs #417

Document how to use specific LLMs #417

Comments

rbren commented Mar 30, 2024

gaborkukucska commented Mar 31, 2024

enyst commented Apr 2, 2024

rbren commented Apr 2, 2024

DanRegalia commented Apr 2, 2024

Dampfinchen commented Apr 2, 2024 • edited Loading

DGdev91 commented Apr 2, 2024

JayQuimby commented Apr 3, 2024

feuler commented Apr 3, 2024 • edited Loading

stratte89 commented Apr 3, 2024

rmortes commented Apr 4, 2024

stratte89 commented Apr 4, 2024 • edited Loading

rbren commented Apr 4, 2024

rbren commented Apr 9, 2024

enyst commented Apr 12, 2024

larkinwc commented Apr 23, 2024

rbren commented May 3, 2024

Dampfinchen commented Apr 2, 2024 •

edited

Loading

feuler commented Apr 3, 2024 •

edited

Loading

stratte89 commented Apr 4, 2024 •

edited

Loading