From 47737a5f8786470849d8554b7ae41438cd491b17 Mon Sep 17 00:00:00 2001
From: Vivian Fang
Date: Mon, 6 Nov 2023 13:08:12 -0800
Subject: [PATCH] Add autogen+localllm docs (#335)

Co-authored-by: Jirito0
---
 docs/autogen.md | 96 +++++++++++++++++++++++++++++++++++++++++++++++++
 mkdocs.yml      |  2 ++
 2 files changed, 98 insertions(+)
 create mode 100644 docs/autogen.md

diff --git a/docs/autogen.md b/docs/autogen.md
new file mode 100644
index 0000000000..05f80a5196
--- /dev/null
+++ b/docs/autogen.md
@@ -0,0 +1,96 @@
## MemGPT + Autogen
[examples/agent_groupchat.py](https://github.com/cpacker/MemGPT/blob/main/memgpt/autogen/examples/agent_groupchat.py) contains an example of a groupchat where one of the agents is powered by MemGPT.

If you are using OpenAI, you can also run it using the [example notebook](https://github.com/cpacker/MemGPT/blob/main/memgpt/autogen/examples/memgpt_coder_autogen.ipynb).

In the next section, we detail how to set up MemGPT + AutoGen to run with local LLMs.


## Connect AutoGen + MemGPT to non-OpenAI LLMs

To get MemGPT to work with a local LLM, you need the LLM running on a server that accepts API requests. (For the non-MemGPT AutoGen agents, you will also need to enable web UI's [openai extension](https://github.com/oobabooga/text-generation-webui/tree/main/extensions/openai) -- this is covered in step 4 below.)

For the purposes of this example, we're going to serve (host) the LLMs using [oobabooga web UI](https://github.com/oobabooga/text-generation-webui#starting-the-web-ui), but you can use something else if you prefer! This also assumes you're running web UI locally -- if you're running on e.g. Runpod, you'll want to follow Runpod-specific instructions instead (for example, use [TheBloke's one-click UI and API](https://github.com/TheBlokeAI/dockerLLM/blob/main/README_Runpod_LocalLLMsUIandAPI.md)).

### Part 1: Get web UI working
1. Install oobabooga web UI using the instructions [here](https://github.com/oobabooga/text-generation-webui#starting-the-web-ui)
2. Once installed, launch the web server with `python server.py`
3. Navigate to the web app (if local, this is probably [`http://127.0.0.1:7860`](http://127.0.0.1:7860)), select the model you want to use, adjust your GPU and CPU memory settings, and click "load"
4. After the model has loaded successfully, navigate to the "Session" tab, select and enable the "openai" extension, then click "Apply flags/extensions and restart". The web UI will restart.
5. Once the web UI has reloaded, double-check that your selected model and parameters are still set -- if not, re-select your model, re-apply your settings, and click "load" once more.
6. Assuming steps 1-5 went correctly, the LLM is now properly hosted on a port you can point MemGPT to (a quick sanity check is sketched below).
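Before moving on to Part 2, you can verify that both endpoints are actually serving. The following is a minimal sketch of ours, not a step from the official flow: it assumes the default ports, that web UI's non-OpenAI API is on port 5000 and the openai extension on port 5001, and that the routes match late-2023 web UI defaults (they may differ across versions).

```python
# Quick sanity check that both web UI APIs are reachable.
# Ports and routes here are assumptions based on web UI defaults.
import requests

# MemGPT talks to web UI's non-OpenAI API (default port 5000)
r = requests.get("http://127.0.0.1:5000/api/v1/model")
print("web UI API:", r.json())  # should report the currently loaded model

# Non-MemGPT AutoGen agents talk to the openai extension (default port 5001)
r = requests.get("http://127.0.0.1:5001/v1/models")
print("openai extension:", r.json())  # should list available models
```

If either request fails, go back to Part 1 and re-check that the model is loaded and the "openai" extension is enabled.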
### Part 2: Get MemGPT working

1. In the terminal where you run MemGPT, set the following environment variables (pick the pair matching your OS and where web UI is running):

***(Running WebUI locally)***

For macOS:
```sh
# the default port will be 5000
export OPENAI_API_BASE=http://127.0.0.1:5000
export BACKEND_TYPE=webui
```

For Windows (PowerShell, running WebUI locally):
```sh
$env:OPENAI_API_BASE = "http://127.0.0.1:5000"
$env:BACKEND_TYPE = "webui"
```

***(Running WebUI on Runpod)***

For macOS:
```sh
export OPENAI_API_BASE=https://yourpodaddresshere-5000.proxy.runpod.net
export BACKEND_TYPE=webui
```

For Windows (PowerShell):
```sh
$env:OPENAI_API_BASE = "https://yourpodaddresshere-5000.proxy.runpod.net"
$env:BACKEND_TYPE = "webui"
```

### Important Notes
- When exporting/setting the environment variables, do NOT include `/v1` as part of the address -- MemGPT automatically appends `/v1` to the address.
  - For non-MemGPT AutoGen agents, the [config](https://github.com/cpacker/MemGPT/blob/main/memgpt/autogen/examples/agent_groupchat.py#L38) *should* specify `/v1` in the address.

- Make sure you use port 5000 (unless configured otherwise) when exporting the environment variables. MemGPT uses web UI's non-OpenAI API, which listens on port 5000 by default.

- In the following steps, you will finish configuring AutoGen to work with MemGPT + open LLMs. The `config_list` there points at a localhost address that includes `/v1` and uses port 5001 (not 5000); keep both exactly as written. That address is separate from the MemGPT `OPENAI_API_BASE` address you exported above: non-MemGPT AutoGen agents connect to the openai extension on port 5001 and require the `/v1` suffix.

WebUI exposes many parameters that can dramatically change LLM outputs; to change them, modify the [WebUI settings file](https://github.com/cpacker/MemGPT/blob/main/memgpt/local_llm/webui/settings.py).

⁉️ If you have problems getting WebUI set up, please use the [official web UI repo for support](https://github.com/oobabooga/text-generation-webui)! There will be many more answered questions about web UI there than on the MemGPT repo.

### Example groupchat
Going back to the example we first mentioned, [examples/agent_groupchat.py](https://github.com/cpacker/MemGPT/blob/main/memgpt/autogen/examples/agent_groupchat.py) contains an example of a groupchat where one of the agents is powered by MemGPT.

To run this example on a local LLM, go to lines 32-55 in [examples/agent_groupchat.py](https://github.com/cpacker/MemGPT/blob/main/memgpt/autogen/examples/agent_groupchat.py) and fill in the config lists with your local LLM's deployment details. For example, if you are using web UI, it will look something like this:

```python
config_list = [
    {
        "model": "dolphin-2.1-mistral-7b",  # this indicates the MODEL, not the WRAPPER (no concept of wrappers for AutoGen)
        "api_base": "http://127.0.0.1:5001/v1",
        "api_key": "NULL",  # this is a placeholder
        "api_type": "open_ai",
    },
]
config_list_memgpt = [
    {
        "model": "airoboros-l2-70b-2.1",  # this specifies the WRAPPER MemGPT will use, not the MODEL
    },
]
```
`config_list` is used by non-MemGPT agents, which expect an OpenAI-compatible API.

`config_list_memgpt` is used by MemGPT agents. Currently, MemGPT interfaces with the LLM backend via the `OPENAI_API_BASE` and `BACKEND_TYPE` environment variables described above. Note that MemGPT does not use the OpenAI-compatible API (it uses web UI's direct API).
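To make the split concrete, here is a minimal sketch of ours (not from the repo) showing how the two addresses coexist in one script, assuming web UI on the default ports. Setting the variables with `os.environ` should have the same effect as exporting them in the shell, provided it happens before MemGPT reads them.

```python
# A minimal sketch (assumes web UI on default ports 5000/5001) showing how the
# MemGPT environment variables and the AutoGen config_list coexist.
import os

# MemGPT agents: web UI's direct (non-OpenAI) API on port 5000, no /v1 suffix
# (MemGPT appends /v1 itself where needed)
os.environ["OPENAI_API_BASE"] = "http://127.0.0.1:5000"
os.environ["BACKEND_TYPE"] = "webui"

# Non-MemGPT AutoGen agents: the openai extension on port 5001, /v1 required
config_list = [
    {
        "model": "dolphin-2.1-mistral-7b",  # whichever model you loaded in web UI
        "api_base": "http://127.0.0.1:5001/v1",
        "api_key": "NULL",  # placeholder
        "api_type": "open_ai",
    },
]
```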
If you're using WebUI and want to run the non-MemGPT agents with a local LLM instead of OpenAI, enable the [openai extension](https://github.com/oobabooga/text-generation-webui/tree/main/extensions/openai) and point `config_list`'s `api_base` to the appropriate URL (usually port 5001).
Then, for MemGPT agents, export `OPENAI_API_BASE` and `BACKEND_TYPE` as described in [Local LLM support](../local_llm) (usually port 5000).

diff --git a/mkdocs.yml b/mkdocs.yml
index 47e43cc563..f1f1712c22 100644
--- a/mkdocs.yml
+++ b/mkdocs.yml
@@ -22,6 +22,8 @@ nav:
     - 'llama.cpp': llamacpp.md
     - 'koboldcpp': koboldcpp.md
     - 'Troubleshooting': local_llm_faq.md
+  - 'Integrations':
+    - 'Autogen': autogen.md
   - 'Advanced':
     - 'Configuring storage backends': storage.md
     - 'Adding support for new LLMs': adding_wrappers.md