
AI Dev Team #819
Open: wants to merge 83 commits into master
Conversation

@ElishaKay (Collaborator) commented Sep 2, 2024

Setup:

Step 1: Generate a Github Personal Access Token

For the GITHUB_TOKEN (Personal Access Token) value, you'll need to generate a Personal Access Token:
1. Click on your profile picture in the top-right corner of GitHub and select "Settings".
2. In the left sidebar, click on "Developer settings".
3. Click on "Personal access tokens", then "Tokens (classic)".
4. Click "Generate new token" and select the appropriate scopes for your needs.
5. Copy the generated token.
6. Paste the token you generated into the value field of the new secret.
7. Click "Add secret" to save it.

Step 2: Set these environment variables:

OPENAI_API_KEY={Your OpenAI API Key here}
TAVILY_API_KEY={Your Tavily API Key here}

PGVECTOR_CONNECTION_STRING=postgresql://username:password...
GITHUB_TOKEN=

Step 3:

pip install -r multi_agents/requirements.txt
python -m multi_agents.dev_team.main
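
A minimal pre-flight check can catch missing keys before the agents start. This is just a sketch, not part of this PR's code: it assumes python-dotenv is installed and that the Step 2 values live in a .env file.

```python
import os
import sys

from dotenv import load_dotenv  # pip install python-dotenv

REQUIRED_VARS = [
    "OPENAI_API_KEY",
    "TAVILY_API_KEY",
    "PGVECTOR_CONNECTION_STRING",
    "GITHUB_TOKEN",
]

load_dotenv()  # copy values from .env into os.environ

missing = [name for name in REQUIRED_VARS if not os.getenv(name)]
if missing:
    sys.exit(f"Missing environment variables: {', '.join(missing)}")

print("Environment looks good - run: python -m multi_agents.dev_team.main")
```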

Status: Running to completion

@assafelovic (Owner) commented:

This is sick @ElishaKay, have to admit I'm hooked on this!

@assafelovic (Owner) commented:

Just please try to reuse any already existing code, I see a lot of duplicates!

@ElishaKay (Collaborator, Author) commented Sep 10, 2024

Status report:
The bot supports 1-on-1 chats.
Within Discord, the user can trigger a modal by typing "/ask" in the chat and hitting "Enter".
Here is what the form looks like:

[Screenshot: the /ask modal form (Screen Shot 2024-09-10 at 10 12 08)]

ElishaKay changed the title from "AI Dev Team" to "🛑: AI Dev Team" on Sep 13, 2024
ElishaKay and others added 16 commits September 22, 2024 03:26
…js server should log errors without restarting
…nnels - also added a cool down logic so that it only advises about the /ask command every 30 minutes per channel
…me & embedding everything with metadata in gptr-compatible format
…ning to completion with relevant files fetched from gptr's __get_similar_content_by_query_with_vectorstore method
…tructure as long as PGVECTOR_CONNECTION_STRING & GITHUB_TOKEN env vars are set
@ElishaKay (Collaborator, Author) commented Sep 22, 2024

Status Report:

Flow:

Step 1: The GithubAgent is in charge of the first step: fetching the data from GitHub with an API tool & logging the directory structure.

He's also in charge of saving the GitHub repo within a LangChain VectorStore.

Step 2: The RepoAnalyzerAgent leverages the vector store containing the repo by running GPTResearcher like so:

    # Run GPTResearcher
    researcher = GPTResearcher(
        query=query,
        report_type="research_report",
        report_source="langchain_vectorstore",
        vector_store=vector_store,
    )
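
For context, here's roughly how that snippet is typically driven end to end. This is a sketch, not the PR's actual agent code: the stand-in FAISS store (which needs faiss-cpu) replaces the store the GithubAgent builds, and the sample documents and query are made up; conduct_research() and write_report() are the standard GPTResearcher calls.

```python
import asyncio

from gpt_researcher import GPTResearcher
from langchain_community.vectorstores import FAISS
from langchain_core.documents import Document
from langchain_openai import OpenAIEmbeddings

async def run_repo_report(query: str) -> str:
    # Stand-in vector store with a couple of fake repo files; the real flow
    # uses the store populated by the GithubAgent.
    docs = [
        Document(page_content="def main(): ...", metadata={"source": "multi_agents/dev_team/main.py"}),
        Document(page_content="# AI Dev Team", metadata={"source": "README.md"}),
    ]
    vector_store = FAISS.from_documents(docs, OpenAIEmbeddings())

    researcher = GPTResearcher(
        query=query,
        report_type="research_report",
        report_source="langchain_vectorstore",
        vector_store=vector_store,
    )
    await researcher.conduct_research()      # retrieves context from the vector store
    return await researcher.write_report()   # turns that context into the report

if __name__ == "__main__":
    print(asyncio.run(run_repo_report("How does the dev_team flow fetch the repo?")))
```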

Step 3: The WebSearchAgent takes the output of the RepoAnalyzerAgent & complements any insights from his analysis with info from the web. He runs GPTR like so:

    # Run GPTResearcher
    researcher = GPTResearcher(
        query=query,
        report_type="research_report",
        report_source="web",
    )
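
One plausible way to wire that complement step (a sketch only: the function name and query template are made up, while the GPTResearcher calls are the standard ones):

```python
import asyncio

from gpt_researcher import GPTResearcher

async def complement_with_web(repo_report: str) -> str:
    # Fold the repo findings into the web query so the web report fills gaps
    # rather than repeating what the RepoAnalyzerAgent already covered.
    query = (
        "Given this analysis of the repository, find external docs, issues or "
        "best practices that complement it:\n\n" + repo_report[:4000]
    )
    researcher = GPTResearcher(
        query=query,
        report_type="research_report",
        report_source="web",
    )
    await researcher.conduct_research()
    return await researcher.write_report()
```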

Step 4: The RubberDuckerAgent talks out loud about the game plan for answering the user. He's forced to talk through his reasoning based on the outputs of the RepoAnalyzerAgent & WebSearchAgent.

The GithubAgent now has the ability to parse & embed the entire GitHub repo contents in optimized LangChain Documents format. He passes those Documents to the VectorAgent, who saves them in the LangChain VectorStore.
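
Roughly what that parse-and-embed step could look like with PGVector (a sketch: it assumes the langchain-postgres package, the collection name and metadata keys are illustrative, and the connection string is the one from Step 2 expressed as a SQLAlchemy URL, e.g. postgresql+psycopg://...):

```python
import os

from langchain_core.documents import Document
from langchain_openai import OpenAIEmbeddings
from langchain_postgres import PGVector

def embed_repo(files: dict[str, str]) -> PGVector:
    """files maps repo paths to file contents, as fetched by the GithubAgent."""
    docs = [
        Document(page_content=content, metadata={"source": path})
        for path, content in files.items()
    ]
    vector_store = PGVector(
        embeddings=OpenAIEmbeddings(),
        collection_name="gptr_dev_team_repo",  # hypothetical collection name
        connection=os.environ["PGVECTOR_CONNECTION_STRING"],
    )
    vector_store.add_documents(docs)  # the slow, "takes a minute" part
    return vector_store
```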

That VectorStore is then passed into the GPTR report flow by the RepoAnalyzerAgent, like so:

    researcher = GPTResearcher(
        query=query,
        report_type="research_report",
        report_source="langchain_vectorstore",
        vector_store=vector_store,
    )

The resulting report is solid, but:

  • properly embedding the full github repo takes a minute or so

Areas for improvement:

  • leverage the same vector_store across reports & follow-up questions (i.e. don't re-embed the entire repo on every run)
  • add the WebSearch Agent into the mix
  • Save the file names from the results of the vector search within Langgraph State (to be leveraged by FileSearchAgent and/or RubberDuckerAgent)
  • Give the GithubAgent a method to calculate the delta between commits & only re-embed the files that changed since the latest commit on a given branch (see the sketch after this list)
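
As a sketch of the commit-delta idea from the last bullet (using GitHub's compare endpoint via requests; the helper name is hypothetical and error handling is minimal):

```python
import os

import requests

def changed_files(owner: str, repo: str, base_sha: str, head_sha: str) -> list[str]:
    """Return the file paths touched between two commits, so only those get re-embedded."""
    resp = requests.get(
        f"https://api.github.com/repos/{owner}/{repo}/compare/{base_sha}...{head_sha}",
        headers={"Authorization": f"Bearer {os.environ['GITHUB_TOKEN']}"},
        timeout=30,
    )
    resp.raise_for_status()
    # Note: GitHub truncates very large comparisons, so huge deltas may need paging.
    return [f["filename"] for f in resp.json().get("files", [])]

# e.g. re-embed only these paths instead of the whole repo:
# paths = changed_files("assafelovic", "gpt-researcher", last_embedded_sha, latest_sha)
```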


@ElishaKay (Collaborator, Author) commented Sep 22, 2024

Current flow

[Diagram: GPTR AI Dev Team Flow]

@assafelovic (Owner) commented:

@ElishaKay what's up brother, what's the status on this? Please also note we've massively refactored the code base, so there are some conflicts here.

…yncronous langchain vector store is passed into GPTR for async vector search - generated report is looking good
@ElishaKay (Collaborator, Author) commented Oct 6, 2024

@assafelovic

Status Update:

LangChain PGVector support is looking good for the saving & retrieving parts of the flow.

That will be an important step toward long-form documents that don't need to be re-embedded on every run, & seamless support for LangGraph Cloud.

Next steps are what's marked here in "Areas for improvement". I'll probably end up resolving the merge conflicts after those are implemented.
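
To illustrate the reuse idea (a sketch, not this PR's code: it reconnects to the hypothetical collection from the earlier sketch so nothing is re-embedded, then hands the store to GPTR for a follow-up question):

```python
import asyncio
import os

from gpt_researcher import GPTResearcher
from langchain_openai import OpenAIEmbeddings
from langchain_postgres import PGVector

async def follow_up(query: str) -> str:
    # Reconnect to the already-populated collection; no add_documents call,
    # so nothing gets re-embedded on this run.
    vector_store = PGVector(
        embeddings=OpenAIEmbeddings(),
        collection_name="gptr_dev_team_repo",
        connection=os.environ["PGVECTOR_CONNECTION_STRING"],
    )
    researcher = GPTResearcher(
        query=query,
        report_type="research_report",
        report_source="langchain_vectorstore",
        vector_store=vector_store,
    )
    await researcher.conduct_research()
    return await researcher.write_report()

if __name__ == "__main__":
    print(asyncio.run(follow_up("What changed in the dev_team flow since the last report?")))
```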

ElishaKay changed the title from "🛑: AI Dev Team" to "AI Dev Team" on Oct 13, 2024