Feat: Include relevant file names if any #2865
Conversation
I'd like this to get reviewed by other team member(s).
The refactored code looks good to me.
@SmartManoj did you run your PR with the LLM enabled in the integration tests to confirm the change doesn't cause an issue there?
As mentioned in the description, I haven't regenerated the tests yet. Tested directly in live versions.
Ok, please run the integration tests on your end then, too.
It failed because of the prompt change. I added that change only in the description image. Will regenerate the tests once it's approved.
Please have a look at this error; it is separate from the prompt change, I think. Did you try your live test with an empty workspace, too?
So you're ignoring line 450 in your error analysis, then?
Not sure about others' thoughts, but I am more inclined towards a simpler approach: explicitly letting the agent know in the prompt that the project is already set up. This way, we can minimize manual intervention in the prompt sent to the LLM, which is preferred to keep the agent more general, imo.
```
@@ -217,6 +218,16 @@ def _get_messages(self, state: State) -> list[dict[str, str]]:
            {'role': 'user', 'content': self.in_context_example},
        ]

        workspace_contents = ', '.join(list_files(config.workspace_base))
```
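For reference, a minimal sketch of what that hunk is doing: list the workspace and, if it is non-empty, fold the file names into the prompt. In OpenDevin, `list_files` and `config.workspace_base` are real; the stand-in helper and the exact hint wording below are assumptions for illustration only.

```python
import os

def list_files(base: str) -> list[str]:
    # Stand-in for OpenDevin's list_files helper (assumption): returns the
    # paths of files under the workspace root, relative to it.
    return [
        os.path.relpath(os.path.join(root, name), base)
        for root, _dirs, files in os.walk(base)
        for name in files
    ]

def workspace_hint(workspace_base: str) -> str:
    """Build the extra prompt text with the relevant file names, if any."""
    workspace_contents = ', '.join(list_files(workspace_base))
    if not workspace_contents:
        # Empty workspace: add nothing, per the empty-workspace concern above.
        return ''
    return f'Files already in the workspace: {workspace_contents}'
```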
This may also require mounting the workspace for this to work when running the swe-bench eval, similar to RepoMap before? CC: @xingyaoww
> mounting workspace to work

What does "work" mean here?
codeact_swe_agent.py is only used for swe-bench, right?
cc: @rezzie-rich
@xingyaoww A UI option for the user to choose whether or not to mount a workspace could be a simple solution. This way, it's useful when working with existing projects as well as for the swe-bench eval.
IMO, this PR should be tied to a vector-based 'repomap' so OD not only knows the file names of an existing project but also their contents, to effectively work on old or new projects. Also, IMHO, that should be a priority, because successful integration of it will make OD capable of working on and improving OD itself, making development supersonic.
@xingyaoww can you take a quick look at this PR again, and if this is not the correct approach, please close it.
Let's close this for now and wait to develop a more general solution (e.g., a search agent) |
@mamoodi, Could you create a new issue for this to track? |
Until then, one can use this PR if one needs this quickly, provided it is kept open. Also, if it is open, it will prevent duplicate PRs if one uses a feature like
@rezzie-rich, does the current commit match your expectations?
@SmartManoj You're missing the problems pointed out above and the alternative solution here: #2865 (review). Making embeddings only for this is not a good solution, when a
I'm not sure about the exact technical implementation. However, having all the file names from the project in memory can be useful: it can serve as a skeleton map for the search agents. The search agent can go through the project and create a small summary of each file, including the key context. This way, the search agent knows a complete project not by its entire source code but by the small summaries it creates per file name. That helps it stay aware of the complete project without maxing out the context limit, and when a task requires relevant source code from the project, it can navigate the file structure quickly and extract the actual code for completion. Summarizing the content of a 200-line code file can be done in a single line of natural language; it is a form of compression by contextual meaning.

Going through a project and making summaries will definitely use a lot of tokens, so it should be done through a UI option where users can choose to load a local project/GitHub repo into the knowledge base. By default, OD should assume it's a new project, which, I guess, will help with eval as well.
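To make that concrete, here is a rough sketch of such a skeleton map, assuming a hypothetical `summarize_file` that stands in for whatever LLM call the search agent would actually use (this is not OD code):

```python
import os

def summarize_file(path: str) -> str:
    # Hypothetical: a one-line natural-language summary produced by an LLM.
    # Here we just fall back to the file's first line as a placeholder.
    with open(path, encoding='utf-8', errors='ignore') as f:
        first_line = f.readline().strip()
    return first_line or '(empty file)'

def build_skeleton_map(workspace_base: str) -> dict[str, str]:
    """Map each file path to a one-line summary, forming the 'skeleton map'."""
    skeleton = {}
    for root, _dirs, files in os.walk(workspace_base):
        for name in files:
            path = os.path.join(root, name)
            skeleton[os.path.relpath(path, workspace_base)] = summarize_file(path)
    return skeleton
```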
@enyst, we might be talking about something similar. @SmartManoj I was pro-embedding until mentat-bot turned out useless. A real project can be of any size, and even with embedding, there's a risk of exceeding the context window. Since the context window is the amount of info the LLM can process at a time, it's better for the LLM to have a summarized whole context rather than an embedded/raw incomplete context. Embedding after summarization could have different potential, if it allows squeezing more context in without extending the context window.
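For what it's worth, a toy sketch of "embedding after summarization": embed the one-line summaries rather than the raw file contents, then retrieve files by similarity. `embed` below is a deliberately trivial stand-in for a real embedding model, just to keep the sketch self-contained:

```python
import math

def embed(text: str) -> list[float]:
    # Trivial stand-in for a real embedding model: a character-frequency
    # vector over a-z. Real systems would call an embedding API here.
    vec = [0.0] * 26
    for ch in text.lower():
        if 'a' <= ch <= 'z':
            vec[ord(ch) - ord('a')] += 1.0
    return vec

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def top_files(query: str, summaries: dict[str, str], k: int = 3) -> list[str]:
    """Rank files by similarity between the query and their summaries."""
    q = embed(query)
    ranked = sorted(summaries, key=lambda f: cosine(q, embed(summaries[f])), reverse=True)
    return ranked[:k]
```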
@enyst @tobitege, could you hide the comments about the integration tests? @rezzie-rich I think there is no need to send summaries of the workspace to the LLM, too. If the right files are given, it will work on them.
@SmartManoj, when you work on a project, you don't recall every line of code, do you? You just recall the file structure and an abstract idea of what is where, and since you know the abstract content of each file, you can navigate effectively. It's the same with an LLM, as it's designed after how the human mind works. From file names alone, you can't get all the info regarding a file. Codeact will access the raw content of a file when a task requires it, but it will only be able to navigate the project effectively and hit all the files related to a task with scattered methods if it has a summary of all the files. You can't fit a whole project under a 128k window, but you can definitely fit a complete summary of it. AI can generate what it needs to generate only if it has all the necessary information.
Here, the output is the relevant file names, right?
Why not? For example, if it needs info about an imported method, it can fetch that accordingly. Could you provide a small example of what these summaries would look like? Isn't the docstring of a file enough?
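As one possible answer to the docstring question, a small sketch that pulls a Python file's module docstring as its summary (an illustration, not part of this PR):

```python
import ast

def docstring_summary(path: str) -> str | None:
    """Return the module-level docstring of a Python file, if it has one."""
    with open(path, encoding='utf-8') as f:
        tree = ast.parse(f.read())
    # ast.get_docstring returns None when the module has no docstring.
    return ast.get_docstring(tree)
```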
Summary of
What is the problem that this fixes or functionality that this introduces? Does it fix any open issues?
Closes #2838; includes the relevant file names, if any, in step 1 only.