
Cleanup code context and code features #401

Merged
11 commits merged into main from cleanup-code-context-and-code-features on Dec 20, 2023

Conversation

@PCSwingle (Member) commented Dec 16, 2023

TODO:

  • Confirm that embeddings are still cached (I think they should be, but it's best to check).
  • Figure out how the LLM filter should handle user-included files.

@PCSwingle marked this pull request as ready for review on December 19, 2023 20:11
@PCSwingle (Member, Author):

The LLM filter no longer knows which files the user included; how should we give it this information? I think just giving it all of the file names would work well, but let me know your ideas.
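For illustration, a rough sketch of the "just give it all of the file names" idea; the function and parameter names here are assumptions, not Mentat's actual LLMFeatureFilter API:

# Hypothetical sketch: include the user's explicitly included paths in the
# filter prompt so the model knows which files were asked for directly.
def build_filter_prompt(user_prompt, features, user_included_paths):
    included = "\n".join(str(path) for path in sorted(user_included_paths))
    candidates = "\n".join(str(feature) for feature in features)
    return (
        f"User request:\n{user_prompt}\n\n"
        f"Files the user explicitly included:\n{included}\n\n"
        f"Candidate features to filter:\n{candidates}"
    )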

@granawkins (Member) left a comment:

It's a great PR, look at those numbers! +507 −732
In hindsight I really overdid it with the caching 😅
Lots of good refactors. I left a few suggestions, take em or leave em.

auto_tokens = min(get_max_tokens() - tokens_used, config.auto_tokens)

# Get auto included features
if config.auto_context and prompt:
Member:

Suggested change:
- if config.auto_context and prompt:
+ if config.auto_context and prompt and auto_tokens > 0:

Member Author (@PCSwingle):

In my persistent-context PR I actually merge auto_context and auto_tokens into auto_context_tokens, and I check there that it's more than 0, so this change is pretty much already made!
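A tiny sketch of what that merged setting could look like, following the snippet above (auto_context_tokens as described in the comment; everything else is unchanged):

# Hypothetical: a single auto_context_tokens setting replaces
# config.auto_context and config.auto_tokens; 0 disables auto-context.
auto_tokens = min(get_max_tokens() - tokens_used, config.auto_context_tokens)
if config.auto_context_tokens > 0 and prompt:
    ...  # get auto-included features, using at most auto_tokens tokens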

(Resolved review threads on mentat/code_context.py and mentat/llm_api_handler.py.)

async def get_code_message(
    self,
    prompt_tokens: int,
Member:

As I understand it, the system ultimately goes back to model.config. This will make it difficult to use with other models/scenarios, like what we did / plan to do with LLMFeatureFilter. Also, using max_tokens seems like a cleaner interface for the function? Just "give me up to N tokens of context".

Member:

tbh our LLMFeatureFilter should have a test to catch that - making a note

Member Author (@PCSwingle):

I changed it to prompt_tokens instead of max_tokens because we don't actually tell it to give us up to N tokens of context; we tell it to include all of the user's included files (since we no longer truncate them) and to give us up to auto_tokens of automatic context (as long as it fits). The prompt tokens are just so we can check whether there are enough tokens for the user-included files, and so we only add as many auto-context files as fit in context. This makes a lot more sense to me, since we aren't truncating user-included files anymore.
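As a rough sketch of the budget logic described here (the helper names, count_tokens, and the exception are assumptions for illustration only, not Mentat's actual methods):

async def get_code_message(self, prompt_tokens: int, prompt: str) -> str:
    max_tokens = get_max_tokens()  # model context limit (assumed helper)
    include_message = self.get_include_files_message()  # user-included files, never truncated (assumed)
    include_tokens = count_tokens(include_message)  # assumed helper

    if prompt_tokens + include_tokens > max_tokens:
        # Not even the prompt plus the user-included files fit
        raise ContextSizeInsufficient()  # assumed exception name

    # Automatic context only gets whatever room is left, capped by config
    remaining = max_tokens - prompt_tokens - include_tokens
    auto_tokens = min(remaining, config.auto_tokens)
    auto_message = await self.get_auto_features_message(prompt, auto_tokens)  # assumed helper
    return include_message + "\n" + auto_message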

Member Author (@PCSwingle):

What do you mean by difficult to use with different models/scenarios? I'm not sure I understand.

(Resolved review thread on mentat/code_context.py.)
Comment on lines +166 to +169
if (
    section.start >= self.interval.start
    and section.start < self.interval.end
):
Member:

Suggested change:
- if (
-     section.start >= self.interval.start
-     and section.start < self.interval.end
- ):
+ if self.interval.intersects(section):

DiffAnnotations inherit from Interval, so this works. Also, the earlier check wouldn't catch an annotation spanning multiple intervals. Still not ideal, but it will at least guarantee the annotation is there.

Member Author (@PCSwingle):

This isn't actually an intersection check; it checks that the diff annotation starts within the interval. If an annotation merely ends inside the interval, we don't want to include it, because (a) it could then be included in multiple intervals, and (b) the start of the diff annotation wouldn't make much sense, since those line numbers aren't in this interval.
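To make the distinction concrete, a self-contained sketch (the Interval class here is a stand-in, and contains_start is a hypothetical name for the check the current code performs):

from dataclasses import dataclass

@dataclass
class Interval:
    start: int
    end: int

    def intersects(self, other: "Interval") -> bool:
        # True if the half-open ranges [start, end) overlap at all
        return self.start < other.end and other.start < self.end

    def contains_start(self, other: "Interval") -> bool:
        # True only if the other range *begins* inside this one,
        # which is what the existing check relies on
        return self.start <= other.start < self.end

# An annotation spanning lines 5-12 intersects the interval 10-20,
# but it does not start inside it, so it would not be attached here.
annotation = Interval(5, 12)
interval = Interval(10, 20)
assert interval.intersects(annotation)
assert not interval.contains_start(annotation)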

(Resolved review threads on mentat/code_feature.py, mentat/cost_tracker.py, and mentat/code_context.py.)
@biobootloader (Member) left a comment:

hmm, when I run with auto context on, it now takes significantly longer to embed my query / get embeddings. It hangs for 5+ seconds before showing the loading bar for embeddings.


@PCSwingle (Member, Author) commented Dec 20, 2023:

> hmm, when I run with auto context on it now takes significantly longer to embed my query / get embeddings. It hangs for 5+ seconds before showing the loading bar for embeddings.

I managed to track it down: when we get embeddings, we were calling get_code_message on every code feature in the project multiple times, which reads the entire file for every feature, so we re-read the same file multiple times whenever a file has several features. For now, I'll bring back the code feature caching on get_code_message. Later on, I want to add caching where it should actually matter, in code_file_manager, so that individual files are cached and only re-read when their last-modified time changes, and also reduce the number of times we call get_code_message on each feature when getting embeddings.
Edit: Actually, after re-adding code feature caching it didn't change anything. After looking into it more, I realized I had accidentally started calling git diff on every single file (not just the ones with git diffs)! This also makes me think we might want to cache git diffs (and again only update them when the file is updated), since this is by far the most expensive operation we do.
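A minimal sketch of the mtime-keyed file cache mentioned above; CodeFileManager's real interface may differ, so treat the method and attribute names as assumptions:

from pathlib import Path

class CodeFileManager:
    def __init__(self) -> None:
        # path -> (last-modified time, cached lines)
        self._file_cache: dict[Path, tuple[float, list[str]]] = {}

    def read_file(self, path: Path) -> list[str]:
        mtime = path.stat().st_mtime
        cached = self._file_cache.get(path)
        if cached is not None and cached[0] == mtime:
            # Unchanged since the last read: reuse the cached contents
            return cached[1]
        lines = path.read_text().splitlines()
        self._file_cache[path] = (mtime, lines)
        return lines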

@PCSwingle force-pushed the cleanup-code-context-and-code-features branch from 99e4ab9 to b5a882b on December 20, 2023 20:29
@biobootloader (Member):

> Actually, after re-adding code feature caching it didn't change anything; after looking into it more I realized I started calling git diff on every single file (not just the ones with git diffs) by accident! This also makes me think we might want to cache git diffs (and again only update them when the file is updated) since this is by far the most expensive operation we do.

Nice work tracking it down! Much faster now. I guess this only matters if there are a lot of files with changes, but that should be super rare. I guess worst case someone is using Mentat to help them review a big PR that changes dozens of files?

Perhaps we could call git diff just once and get diff info for all changed files from that, instead of calling it on each file?
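A rough sketch of that single-call approach, assuming plain "git diff <target>" output and naive per-file splitting (the function name and parsing are illustrative and ignore edge cases like renames or paths with spaces):

import subprocess
from pathlib import Path

def get_diffs_by_file(repo_root: Path, target: str = "HEAD") -> dict[Path, str]:
    # One git diff call for the whole repository, split into per-file chunks.
    output = subprocess.run(
        ["git", "diff", target],
        cwd=repo_root,
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    diffs: dict[Path, str] = {}
    for chunk in output.split("diff --git ")[1:]:
        header = chunk.splitlines()[0]  # looks like "a/<path> b/<path>"
        rel_path = header.split(" b/")[-1]
        diffs[repo_root / rel_path] = "diff --git " + chunk
    return diffs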

@PCSwingle merged commit 4002e18 into main on Dec 20, 2023. 16 checks passed.
Merging this pull request may close the following issue:

/context doesn't update after calling /include /exclude