Refactor changes #94

PCSwingle · 2023-09-13T19:56:18Z

Refactor CodeChange into AbstractChange and ConcreteChange as outlined in #91

Edit: After talking with @biobootloader, we've decided to change the structure quite a bit. This is now the general layout:

Model output -> Parser (this parser can be injected in, so any parser will work) -> new FileEdit format that groups by file and only has 'Replacement' subedits -> User filter (make these changes, keep these ones, don't use these ones, etc.) -> Conflict Resolution (handled by FileEdit; no conflict resoution needed on a per-parser basis). The final list of FileEdits can be converted into a final file's code lines, which is written to the file by the CodeFileManager.

Instead of completely changing the Parser and CodeChange files, I've extracted them into OriginalFormatParser and OriginalFormatChange classes; while we don't necessarily need intermediate CodeChanges for every parser, in this case it helps us use our previous parsing code instead of re-writing it all.

TODO:

Make Parser interface and make it easily injectable
Add prompts connected to Parser
Add tests for FileEdit, especially conflict resolution
Fix all TODO's scattered around the code
Restructure tests between format specific tests and non-format specific tests

mentat/code_changes/abstract/abstract_change.py

jakethekoenig

I really like the direction here. Thanks!

mentat/conversation.py

mentat/parsers/block_parser.py

jakethekoenig · 2023-09-20T17:07:50Z

mentat/parsers/block_parser.py

+from mentat.prompts import block_parser_prompt
+
+
+class BlockParser(Parser):


It's called block because the changes are given in "blocks" like this?

@@start { "file": "core/script.py", "action": "insert", "insert-after-line": 3, "insert-before-line": 4 } @@code if name == "Bob": print("Nice to see you again!") @@end

Haha that is the reasoning; I don't like the name but I spent too long trying to think of a better name! Definitely let me know if you have any better ideas!

The name is fine. I was worried future formats could also be reasonably called "block" but thinking about it it seems like a lot of future formats we will try will be easier to name e.g. GitDiffParser, FullFileParser

Maybe FileBlockParser or PartialFileParser are better names? If we want to support inline edits in the future we can also have InlineFileParser to keep the naming consistent.

Hmm, I'm not sure I understand what the File means in this context? Most likely all of our formats will need the model to specify the file.

OriginalFormatParser would be unambiguous, but BlockParser seems fine for now.

The name really just matters if we end up keeping this format around, which we'd only do if it is the best for some models / scenarios. I don't expect that but if it does happen we'll find a name that distinguishes it well from the other formats

Hmm, I'm not sure I understand what the File means in this context?

"File" would mean a literal file on the file system. Technically GitDiffParser would be pulling from a file too :/

PCSwingle · 2023-09-20T19:19:02Z

This PR is almost ready to merge; I'm going to wait on #101 to merge and then I'll resolve conflicts.

jakethekoenig · 2023-09-20T19:30:24Z

This PR is almost ready to merge; I'm going to wait on #101 to merge and then I'll resolve conflicts.

Are the todos still in file_edit for a future PR?

PCSwingle · 2023-09-20T19:37:14Z

This PR is almost ready to merge; I'm going to wait on #101 to merge and then I'll resolve conflicts.

Are the todos still in file_edit for a future PR?

The Replacement 'owner' TODO yes; the current parsing format has a one to one 'change' to Replacement ratio, but eventually we might have a format that creates 2 replacements that we want to represent as one 'change' that the user can filter. It isn't a big addition though and just complicates a few things, so I figured we might as well add that when we need it. As for the conflict resolution TODO's, I personally don't think user input for resolving them is super important, but if you or @biobootloader think we should get that done before merging this PR I'm 100% ok with doing it now.

granawkins · 2023-09-20T23:54:24Z

mentat/code_file_manager.py

+            else:
+                stored_lines = []
+
+            if file_edit.rename_file_path is not None:


Might be better to use git mv <oldname> <newname> here, instead of adding and deleting. With the current method, GitHub would show a bunch of deletions and additions, instead of just a rename.

Hmm, I'm not entirely sure about this because that stages the rename; I don't think we want mentat to stage anything unless the user specifically asks us to; and of course when the user eventually stages the changes git will recognize the rename either way. It is nice having git recognize it immediately though. What do you think?

hmm yeah we don't want to stage. I believe git mv forces git to recognize it as a rename (which it sometimes fails, if similarity isn't high enough). Let's leave as is for now so we don't stage. Perhaps in the future we could have Mentat "remember" a rename and so when it commits for the user it'll inform git

mentat/conversation.py

granawkins · 2023-09-21T00:42:40Z

mentat/parsers/file_edit.py

+                    other.ending_line = replacement.starting_line
+                    other.starting_line = min(other.starting_line, other.ending_line)
+
+    def get_file_lines(self, file_lines: list[str]):


A more descriptive name might be helpful here, maybe like edit_file_lines?

Hmm, that makes it sound a bit like it's editing the file; how about get_new_file_lines? Or get_edited_file_lines?

get_updated_file_lines ?

pyrightconfig.json

mentat/parsers/block_parser.py

waydegg · 2023-09-20T23:59:42Z

mentat/parsers/block_parser.py

+from mentat.prompts import block_parser_prompt
+
+
+class BlockParser(Parser):


Maybe FileBlockParser or PartialFileParser are better names? If we want to support inline edits in the future we can also have InlineFileParser to keep the naming consistent.

mentat/parsers/file_edit.py

mentat/parsers/original_format/original_format_change.py

mentat/parsers/file_edit.py

mentat/parsers/block_parser.py

waydegg

Left some comments!

Didn't mean to request changes, just wanted to leave general feedback

biobootloader · 2023-09-21T15:49:04Z

Thanks @PCSwingle for this important and complicated refactor! And thanks everyone for the quick reviews 🚀

I'll take a closer look later today after you've had a chance to address comments, let me know when you're ready for that!

I'm excited to try some new output formats!

mentat/code_file_manager.py

…fore user input

biobootloader · 2023-09-21T23:08:32Z

mentat/parsers/file_edit.py

+                    # TODO: Ask user for conflict resolution
+                    other.ending_line = replacement.starting_line
+                    other.starting_line = min(other.starting_line, other.ending_line)


Let's print something here so the user knows what happened. Perhaps just say "these two changes conflicted, they've been auto-merged back to back, double check: {the changes back to back}". From the user's perspective they may have seen each change individually and thought everything looked good.

biobootloader · 2023-09-21T23:09:11Z

mentat/parsers/file_edit.py

+        self.replacements.sort(reverse=True)
+        for index, replacement in enumerate(self.replacements):
+            for other in self.replacements[index + 1 :]:
+                # TODO: another type of conflict (not caught here) would be both replacements being inserts on same line


so what happens in that scenario now? does Mentat crash? or does it just randomly put one first?

Right now it just puts the insertion first in the list first and the next insertion last

how about we treat that scenario the same as the other conflict scenario then, and alert / display to the user what's happening

biobootloader

Nice stuff! Feel free to merge after addressing comments

PCSwingle requested review from jakethekoenig, waydegg and biobootloader September 13, 2023 19:56

biobootloader reviewed Sep 13, 2023

View reviewed changes

mentat/code_changes/abstract/abstract_change.py Outdated Show resolved Hide resolved

PCSwingle added 6 commits September 18, 2023 11:10

add abstract change and logic to sort subchanges

057b3ef

use abstractchanges in codefilemanager instead of regular codechanges

2a7d89c

add abstractchange tests

63a65e5

rename Rename to FileUpdate

87c88be

add concrete change abstract class

49569aa

switch design away from abstractchange to fileedit

93ef90e

PCSwingle force-pushed the refactor-changes branch from 3f001f8 to 93ef90e Compare September 18, 2023 18:14

PCSwingle added 2 commits September 18, 2023 13:10

fix bugs and tests

a12deaa

remove Self since it isnt in 3.10

5134258

PCSwingle mentioned this pull request Sep 18, 2023

Refactor CodeChange code #91

Closed

add parser interface with stream and parse and system prompt

47cbe62

jakethekoenig reviewed Sep 20, 2023

View reviewed changes

PCSwingle added 2 commits September 20, 2023 10:28

add tests, slightly change replacements

b9018a0

add interrupt catching

35fb12e

PCSwingle marked this pull request as ready for review September 20, 2023 19:19

fix display bugs

60d6b82

PCSwingle force-pushed the refactor-changes branch from 3b74801 to 60d6b82 Compare September 20, 2023 22:12

PCSwingle added 2 commits September 20, 2023 15:35

Merge branch 'main' into refactor-changes

a067401

switch to attr.define

0709607

granawkins reviewed Sep 21, 2023

View reviewed changes

waydegg previously requested changes Sep 21, 2023

View reviewed changes

waydegg reviewed Sep 21, 2023

View reviewed changes

make OriginalFormatChangeAction more enum-like

7f56524

biobootloader reviewed Sep 21, 2023

View reviewed changes

mentat/code_file_manager.py Show resolved Hide resolved

add file checking to fileedit is_valid and check after parsing but be…

d6a6afb

…fore user input

biobootloader reviewed Sep 21, 2023

View reviewed changes

biobootloader approved these changes Sep 21, 2023

View reviewed changes

tell user when conflict occurs and how it was resolved

a5d9ca9

PCSwingle merged commit f791458 into main Sep 21, 2023
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor changes #94

Refactor changes #94

PCSwingle commented Sep 13, 2023 •

edited

Loading

jakethekoenig left a comment

jakethekoenig Sep 20, 2023

PCSwingle Sep 20, 2023

jakethekoenig Sep 20, 2023

waydegg Sep 20, 2023

PCSwingle Sep 21, 2023

biobootloader Sep 21, 2023

waydegg Sep 21, 2023

PCSwingle commented Sep 20, 2023

jakethekoenig commented Sep 20, 2023

PCSwingle commented Sep 20, 2023

granawkins Sep 20, 2023

PCSwingle Sep 21, 2023

biobootloader Sep 21, 2023

granawkins Sep 21, 2023

PCSwingle Sep 21, 2023

biobootloader Sep 21, 2023

waydegg Sep 20, 2023

waydegg left a comment

biobootloader commented Sep 21, 2023

biobootloader Sep 21, 2023

biobootloader Sep 21, 2023

PCSwingle Sep 21, 2023

biobootloader Sep 21, 2023

biobootloader left a comment

		from mentat.prompts import block_parser_prompt


		class BlockParser(Parser):

Refactor changes #94

Refactor changes #94

Conversation

PCSwingle commented Sep 13, 2023 • edited Loading

jakethekoenig left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

PCSwingle commented Sep 20, 2023

jakethekoenig commented Sep 20, 2023

PCSwingle commented Sep 20, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

waydegg left a comment

Choose a reason for hiding this comment

biobootloader commented Sep 21, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

biobootloader left a comment

Choose a reason for hiding this comment

PCSwingle commented Sep 13, 2023 •

edited

Loading