requirement: Implement `--fix` for `RequirementSource` #225

tetsuo-cpp · 2022-01-19T12:07:55Z

Closes #214

tetsuo-cpp · 2022-01-19T12:21:55Z

@woodruffw @di
I wanted to run this by you both before going too far with this. At the moment, I use pip_api.parse_requirements and then rewrite the file line by line, replacing requirements that correspond to the fixed dependency.

This could be optimised to do all the fixes in-memory and then write the file out at the end but it doesn't fit the fix interface and I don't anticipate this being a serious performance issue, so I'm thinking we can do it like this initially.

One problem with this approach is that we lose comments in the requirement file since they don't get parsed by parse_requirements. I'm thinking that the solution might be to add a keep_comments optional argument to pip_api.parse_requirements that returns strings for comments also.
The other issue is that the fact that we support multiple requirements files makes this harder to get right (apply fixes transactionally). With a single file, this is easy as we just need to construct the new file with the fixes and then we can move it into place to replace the old one. Multiple files throw a spanner in the works since, if one of our moves fails, we'll have some requirement files with fixes applied and some without. Therefore, we'll have to keep the old ones around and then have some recovery procedure to undo the moves in the case of a failure. If we want to keep things simple and avoid dealing with this, we could also just say that --fix only supports a single -r argument.

What are your thoughts on these two points?

woodruffw · 2022-01-19T14:05:57Z

This could be optimised to do all the fixes in-memory and then write the file out at the end but it doesn't fit the fix interface and I don't anticipate this being a serious performance issue, so I'm thinking we can do it like this initially.

Yeah, this sounds good to me. Our execution is probably overwhelmingly dominated by network round-trips and pip subprocesses, so this isn't something we need to immediately optimize.

One problem with this approach is that we lose comments in the requirement file since they don't get parsed by parse_requirements. I'm thinking that the solution might be to add a keep_comments optional argument to pip_api.parse_requirements that returns strings for comments also.

@di should probably have the final say on this, but IMO we should skip on comment handling for the MVP. I do think we should ultimately preserve comments, but we might need to rethink the parse_requirements API to do so, since we'll need to support both freestanding and inline comments:

# foo
blah==1.2.3 # bar

The other issue is that the fact that we support multiple requirements files makes this harder to get right (apply fixes transactionally).

Hmm, this is a good point. Maybe this suggests that we redesign our initial pip_api._fix APIs to be more transactional to begin with, since we have a similar "partial fix" problem even in non--r mode: we can imagine that upgrading foo 1.2.3 -> 1.2.4 is a correct fix, but that pip install -U foo==1.2.4 fails for whatever reason. If that happens amidst other successful fixes, we perhaps should fail the entire --fix (and revert any upgrades already performed).

If we go the above route, we'll probably want to refactor DependencySource.fix into a non-abstract method like this:

def fix(self, fix_versions: Iterator[ResolvedFixVersion]):
    with self.fix_transaction():
        for fix in fix_versions:
            self.fix_one(fix)

...where fix_one is the previous body of DependencySource.fix for each implementation, and fix_transaction is another abstract method that provides a transactional context manager.

OTOH, that's a lot of complexity, and maybe --fix should really just be "buyer beware": there's no guarantee that the --fix will succeed or leave multiple requirements inputs in a consistent state, only that failures are accompanied by error handling on our side (a nonzero exit). With that approach, it's the user's responsibility to rollback any changes made.

tetsuo-cpp · 2022-01-20T00:17:46Z

Hmm, this is a good point. Maybe this suggests that we redesign our initial pip_api._fix APIs to be more transactional to begin with, since we have a similar "partial fix" problem even in non--r mode: we can imagine that upgrading foo 1.2.3 -> 1.2.4 is a correct fix, but that pip install -U foo==1.2.4 fails for whatever reason. If that happens amidst other successful fixes, we perhaps should fail the entire --fix (and revert any upgrades already performed).

Personally, I think this is ok. Our summary specifically states that we fixed X vulnerabilities out of Y packages (compared with the Z vulnerabilities found in the audit), which indicates that we will fix everything we can but we don't guarantee we can fix everything. At the moment, if the package update fails, we yield a SkippedFixVersion and pip-audit behaves similarly to if we can't resolve a fix version.

But I don't think it's ok to partially apply a single fix to some requirements files but not others.

tetsuo-cpp · 2022-01-20T03:41:41Z

How does this look?

I've attempted to deal with the case where to fail to write to one of the requirements files (or some other error gets thrown). I start by copying all requirements files to temporary files, then attempt to rewrite the requirements files. If we're successful, the temporary files will automatically get cleaned up. If not, I restore the requirements files by copying the temporary files back to the original location.

woodruffw · 2022-01-20T15:27:51Z

How does this look?

This overall structure looks good to me! This approach also looks good, and I think it's what we'll want to encapsulate with the "fix transaction" interfaces above. But that can be a task for a follow-up PR 🙂

pip_audit/_dependency_source/requirement.py

Co-authored-by: William Woodruff <[email protected]>

tetsuo-cpp · 2022-01-21T06:20:28Z

I'm still working on getting this tested. Mocking out things so the tests don't mess around with the filesystem has been a bit of a pain.

This overall structure looks good to me! This approach also looks good, and I think it's what we'll want to encapsulate with the "fix transaction" interfaces above. But that can be a task for a follow-up PR 🙂

Hmm, I'm not following. The way I see it, the "fix transaction" interface is only necessary if we're intending to roll back fixes for other vulnerabilities in the case that one of them failed. Is that what we want? The recovery scheme I'm using here is for when a single vulnerability exists in multiple requirements files and we didn't manage to fix it in all files (as opposed to rolling back successful fixes for vulnerabilities across all files because a vulnerability further down the list couldn't be fixed).

test/dependency_source/test_requirement.py

woodruffw · 2022-01-21T14:48:53Z

The way I see it, the "fix transaction" interface is only necessary if we're intending to roll back fixes for other vulnerabilities in the case that one of them failed. Is that what we want?

Whoops, this was a misunderstanding on my part. Yeah, we don't need the "transaction" interface in this case.

test/dependency_source/test_requirement.py

test/conftest.py

woodruffw

LGTM overall! I left some small nits that I can take care of.

di · 2022-01-24T17:26:41Z

I'm still working on getting this tested. Mocking out things so the tests don't mess around with the filesystem has been a bit of a pain.

There might be some useful fixtures for test isolation in https://github.com/di/pip-api/blob/dec3a5e30c911b794763483ed985960a6732a40e/tests/conftest.py you could use to improve this.

tetsuo-cpp added 5 commits January 19, 2022 15:48

pip, _fix: Add missing CLI feedback

c54a170

_cli, _fix: Add debug logging when fixes are skipped

36eb437

requirement: Implement --fix for RequirementSource

814400b

requirement: Preserve invalid lines

7f4cfec

requirement: Replace existing requirements files

1449aac

tetsuo-cpp marked this pull request as draft January 19, 2022 12:08

Base automatically changed from alex/pip-fix-logging to main January 19, 2022 14:06

tetsuo-cpp added 2 commits January 20, 2022 11:19

Merge branch 'main' into alex/requirements-fix

ef8e78b

requirement: Try to recover requirements files when a fix fails

3a274aa

tetsuo-cpp requested review from woodruffw and di January 20, 2022 03:47

woodruffw reviewed Jan 20, 2022

View reviewed changes

pip_audit/_dependency_source/requirement.py Outdated Show resolved Hide resolved

woodruffw reviewed Jan 20, 2022

View reviewed changes

pip_audit/_dependency_source/requirement.py Outdated Show resolved Hide resolved

woodruffw reviewed Jan 20, 2022

View reviewed changes

pip_audit/_dependency_source/requirement.py Outdated Show resolved Hide resolved

tetsuo-cpp and others added 3 commits January 21, 2022 10:41

Apply suggestions from code review

81fdfad

Co-authored-by: William Woodruff <[email protected]>

requirement: Use os.replace instead of copying

256cf25

requirement: Use Path.open

6b50028

test: Create basic test for RequirementSource.fix

03a279e

tetsuo-cpp commented Jan 21, 2022

View reviewed changes

test/dependency_source/test_requirement.py Outdated Show resolved Hide resolved

tetsuo-cpp added 2 commits January 24, 2022 15:19

test: Add more tests for fixing dependencies in requirements files

fdc4fdc

test: Add test for requirements files with comments

2c1993b

tetsuo-cpp commented Jan 24, 2022

View reviewed changes

test/dependency_source/test_requirement.py Outdated Show resolved Hide resolved

tetsuo-cpp marked this pull request as ready for review January 24, 2022 04:25

tetsuo-cpp requested a review from woodruffw January 24, 2022 04:25

tetsuo-cpp added 2 commits January 24, 2022 18:10

CHANGELOG: Update changelog

b51dbf1

test: Fix wording

319a0de

woodruffw reviewed Jan 24, 2022

View reviewed changes

test/dependency_source/test_requirement.py Show resolved Hide resolved

woodruffw reviewed Jan 24, 2022

View reviewed changes

test/conftest.py Outdated Show resolved Hide resolved

woodruffw approved these changes Jan 24, 2022

View reviewed changes

test: refactor

8f89fbb

woodruffw mentioned this pull request Jan 24, 2022

Preserve comments and pre-evaluated markers when parsing requirements di/pip-api#120

Open

woodruffw merged commit 3b42427 into main Jan 24, 2022

woodruffw deleted the alex/requirements-fix branch January 24, 2022 16:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

requirement: Implement `--fix` for `RequirementSource` #225

requirement: Implement `--fix` for `RequirementSource` #225

tetsuo-cpp commented Jan 19, 2022

tetsuo-cpp commented Jan 19, 2022 •

edited

Loading

woodruffw commented Jan 19, 2022

tetsuo-cpp commented Jan 20, 2022

tetsuo-cpp commented Jan 20, 2022

woodruffw commented Jan 20, 2022

tetsuo-cpp commented Jan 21, 2022

woodruffw commented Jan 21, 2022

woodruffw left a comment

di commented Jan 24, 2022

requirement: Implement --fix for RequirementSource #225

requirement: Implement --fix for RequirementSource #225

Conversation

tetsuo-cpp commented Jan 19, 2022

tetsuo-cpp commented Jan 19, 2022 • edited Loading

woodruffw commented Jan 19, 2022

tetsuo-cpp commented Jan 20, 2022

tetsuo-cpp commented Jan 20, 2022

woodruffw commented Jan 20, 2022

tetsuo-cpp commented Jan 21, 2022

woodruffw commented Jan 21, 2022

woodruffw left a comment

Choose a reason for hiding this comment

di commented Jan 24, 2022

requirement: Implement `--fix` for `RequirementSource` #225

requirement: Implement `--fix` for `RequirementSource` #225

tetsuo-cpp commented Jan 19, 2022 •

edited

Loading