Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Account for maven bundle plugin and fix filename matching #2220

Merged
merged 4 commits into from
Oct 19, 2023

Conversation

wagoodman
Copy link
Contributor

@wagoodman wagoodman commented Oct 11, 2023

This PR makes a few changes to the java archive cataloger:

  • takes into account the maven bundle plugin, which is common tooling that we can key off of patterns from the Bundle-SymbolicName field to get a group ID and artifact ID. Then we can extract the artifact ID as the package name.
  • If a filename looks like a group ID + artifact ID, attempt to parse it as such and extract the artifact ID as the package name.
  • When attempting to find if a pom.xml matches with an existing package, allow for the artifact ID to be a suffix of the filename (not just the filename as a prefix of the artifact ID as it is today)

Blocked until the following are verified:

  • CPE generation logic can additionally consider the new sources of information when crafting a vendor (I think this is done, but need to double check)
  • purl generation is using the right group ID found (again, I think this is done, but need to check)
  • incorporate locally into grype and run against the quality gate -- are we better or worse off on the whole? Incorporate Syft java detection improvements grype#1555

Closes #2217
Fixes #2130

@wagoodman wagoodman force-pushed the gid-aid-for-maven-bundle-plugin branch from f2920df to 237cffc Compare October 12, 2023 14:25
wagoodman added a commit to anchore/grype that referenced this pull request Oct 12, 2023
Signed-off-by: Alex Goodman <[email protected]>
@wagoodman wagoodman marked this pull request as ready for review October 12, 2023 14:45
@wagoodman wagoodman requested a review from a team October 12, 2023 14:46
Copy link
Contributor

@spiffcs spiffcs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM - thanks for carving out this path to get a better artifact ID and the subsequent groupID changes. The examples were really easy to follow and the testing added is really clear.

I had a questions about the following:

When attempting to find if a pom.xml matches with an existing package, allow for the artifact ID to be a suffix of the filename (not just the filename as a prefix of the artifact ID as it is today)

Memory: I thought we had experimented with this in the past and found it surfaced too many incorrect pom.xml. Basically, suffix checks were too lose and the naming conventions in java were too broad to allow this kind of pom to be checked. Yes we might miss some things where this case exists, but limiting the data here was correct in that we were not getting more wrong hits than correct hits.

What was the trigger for making this change?

@@ -237,7 +237,7 @@ func (j *archiveParser) guessMainPackageNameAndVersionFromPomInfo() (name, versi
projects, _ := pomProjectByParentPath(j.archivePath, j.location, pomMatches)

for parentPath, propertiesObj := range properties {
if propertiesObj.ArtifactID != "" && j.fileInfo.name != "" && strings.HasPrefix(propertiesObj.ArtifactID, j.fileInfo.name) {
if artifactIDMatchesFilename(propertiesObj.ArtifactID, j.fileInfo.name) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1 to untangle this many thanks

@spiffcs
Copy link
Contributor

spiffcs commented Oct 19, 2023

Ahh - I'll fix the merge commit - apologies

Signed-off-by: Christopher Phillips <[email protected]>
@wagoodman
Copy link
Contributor Author

@spiffcs

What was the trigger for making this change?

Essentially it was from taking a closer look at some of the artifacts referenced in the attached issues.

I thought we had experimented with this in the past and found it surfaced too many incorrect pom.xml

I can't recall the specifics of what was experimented on from 3-ish months ago, but this seems different than the original implementation from about a year ago.

@wagoodman wagoodman merged commit 07f1304 into main Oct 19, 2023
9 checks passed
@wagoodman wagoodman deleted the gid-aid-for-maven-bundle-plugin branch October 19, 2023 21:57
wagoodman added a commit to anchore/grype that referenced this pull request Oct 20, 2023
* incorporate anchore/syft#2220

Signed-off-by: Alex Goodman <[email protected]>

* incorporate .net core improvements

Signed-off-by: Alex Goodman <[email protected]>

---------

Signed-off-by: Alex Goodman <[email protected]>
@spiffcs spiffcs added the enhancement New feature or request label Oct 31, 2023
GijsCalis pushed a commit to GijsCalis/syft that referenced this pull request Feb 19, 2024
* account for maven bundle plugin and fix filename matching

Signed-off-by: Alex Goodman <[email protected]>

* add in-repo jar tests based on metadata to cover anchore#2130

Signed-off-by: Alex Goodman <[email protected]>

* tests: fix test merge commit

Signed-off-by: Christopher Phillips <[email protected]>

---------

Signed-off-by: Alex Goodman <[email protected]>
Signed-off-by: Christopher Phillips <[email protected]>
Co-authored-by: Christopher Angelo Phillips <[email protected]>
Co-authored-by: Christopher Phillips <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Java archives not from Maven Java archive is listed twice
2 participants