Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve npm package declared license detection #2488

Open
pombredanne opened this issue Apr 11, 2021 · 0 comments
Open

Improve npm package declared license detection #2488

pombredanne opened this issue Apr 11, 2021 · 0 comments

Comments

@pombredanne
Copy link
Member

pombredanne commented Apr 11, 2021

The goal of this ticket is to improve npm package license detection across the board. While scancode-toolkit's npm package detection basics are OK, there a few repeat cases where license information is not properly gathered from npm package metadata. Usually this is because a declared_license value contains things we did not expect (like a URL) or is improperly formed. There are also specific things we do not support such as SEE LICENSE (defined in https://docs.npmjs.com/cli/v7/configuring-npm/package-json#license and related ticket in #1364 )

Resolving this would likely require a mix of:

  • adding new license detection rules to scancode,
  • adding new and improved code to handle the specific patterns of license,
  • creating new license mappings
  • and possibly working with upstream maintainers to improve their license declarations.

The approach should be to start with a complete data set of all package manifests and find patterns of license issues and establish the baseline, possibly helped by heuristics, classifiers and ML if needed. The end results should be a significant improvement to the license detection quality for the npm packages.

See also these projects and pages of interest:

There are also other related ticket for other package types such as:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant