Feature: re-visit outcome definition in findings #2928

laurentsimon · 2023-04-28T18:06:52Z

Use iota or explicit values
Do we need a NotApplicable outcome?

spencerschrock · 2023-09-06T18:17:26Z

From a discussion: Should probes have a numeric value?

e.g number of contributors for the contributor check. or number of minimum reviewers in code review check.

laurentsimon · 2023-09-06T18:33:43Z

AdamKorcz · 2023-10-03T11:15:05Z

From a discussion: Should probes have a numeric value?

e.g number of contributors for the contributor check. or number of minimum reviewers in code review check.

The checks that do proportional scoring will benefit from this, as the corresponding probes will need to return a numeric value. The checks that do proportional scoring are:

In most of the cases, the probes can handle this by themselves. For example. they can return the number of findings corresponding to a numeric value, however it does increase complexity in the probes, especially regarding understanding the probe outcomes; Some probes can return the number of findings, some probes can only return the number of findings in certain cases, other probes will not return the number of cases. A numeric field in the Outcome would in most cases replace a scenario where the number of finding outcomes is used as the numeric value. As such, which scenario is better: Using the number of findings as the numeric value or using a field in the Outcome?

A related question is how we should use the Outcome vs the numeric value across all probes. Why have OutcomePositive and OutcomeNegative at all if the probe can return numeric values? The purpose of a numeric field is to help in cases where a Positive/Negative outcome is not enough, however, not all developers would take that approach to begin with. For consistency across all probes, it would be good with documentation on the numeric values and that developers should aim to solve a given probe first by using the Outcome value. The documentation could make clear what the "ideal" outcome looks from a probe that returns numeric values.

Do we need a NotApplicable outcome?

From a high level, this is useful for probes that consider whether one/how many of SOMETHING is/are a security risk or not. Two checks that I can think of do that:

Webhooks.
Dangerous workflows.

If a project has either no webhooks or workflows, Scorecard cannot evaluate whether any of them are a risk. I could imagine that there can be more of these in the future, whether publicly maintained or not; For example, some users might want to check integrations with 3rd-party apps - for the sake of argument let's say an integration with Slack: A scorecard check could be whether this integration is set up securely or insecurely, however, you can only evaluate it if the project actually has a Slack integration.

Another example could be fuzzing: Scorecard could consider it positive if non-security findings from OSS-Fuzz are auto-created as a Github issue, however, if the project is not on Github, then Scorecard cannot evaluate this.

This could also be relevant to specific types of projects. For example, whether cloud-native projects have configured their .yaml files securely or not.

laurentsimon · 2023-10-03T20:49:21Z

Following up on the discussion we had this morning:

Numerical values

Sounds useful. Maybe we define an outcome structure like:

type Outcome Struct {
   Type enum {Int, Bool}
   Value interface{}
}

The thing to keep in mind is that we will ultimately allow end-users to define their own checks using a (hopefully simple) config file. See an example https://github.com/laurentsimon/scorecard/blob/mvp/flexible-checks/evaluation/policy.yml If we can keep the evaluation engine simple, it's better.

I think @AdamKorcz will write up a short proposal about what he thinks is an appropriate solution.

NotApplicable

Not a blocker, there was consensus among the maintainers that this is useful

Am I missing anything from today's conversation?

AdamKorcz · 2023-10-06T13:27:59Z

This is in my opinion the optimal way to add the option for numeric return values from probes. I am keeping this text short and am focusing on the details of what I see as the best solution, and am not including trade offs between different types of solutions. Please let me know if I should add more context.

I consider finding.Finding to a suitable place to add this option. Adding it there has little overhead and requires little rewriting of most uses of finding.Finding. The new finding.Finding would look like this:

type Finding struct {
	Probe       string             `json:"probe"`
	Outcome     Outcome            `json:"outcome"`
	Message     string             `json:"message"`
	Location    *Location          `json:"location,omitempty"`
	Remediation *probe.Remediation `json:"remediation,omitempty"`
        Values
}

I consider the best data structure to be a map[string]int. The string key allows us to have good readability when evaluating the numeric values. The int value allows us to easily compare it against other values in custom checks (in .yaml files) and in the Scorecard code.

I don’t see a need for creating nested maps now or in the near future.

As such, the complete, changed finding.Finding would look like this:

type Finding struct {
	Probe       string             `json:"probe"`
	Outcome     Outcome            `json:"outcome"`
	Message     string             `json:"message"`
	Location    *Location          `json:"location,omitempty"`
	Remediation *probe.Remediation `json:"remediation,omitempty"`
        Values       map[string]int    `json:”values,omitempty”`
}

With this change, there is no need for returning multiple findings from each probe. By adding the option to return a numeric value, we should make the following high-level changes to the existing (already merged) probes:

Change the return value of probes to be finding.Finding instead of []finding.Finding.
All fuzzing probes: Add a field the to Values map called “number of fuzzers” and return a single finding.
securityPolicyContainsLinks: Add two fieldsthe Values map : 1) Number of emails and 2) Number of URLs and return a single finding.
securityPolicyContainsText: Add a KV pair to the Values map called “number of policies” and return a single finding.
securityPolicyContainsVulnerabilityDisclosure: Add a KV pair to the Values map called “number of discvuls” and return a single finding.
securityPolicyPresent: Add a field the VALUES map called “number of security policies” and return a single finding.

Please scrutinize this suggestion.

spencerschrock · 2023-10-09T18:10:01Z

With this change, there is no need for returning multiple findings from each probe. By adding the option to return a numeric value, we should make the following high-level changes to the existing (already merged) probes:

Change the return value of probes to be finding.Finding instead of []finding.Finding.

I think there's still value in multiple findings, namely because their location differs. Consider any of the checks that evaluate file contents. There can be more than one dangerous workflow, unpinned dependency, etc.

AdamKorcz · 2023-10-09T18:26:25Z

With this change, there is no need for returning multiple findings from each probe. By adding the option to return a numeric value, we should make the following high-level changes to the existing (already merged) probes:

Change the return value of probes to be finding.Finding instead of []finding.Finding.

I think there's still value in multiple findings, namely because their location differs. Consider any of the checks that evaluate file contents. There can be more than one dangerous workflow, unpinned dependency, etc.

Fair point, but wouldn't the number of dangerous workflows and unpinned dependencies be placed in the numeric fields, ie. the map?

spencerschrock · 2023-10-09T20:43:03Z

Yes, but I think there's value in being able to point it to a location in the repo. When done properly, this location data lets us feed it into the Code Scanning dashboard (or PR feedback).

spencerschrock · 2023-10-24T18:13:35Z

There seems to be no way to do debug messages with findings? Is this something we are expecting? Should all debug logging be done at the raw results level?

One discussion was at https://github.com/ossf/scorecard/pull/3486/files#r1366854736

spencerschrock · 2024-04-05T18:45:15Z

In #2919 (comment), Laurent declared Outcome as an integer type as a way of comparing outcomes. I synced with him offline recently and that's not something we're envisioning anymore. Plus as he points out in that comment, comparing the other outcomes (e.g. OutcomeError, OutcomeNotAvailable) is complicated.

Changing the outcome type from int to string would make the current results more consumer friendly:
before:

{
"probe": "hasOSVVulnerabilities",
"message": "Project does not contain OSV vulnerabilities",
"outcome": 12
}

After

{
"probe": "hasOSVVulnerabilities",
"message": "Project does not contain OSV vulnerabilities",
"outcome": "Positive"
}

laurentsimon added the kind/enhancement New feature or request label Apr 28, 2023

laurentsimon mentioned this issue Apr 28, 2023

✨ [experimental] Create probes within findings #2919

Merged

laurentsimon changed the title ~~Feature: re-visit the need for NotApplicableOutcome type~~ Feature: re-visit outcome definition in findings Apr 28, 2023

laurentsimon added the arae:finding label Apr 28, 2023

naveensrinivasan added the finding label Apr 28, 2023

laurentsimon removed the area:finding label May 3, 2023

spencerschrock mentioned this issue Jun 23, 2023

Change in Dangerous-Workflow and Token-Permissions scores for repos with no workflows #3205

Closed

laurentsimon mentioned this issue Jul 24, 2023

✨ [experimental] Probe support for security policy check #3241

Merged

spencerschrock added this to the Structured results milestone Aug 18, 2023

AdamKorcz mentioned this issue Oct 3, 2023

🌱 convert Webhook check to probes #3522

Merged

2 tasks

AdamKorcz mentioned this issue Oct 6, 2023

🌱 Add OutcomeNotApplicable #3539

Merged

2 tasks

spencerschrock mentioned this issue Oct 10, 2023

🌱 convert vulnerabilities check to probe #3487

Merged

2 tasks

AdamKorcz mentioned this issue Oct 11, 2023

🌱 Add map to Finding #3558

Merged

2 tasks

afmarcum removed the finding label Oct 31, 2023

This was referenced Jan 27, 2024

Avoiding probe explosion and boilerplate #3824

Closed

🌱 Change finding Values to map[string]string #3837

Merged

afmarcum added this to Scorecard - NEW Mar 5, 2024

spencerschrock moved this to Todo in Scorecard - NEW Mar 7, 2024

spencerschrock mentioned this issue Apr 5, 2024

⚠️ Switch Outcome type to string #4006

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature: re-visit outcome definition in findings #2928

Feature: re-visit outcome definition in findings #2928

laurentsimon commented Apr 28, 2023 •

edited

Loading

spencerschrock commented Sep 6, 2023

laurentsimon commented Sep 6, 2023

AdamKorcz commented Oct 3, 2023 •

edited

Loading

laurentsimon commented Oct 3, 2023 •

edited

Loading

AdamKorcz commented Oct 6, 2023

spencerschrock commented Oct 9, 2023

AdamKorcz commented Oct 9, 2023

spencerschrock commented Oct 9, 2023

spencerschrock commented Oct 24, 2023

spencerschrock commented Apr 5, 2024

Feature: re-visit outcome definition in findings #2928

Feature: re-visit outcome definition in findings #2928

Comments

laurentsimon commented Apr 28, 2023 • edited Loading

spencerschrock commented Sep 6, 2023

laurentsimon commented Sep 6, 2023

AdamKorcz commented Oct 3, 2023 • edited Loading

laurentsimon commented Oct 3, 2023 • edited Loading

AdamKorcz commented Oct 6, 2023

spencerschrock commented Oct 9, 2023

AdamKorcz commented Oct 9, 2023

spencerschrock commented Oct 9, 2023

spencerschrock commented Oct 24, 2023

spencerschrock commented Apr 5, 2024

laurentsimon commented Apr 28, 2023 •

edited

Loading

AdamKorcz commented Oct 3, 2023 •

edited

Loading

laurentsimon commented Oct 3, 2023 •

edited

Loading