Add Percy visual regression testing #2551

vanitabarrett · 2022-02-18T17:02:06Z

Part of #2070

What

This PR adds Percy visual regression testing to GOV.UK Frontend. Screenshots are generated for each default example for every component, both with JavaScript enabled and disabled.

Proposal

As discussed in dev catch-up, we generally think visual regression tools may help us catch unintended visual changes when we're working. There were some concerns around hitting screenshot limits, Percy checks being ignored if approvals aren't required (PRs can get rushed through into main without checking the screenshots), and generally it not being a replacement for manual testing as it doesn't cover older browsers, high contrast mode etc.

I'm proposing a trial period with this PR. I suggest we:

add Percy to GOV.UK Frontend until the end of the quarter (~ early April) and revisit our experience then
~~initially make Percy a required check~~ Edit to add: unfortunately, because PRs from forks don't have access to Github secrets, Percy won't run on these PRs. We therefore can't make it a required check, as it'll never run/pass for these PRs.
put the responsibility for approving Percy runs onto the person reviewing the pull request

With all of the above, we should revisit decisions earlier than April if they're causing us issues and slowing down development.

Why

Visual regression tools can help us catch visual changes which we weren't expecting, for example: if we make changes to one area of the codebase, without realising that it will have an effect elsewhere.

fofr · 2022-02-22T09:29:49Z

Nice! Will the Percy screenshots and diffs be open to the public?

docs/releasing/testing-and-linting.md

domoscargin · 2022-02-23T10:00:43Z

I love this so, so much. Left a comment about findability.

36degrees

Exciting stuff! Have you by any chance looked into how many snapshots you think we might end up taking in an average month?

36degrees · 2022-02-23T10:41:45Z

.github/workflows/tests.yml

-        run: npm test
+        run: node_modules/.bin/percy exec -- npm test
+        env:
+          PERCY_TOKEN: ${{ secrets.PERCY_TOKEN }}


This secret won't be available to PRs from forks, so we'll need to think about how this works for external contributors.

At the very least, we'll need to make sure this fails gracefully (the tests still run and pass) – based on the documentation you added I'm guessing it will still run the tests with '[percy] Percy is not running, disabling snapshots' but would be good to check this.

We'll also need to think about how this works if we aren't able to check for visual regressions in external PRs until after they've been merged.

That's a good point. I've checked that behaviour, and you're right, the tests are run without Percy generating snapshots and you see this in the output:

[percy] Skipping visual tests [percy] Error: Missing Percy token [percy] Running "npm test" .... [normal test output here]

I'll add some documentation for that, and have a think about what the process might look like for that..

@36degrees I've been digging into this a bit more, and I think the fact that the checks can't run on a PR from a fork means that we can't make Percy a required check either (it'll never run for PRs from forks, and I can't find a way to manually trigger any kind of Percy build from our side).

I think that means our only option is to not mark Percy as a required check and just be disciplined in trying to review the builds when we review. Can you think of any other alternatives?

It would be significantly more work, but if we wanted to we could explore using the pull_request_target or workflow_run events, which run in the context of the base of the pull request (likely main) and have access to secrets.

A workflow might do something like:

When a PR is opened, a workflow triggered by the pull_request event compile the JS and CSS from that branch and saves them as an artefact

Kick off another workflow using the workflow_run event, which runs in the context of the base branch and has access to secrets

That workflow downloads the built JS and CSS and runs the review app using the compiled JS and CSS, then runs the tests to take snapshots

Pass the context of the original PR to Percy to allow it to associate it with the PR, add the status checks etc.

We'd need to be very careful to ensure that we're not introducing potential vulnerabilities to our workflows – there's a good article on this here: https://securitylab.github.com/research/github-actions-preventing-pwn-requests/

My initial thought is we should probably try the simple thing first and explore this route in the future if we find it's causing friction?

docs/releasing/testing-and-linting.md

domoscargin

This is great and should hopefully give us a bit more confidence that our changes aren't breaking unexpected things.

One concern is that Percy.io appears to be fully broken when JavaScript is disabled, so I could foresee potential issues in reaching and approving changes during patchy internet times (and also JS is just more likely to fail).

Happy to see this being trialled!

vanitabarrett · 2022-02-23T15:25:19Z

@36degrees In terms of estimated snapshots per month, I did do a quick calculation when I raised the PR last week.

Each Percy build generates approx 62 snapshots * 2 (browsers we have enabled) = 124 snapshots per run. I think the 62 snapshots already accounts for generating it across 2 screen sizes. Looking at the number of PRs we've had open so far in Feb, and looking at the approx number of pushes, that's around 70 Percy builds in half a month = 8,680. So for a full month we may be looking at around 17,360 snapshots.

We get 100,000 screenshots a month (for the whole Percy account). Taking GOV.UK usage into account, which is fairly low, we should still be well under the maximum each month even if the 62 snapshots above doesn't account for screen sizes and/or we have busier months.

docs/releasing/testing-and-linting.md

This installs the @percy/cli that we need to run our tests using Percy. Follows the Percy documentation https://docs.percy.io/docs/puppeteer This commit does not include setting up our Puppeteer tests to capture any Percy snapshots - this commit installs what we need, ready to go.

Adds a test which takes a snapshot of each component 'default' example.

Enhances the test we added in the previous commit, so that we're now generating snapshots of each component 'default' example when JavaScript is enabled and when it's disabled. We need to make sure each snapshot has a unique name, so we prefix the snapshot name with text to indicate whether it was taken with JavaScript enabled or disabled.

Add documentation to explain how we use Percy, who is responsible for reviewing it, and what screenshots are taken.

docs/releasing/testing-and-linting.md

vanitabarrett · 2022-02-24T12:01:59Z

Thanks @EoinShaughnessy , I think I've addressed those comments if you want to take a second look.

docs/releasing/testing-and-linting.md

EoinShaughnessy

@vanitabarrett Left one last tiny (optional!) suggestion - but yeah, content looks great!

govuk-design-system-ci temporarily deployed to govuk-frontend-pr-2551 February 18, 2022 17:02 Inactive

vanitabarrett marked this pull request as ready for review February 22, 2022 10:54

vanitabarrett requested a review from a team as a code owner February 22, 2022 10:54

vanitabarrett added the tooling label Feb 22, 2022

domoscargin reviewed Feb 23, 2022

View reviewed changes

docs/releasing/testing-and-linting.md Outdated Show resolved Hide resolved

vanitabarrett linked an issue Feb 23, 2022 that may be closed by this pull request

Add visual regression testing #2070

Closed

2 tasks

36degrees reviewed Feb 23, 2022

View reviewed changes

domoscargin approved these changes Feb 23, 2022

View reviewed changes

govuk-design-system-ci temporarily deployed to govuk-frontend-pr-2551 February 23, 2022 15:14 Inactive

vanitabarrett force-pushed the test-percy branch from 5fe7353 to c16972f Compare February 24, 2022 09:54

govuk-design-system-ci temporarily deployed to govuk-frontend-pr-2551 February 24, 2022 09:54 Inactive

vanitabarrett requested a review from EoinShaughnessy February 24, 2022 10:01

vanitabarrett commented Feb 24, 2022

View reviewed changes

docs/releasing/testing-and-linting.md Show resolved Hide resolved

Vanita Barrett added 5 commits February 24, 2022 10:27

Install @percy/puppeteer and setup tests to capture snapshots

b7e1d19

Adds a test which takes a snapshot of each component 'default' example.

Add documentation for Percy visual regression testing

c35fb2c

Add documentation to explain how we use Percy, who is responsible for reviewing it, and what screenshots are taken.

Add documentation for PRs from forks

8b09a33

vanitabarrett force-pushed the test-percy branch from c16972f to 8b09a33 Compare February 24, 2022 10:32

govuk-design-system-ci temporarily deployed to govuk-frontend-pr-2551 February 24, 2022 10:32 Inactive

EoinShaughnessy reviewed Feb 24, 2022

View reviewed changes

govuk-design-system-ci temporarily deployed to govuk-frontend-pr-2551 February 24, 2022 12:00 Inactive

EoinShaughnessy reviewed Feb 24, 2022

View reviewed changes

docs/releasing/testing-and-linting.md Outdated Show resolved Hide resolved

EoinShaughnessy approved these changes Feb 24, 2022

View reviewed changes

Docs improvements from review

a2fc893

vanitabarrett force-pushed the test-percy branch from f005c9e to a2fc893 Compare February 24, 2022 13:44

govuk-design-system-ci temporarily deployed to govuk-frontend-pr-2551 February 24, 2022 13:45 Inactive

vanitabarrett merged commit fd4952f into main Feb 24, 2022

vanitabarrett deleted the test-percy branch February 24, 2022 13:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Percy visual regression testing #2551

Add Percy visual regression testing #2551

vanitabarrett commented Feb 18, 2022 •

edited

Loading

fofr commented Feb 22, 2022

domoscargin commented Feb 23, 2022

36degrees left a comment

36degrees Feb 23, 2022

vanitabarrett Feb 23, 2022 •

edited

Loading

vanitabarrett Feb 23, 2022

36degrees Feb 23, 2022

domoscargin left a comment

vanitabarrett commented Feb 23, 2022

vanitabarrett commented Feb 24, 2022

EoinShaughnessy left a comment

Add Percy visual regression testing #2551

Add Percy visual regression testing #2551

Conversation

vanitabarrett commented Feb 18, 2022 • edited Loading

What

Proposal

Why

fofr commented Feb 22, 2022

domoscargin commented Feb 23, 2022

36degrees left a comment

Choose a reason for hiding this comment

36degrees Feb 23, 2022

Choose a reason for hiding this comment

vanitabarrett Feb 23, 2022 • edited Loading

Choose a reason for hiding this comment

vanitabarrett Feb 23, 2022

Choose a reason for hiding this comment

36degrees Feb 23, 2022

Choose a reason for hiding this comment

domoscargin left a comment

Choose a reason for hiding this comment

vanitabarrett commented Feb 23, 2022

vanitabarrett commented Feb 24, 2022

EoinShaughnessy left a comment

Choose a reason for hiding this comment

vanitabarrett commented Feb 18, 2022 •

edited

Loading

vanitabarrett Feb 23, 2022 •

edited

Loading