Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor performance tests artifacts handling #48684

Merged
merged 32 commits into from
Mar 28, 2023

Conversation

WunderBart
Copy link
Member

@WunderBart WunderBart commented Mar 2, 2023

What?

TL;DR:

Save all the performance result files to the main artifacts folder

Instead of storing files around the source code, let's use the already available WP_ARTIFACTS_PATH directory.

The process.env.WP_ARTIFACTS_PATH has been primarily used for storing the failed test artifacts (screenshots, HTML snapshots), but we can also use it for storing any other artifacts, like performance results or temporary trace files. Those files are currently stored alongside the source code and then copied to an intermediate ./__test-results/ folder, which is then picked up by the results upload CI action. After this PR, those files will be saved directly to the WP_ARTIFACTS_PATH folder, so we won't need to:

  • guess the paths,
  • copy results saved alongside the relative test file, and
  • create any extra folders to handle the artifacts upload.

Once created, they're ready to be picked up by the upload action because the CI will define the WP_ARTIFACTS_PATH variable.

Additionally, the final (calculated) results file that we currently save to the enigmatic path (before uploading them to codehealth), will now be saved to the WP_ARTIFACTS_PATH as well and zipped with the other (raw) results.

Finally, if the CI job fails, the results upload step will not occur. However, if there are any results that have already been created, they will still be uploaded along with the failure artifacts.

Note
The performance runner script spawns the branch jobs in separate child processes, so they don't have access to the main process.env object provided by the CI. To solve that, I'm passing the relevant env variables to the child processes via the runShellScript util.

Archive performance results for each comparison step

Currently, we are archiving performance results only for the Compare performance with trunk step. Now, the results will be uploaded for every comparison step as they provide useful information:

  • Compare performance with trunk
  • Compare performance with current WordPress Core and previous Gutenberg versions
  • Compare performance with base branch
  • Compare performance with custom branches

In case of a failure, the results upload action will not be dispatched. If any result files are written during a failed job, though, they will be uploaded within the failure artifacts upload, along with the screenshot and HTML snapshot of the failed test.

Sort results and test logs primarily by the suite name

This aligns with the test running order (test suite -> branch). See the comparison below of the current vs. new results upload content.

performance results upload (current) performance results upload (new)
Screenshot 2023-03-10 at 11 44 42 Screenshot 2023-03-10 at 11 45 03

I've updated the test log to reflect the above and added the test rounds info that was missing:

runner info (current) runner info (new)
Screenshot 2023-03-10 at 11 47 34 Screenshot 2023-03-10 at 11 47 55

Note
While working on this one, I've found out that the release performance tests have been failing for the past three months (#48920)

Write the env-specific configs directly

For the performance tests, we're creating "environment" folders for each branch we want to test against. Those folders need to contain env-specific wp-env config, which we're currently copying from a boilerplate config that we copy over and amend with the target env config values. There's no point keeping this boilerplate and doing the extra copy step, because we can write this file directly. Also, this way we can provide absolute paths which makes it more readable.

Testing Instructions

CI:

  • Tests should pass,
  • Results should be uploaded as described above (3 files for each test - branch A raw results, branch B raw results & computed results),
  • Result filenames should begin with the test name,
  • Logs for the comparison* jobs should be updated as described above.

Locally:

  • Checkout this branch
  • Make sure your local wp-env is stopped,
  • Run bin/plugin/cli.js perf refactor/perf-test-results-path trunk,
  • (Go for lunch or sth - it will take a while),
  • The job should pass,
  • All results should be saved to the ./artifacts folder.
  • The same should happen if you run tests individually with e.g. npm run test:performance test-editor

To test if the failure artifacts are still uploaded as expected:

  • Make any of the perf specs fail by adding expect(false).toBe(true),
  • Run bin/plugin/cli.js perf refactor/perf-test-results-path trunk again,
  • Once the job fail, the failure artifacts (screenshot + HTML snapshot) should be uploaded to ./artifacts as well.

@WunderBart WunderBart self-assigned this Mar 2, 2023
@WunderBart WunderBart marked this pull request as draft March 2, 2023 12:59
@github-actions
Copy link

github-actions bot commented Mar 2, 2023

Size Change: 0 B

Total Size: 1.34 MB

ℹ️ View Unchanged
Filename Size
build/a11y/index.min.js 993 B
build/annotations/index.min.js 2.78 kB
build/api-fetch/index.min.js 2.27 kB
build/autop/index.min.js 2.15 kB
build/blob/index.min.js 483 B
build/block-directory/index.min.js 7.2 kB
build/block-directory/style-rtl.css 1.04 kB
build/block-directory/style.css 1.04 kB
build/block-editor/content-rtl.css 4.11 kB
build/block-editor/content.css 4.1 kB
build/block-editor/default-editor-styles-rtl.css 403 B
build/block-editor/default-editor-styles.css 403 B
build/block-editor/index.min.js 197 kB
build/block-editor/style-rtl.css 14.4 kB
build/block-editor/style.css 14.4 kB
build/block-library/blocks/archives/editor-rtl.css 61 B
build/block-library/blocks/archives/editor.css 60 B
build/block-library/blocks/archives/style-rtl.css 90 B
build/block-library/blocks/archives/style.css 90 B
build/block-library/blocks/audio/editor-rtl.css 150 B
build/block-library/blocks/audio/editor.css 150 B
build/block-library/blocks/audio/style-rtl.css 122 B
build/block-library/blocks/audio/style.css 122 B
build/block-library/blocks/audio/theme-rtl.css 138 B
build/block-library/blocks/audio/theme.css 138 B
build/block-library/blocks/avatar/editor-rtl.css 116 B
build/block-library/blocks/avatar/editor.css 116 B
build/block-library/blocks/avatar/style-rtl.css 91 B
build/block-library/blocks/avatar/style.css 91 B
build/block-library/blocks/block/editor-rtl.css 305 B
build/block-library/blocks/block/editor.css 305 B
build/block-library/blocks/button/editor-rtl.css 587 B
build/block-library/blocks/button/editor.css 587 B
build/block-library/blocks/button/style-rtl.css 628 B
build/block-library/blocks/button/style.css 627 B
build/block-library/blocks/buttons/editor-rtl.css 337 B
build/block-library/blocks/buttons/editor.css 337 B
build/block-library/blocks/buttons/style-rtl.css 332 B
build/block-library/blocks/buttons/style.css 332 B
build/block-library/blocks/calendar/style-rtl.css 239 B
build/block-library/blocks/calendar/style.css 239 B
build/block-library/blocks/categories/editor-rtl.css 84 B
build/block-library/blocks/categories/editor.css 83 B
build/block-library/blocks/categories/style-rtl.css 100 B
build/block-library/blocks/categories/style.css 100 B
build/block-library/blocks/code/editor-rtl.css 53 B
build/block-library/blocks/code/editor.css 53 B
build/block-library/blocks/code/style-rtl.css 121 B
build/block-library/blocks/code/style.css 121 B
build/block-library/blocks/code/theme-rtl.css 124 B
build/block-library/blocks/code/theme.css 124 B
build/block-library/blocks/columns/editor-rtl.css 108 B
build/block-library/blocks/columns/editor.css 108 B
build/block-library/blocks/columns/style-rtl.css 406 B
build/block-library/blocks/columns/style.css 406 B
build/block-library/blocks/comment-author-avatar/editor-rtl.css 125 B
build/block-library/blocks/comment-author-avatar/editor.css 125 B
build/block-library/blocks/comment-content/style-rtl.css 92 B
build/block-library/blocks/comment-content/style.css 92 B
build/block-library/blocks/comment-template/style-rtl.css 199 B
build/block-library/blocks/comment-template/style.css 198 B
build/block-library/blocks/comments-pagination-numbers/editor-rtl.css 123 B
build/block-library/blocks/comments-pagination-numbers/editor.css 121 B
build/block-library/blocks/comments-pagination/editor-rtl.css 222 B
build/block-library/blocks/comments-pagination/editor.css 209 B
build/block-library/blocks/comments-pagination/style-rtl.css 235 B
build/block-library/blocks/comments-pagination/style.css 231 B
build/block-library/blocks/comments-title/editor-rtl.css 75 B
build/block-library/blocks/comments-title/editor.css 75 B
build/block-library/blocks/comments/editor-rtl.css 840 B
build/block-library/blocks/comments/editor.css 839 B
build/block-library/blocks/comments/style-rtl.css 637 B
build/block-library/blocks/comments/style.css 636 B
build/block-library/blocks/cover/editor-rtl.css 612 B
build/block-library/blocks/cover/editor.css 613 B
build/block-library/blocks/cover/style-rtl.css 1.6 kB
build/block-library/blocks/cover/style.css 1.59 kB
build/block-library/blocks/embed/editor-rtl.css 293 B
build/block-library/blocks/embed/editor.css 293 B
build/block-library/blocks/embed/style-rtl.css 410 B
build/block-library/blocks/embed/style.css 410 B
build/block-library/blocks/embed/theme-rtl.css 138 B
build/block-library/blocks/embed/theme.css 138 B
build/block-library/blocks/file/editor-rtl.css 300 B
build/block-library/blocks/file/editor.css 300 B
build/block-library/blocks/file/style-rtl.css 265 B
build/block-library/blocks/file/style.css 265 B
build/block-library/blocks/file/view.min.js 353 B
build/block-library/blocks/freeform/editor-rtl.css 2.44 kB
build/block-library/blocks/freeform/editor.css 2.44 kB
build/block-library/blocks/gallery/editor-rtl.css 984 B
build/block-library/blocks/gallery/editor.css 988 B
build/block-library/blocks/gallery/style-rtl.css 1.55 kB
build/block-library/blocks/gallery/style.css 1.55 kB
build/block-library/blocks/gallery/theme-rtl.css 122 B
build/block-library/blocks/gallery/theme.css 122 B
build/block-library/blocks/group/editor-rtl.css 654 B
build/block-library/blocks/group/editor.css 654 B
build/block-library/blocks/group/style-rtl.css 57 B
build/block-library/blocks/group/style.css 57 B
build/block-library/blocks/group/theme-rtl.css 78 B
build/block-library/blocks/group/theme.css 78 B
build/block-library/blocks/heading/style-rtl.css 76 B
build/block-library/blocks/heading/style.css 76 B
build/block-library/blocks/html/editor-rtl.css 332 B
build/block-library/blocks/html/editor.css 333 B
build/block-library/blocks/image/editor-rtl.css 830 B
build/block-library/blocks/image/editor.css 829 B
build/block-library/blocks/image/style-rtl.css 652 B
build/block-library/blocks/image/style.css 652 B
build/block-library/blocks/image/theme-rtl.css 137 B
build/block-library/blocks/image/theme.css 137 B
build/block-library/blocks/latest-comments/style-rtl.css 357 B
build/block-library/blocks/latest-comments/style.css 357 B
build/block-library/blocks/latest-posts/editor-rtl.css 213 B
build/block-library/blocks/latest-posts/editor.css 212 B
build/block-library/blocks/latest-posts/style-rtl.css 478 B
build/block-library/blocks/latest-posts/style.css 478 B
build/block-library/blocks/list/style-rtl.css 88 B
build/block-library/blocks/list/style.css 88 B
build/block-library/blocks/media-text/editor-rtl.css 266 B
build/block-library/blocks/media-text/editor.css 263 B
build/block-library/blocks/media-text/style-rtl.css 507 B
build/block-library/blocks/media-text/style.css 505 B
build/block-library/blocks/more/editor-rtl.css 431 B
build/block-library/blocks/more/editor.css 431 B
build/block-library/blocks/navigation-link/editor-rtl.css 716 B
build/block-library/blocks/navigation-link/editor.css 715 B
build/block-library/blocks/navigation-link/style-rtl.css 115 B
build/block-library/blocks/navigation-link/style.css 115 B
build/block-library/blocks/navigation-submenu/editor-rtl.css 299 B
build/block-library/blocks/navigation-submenu/editor.css 299 B
build/block-library/blocks/navigation/editor-rtl.css 2.13 kB
build/block-library/blocks/navigation/editor.css 2.14 kB
build/block-library/blocks/navigation/style-rtl.css 2.22 kB
build/block-library/blocks/navigation/style.css 2.2 kB
build/block-library/blocks/navigation/view-modal.min.js 2.81 kB
build/block-library/blocks/navigation/view.min.js 447 B
build/block-library/blocks/nextpage/editor-rtl.css 395 B
build/block-library/blocks/nextpage/editor.css 395 B
build/block-library/blocks/page-list/editor-rtl.css 401 B
build/block-library/blocks/page-list/editor.css 401 B
build/block-library/blocks/page-list/style-rtl.css 175 B
build/block-library/blocks/page-list/style.css 175 B
build/block-library/blocks/paragraph/editor-rtl.css 174 B
build/block-library/blocks/paragraph/editor.css 174 B
build/block-library/blocks/paragraph/style-rtl.css 279 B
build/block-library/blocks/paragraph/style.css 281 B
build/block-library/blocks/post-author/style-rtl.css 175 B
build/block-library/blocks/post-author/style.css 176 B
build/block-library/blocks/post-comments-form/editor-rtl.css 96 B
build/block-library/blocks/post-comments-form/editor.css 96 B
build/block-library/blocks/post-comments-form/style-rtl.css 501 B
build/block-library/blocks/post-comments-form/style.css 501 B
build/block-library/blocks/post-date/style-rtl.css 61 B
build/block-library/blocks/post-date/style.css 61 B
build/block-library/blocks/post-excerpt/editor-rtl.css 71 B
build/block-library/blocks/post-excerpt/editor.css 71 B
build/block-library/blocks/post-excerpt/style-rtl.css 134 B
build/block-library/blocks/post-excerpt/style.css 134 B
build/block-library/blocks/post-featured-image/editor-rtl.css 586 B
build/block-library/blocks/post-featured-image/editor.css 584 B
build/block-library/blocks/post-featured-image/style-rtl.css 322 B
build/block-library/blocks/post-featured-image/style.css 322 B
build/block-library/blocks/post-navigation-link/style-rtl.css 153 B
build/block-library/blocks/post-navigation-link/style.css 153 B
build/block-library/blocks/post-template/editor-rtl.css 99 B
build/block-library/blocks/post-template/editor.css 98 B
build/block-library/blocks/post-template/style-rtl.css 282 B
build/block-library/blocks/post-template/style.css 282 B
build/block-library/blocks/post-terms/style-rtl.css 96 B
build/block-library/blocks/post-terms/style.css 96 B
build/block-library/blocks/post-title/style-rtl.css 100 B
build/block-library/blocks/post-title/style.css 100 B
build/block-library/blocks/preformatted/style-rtl.css 103 B
build/block-library/blocks/preformatted/style.css 103 B
build/block-library/blocks/pullquote/editor-rtl.css 135 B
build/block-library/blocks/pullquote/editor.css 135 B
build/block-library/blocks/pullquote/style-rtl.css 326 B
build/block-library/blocks/pullquote/style.css 325 B
build/block-library/blocks/pullquote/theme-rtl.css 167 B
build/block-library/blocks/pullquote/theme.css 167 B
build/block-library/blocks/query-pagination-numbers/editor-rtl.css 122 B
build/block-library/blocks/query-pagination-numbers/editor.css 121 B
build/block-library/blocks/query-pagination/editor-rtl.css 221 B
build/block-library/blocks/query-pagination/editor.css 211 B
build/block-library/blocks/query-pagination/style-rtl.css 288 B
build/block-library/blocks/query-pagination/style.css 284 B
build/block-library/blocks/query-title/style-rtl.css 63 B
build/block-library/blocks/query-title/style.css 63 B
build/block-library/blocks/query/editor-rtl.css 463 B
build/block-library/blocks/query/editor.css 463 B
build/block-library/blocks/quote/style-rtl.css 222 B
build/block-library/blocks/quote/style.css 222 B
build/block-library/blocks/quote/theme-rtl.css 223 B
build/block-library/blocks/quote/theme.css 226 B
build/block-library/blocks/read-more/style-rtl.css 132 B
build/block-library/blocks/read-more/style.css 132 B
build/block-library/blocks/rss/editor-rtl.css 149 B
build/block-library/blocks/rss/editor.css 149 B
build/block-library/blocks/rss/style-rtl.css 289 B
build/block-library/blocks/rss/style.css 288 B
build/block-library/blocks/search/editor-rtl.css 165 B
build/block-library/blocks/search/editor.css 165 B
build/block-library/blocks/search/style-rtl.css 409 B
build/block-library/blocks/search/style.css 406 B
build/block-library/blocks/search/theme-rtl.css 114 B
build/block-library/blocks/search/theme.css 114 B
build/block-library/blocks/separator/editor-rtl.css 146 B
build/block-library/blocks/separator/editor.css 146 B
build/block-library/blocks/separator/style-rtl.css 234 B
build/block-library/blocks/separator/style.css 234 B
build/block-library/blocks/separator/theme-rtl.css 194 B
build/block-library/blocks/separator/theme.css 194 B
build/block-library/blocks/shortcode/editor-rtl.css 474 B
build/block-library/blocks/shortcode/editor.css 474 B
build/block-library/blocks/site-logo/editor-rtl.css 489 B
build/block-library/blocks/site-logo/editor.css 489 B
build/block-library/blocks/site-logo/style-rtl.css 203 B
build/block-library/blocks/site-logo/style.css 203 B
build/block-library/blocks/site-tagline/editor-rtl.css 86 B
build/block-library/blocks/site-tagline/editor.css 86 B
build/block-library/blocks/site-title/editor-rtl.css 116 B
build/block-library/blocks/site-title/editor.css 116 B
build/block-library/blocks/site-title/style-rtl.css 57 B
build/block-library/blocks/site-title/style.css 57 B
build/block-library/blocks/social-link/editor-rtl.css 184 B
build/block-library/blocks/social-link/editor.css 184 B
build/block-library/blocks/social-links/editor-rtl.css 674 B
build/block-library/blocks/social-links/editor.css 673 B
build/block-library/blocks/social-links/style-rtl.css 1.4 kB
build/block-library/blocks/social-links/style.css 1.39 kB
build/block-library/blocks/spacer/editor-rtl.css 332 B
build/block-library/blocks/spacer/editor.css 332 B
build/block-library/blocks/spacer/style-rtl.css 48 B
build/block-library/blocks/spacer/style.css 48 B
build/block-library/blocks/table/editor-rtl.css 433 B
build/block-library/blocks/table/editor.css 433 B
build/block-library/blocks/table/style-rtl.css 651 B
build/block-library/blocks/table/style.css 650 B
build/block-library/blocks/table/theme-rtl.css 157 B
build/block-library/blocks/table/theme.css 157 B
build/block-library/blocks/tag-cloud/style-rtl.css 251 B
build/block-library/blocks/tag-cloud/style.css 253 B
build/block-library/blocks/template-part/editor-rtl.css 404 B
build/block-library/blocks/template-part/editor.css 404 B
build/block-library/blocks/template-part/theme-rtl.css 101 B
build/block-library/blocks/template-part/theme.css 101 B
build/block-library/blocks/text-columns/editor-rtl.css 95 B
build/block-library/blocks/text-columns/editor.css 95 B
build/block-library/blocks/text-columns/style-rtl.css 166 B
build/block-library/blocks/text-columns/style.css 166 B
build/block-library/blocks/verse/style-rtl.css 99 B
build/block-library/blocks/verse/style.css 99 B
build/block-library/blocks/video/editor-rtl.css 552 B
build/block-library/blocks/video/editor.css 555 B
build/block-library/blocks/video/style-rtl.css 179 B
build/block-library/blocks/video/style.css 179 B
build/block-library/blocks/video/theme-rtl.css 139 B
build/block-library/blocks/video/theme.css 139 B
build/block-library/classic-rtl.css 179 B
build/block-library/classic.css 179 B
build/block-library/common-rtl.css 1.11 kB
build/block-library/common.css 1.11 kB
build/block-library/editor-elements-rtl.css 75 B
build/block-library/editor-elements.css 75 B
build/block-library/editor-rtl.css 11.6 kB
build/block-library/editor.css 11.6 kB
build/block-library/elements-rtl.css 54 B
build/block-library/elements.css 54 B
build/block-library/index.min.js 201 kB
build/block-library/reset-rtl.css 478 B
build/block-library/reset.css 478 B
build/block-library/style-rtl.css 12.7 kB
build/block-library/style.css 12.7 kB
build/block-library/theme-rtl.css 698 B
build/block-library/theme.css 703 B
build/block-serialization-default-parser/index.min.js 1.13 kB
build/block-serialization-spec-parser/index.min.js 2.83 kB
build/blocks/index.min.js 51 kB
build/components/index.min.js 208 kB
build/components/style-rtl.css 11.7 kB
build/components/style.css 11.7 kB
build/compose/index.min.js 12.4 kB
build/core-data/index.min.js 16.3 kB
build/customize-widgets/index.min.js 12.2 kB
build/customize-widgets/style-rtl.css 1.41 kB
build/customize-widgets/style.css 1.41 kB
build/data-controls/index.min.js 663 B
build/data/index.min.js 8.58 kB
build/date/index.min.js 40.4 kB
build/deprecated/index.min.js 518 B
build/dom-ready/index.min.js 336 B
build/dom/index.min.js 4.72 kB
build/edit-post/classic-rtl.css 571 B
build/edit-post/classic.css 571 B
build/edit-post/index.min.js 34.8 kB
build/edit-post/style-rtl.css 7.55 kB
build/edit-post/style.css 7.54 kB
build/edit-site/index.min.js 65 kB
build/edit-site/style-rtl.css 10.2 kB
build/edit-site/style.css 10.2 kB
build/edit-widgets/index.min.js 17.3 kB
build/edit-widgets/style-rtl.css 4.56 kB
build/edit-widgets/style.css 4.56 kB
build/editor/index.min.js 45.8 kB
build/editor/style-rtl.css 3.54 kB
build/editor/style.css 3.53 kB
build/element/index.min.js 4.95 kB
build/escape-html/index.min.js 548 B
build/format-library/index.min.js 7.26 kB
build/format-library/style-rtl.css 557 B
build/format-library/style.css 556 B
build/hooks/index.min.js 1.66 kB
build/html-entities/index.min.js 454 B
build/i18n/index.min.js 3.79 kB
build/is-shallow-equal/index.min.js 535 B
build/keyboard-shortcuts/index.min.js 1.79 kB
build/keycodes/index.min.js 1.94 kB
build/list-reusable-blocks/index.min.js 2.14 kB
build/list-reusable-blocks/style-rtl.css 865 B
build/list-reusable-blocks/style.css 865 B
build/media-utils/index.min.js 2.99 kB
build/notices/index.min.js 977 B
build/plugins/index.min.js 1.95 kB
build/preferences-persistence/index.min.js 2.23 kB
build/preferences/index.min.js 1.35 kB
build/primitives/index.min.js 960 B
build/priority-queue/index.min.js 1.52 kB
build/private-apis/index.min.js 937 B
build/react-i18n/index.min.js 702 B
build/react-refresh-entry/index.min.js 8.44 kB
build/react-refresh-runtime/index.min.js 7.31 kB
build/redux-routine/index.min.js 2.75 kB
build/reusable-blocks/index.min.js 2.26 kB
build/reusable-blocks/style-rtl.css 265 B
build/reusable-blocks/style.css 265 B
build/rich-text/index.min.js 11 kB
build/server-side-render/index.min.js 2.09 kB
build/shortcode/index.min.js 1.52 kB
build/style-engine/index.min.js 1.53 kB
build/token-list/index.min.js 650 B
build/url/index.min.js 3.74 kB
build/vendors/inert-polyfill.min.js 2.48 kB
build/vendors/react-dom.min.js 41.8 kB
build/vendors/react.min.js 4.02 kB
build/viewport/index.min.js 1.09 kB
build/warning/index.min.js 280 B
build/widgets/index.min.js 7.3 kB
build/widgets/style-rtl.css 1.18 kB
build/widgets/style.css 1.18 kB
build/wordcount/index.min.js 1.06 kB

compressed-size-action

@github-actions
Copy link

github-actions bot commented Mar 2, 2023

Flaky tests detected in 7c86fa4.
Some tests passed with failed attempts. The failures may not be related to this commit but are still reported for visibility. See the documentation for more information.

🔍 Workflow run URL: https://github.com/WordPress/gutenberg/actions/runs/4415097223
📝 Reported issues:

@WunderBart WunderBart force-pushed the refactor/perf-test-results-path branch from 0c299b1 to d27f085 Compare March 2, 2023 15:12
@WunderBart WunderBart force-pushed the refactor/perf-test-results-path branch 2 times, most recently from 89698f2 to 0c1446a Compare March 6, 2023 14:00
Base automatically changed from fix/site-editor-performance-tests to trunk March 7, 2023 08:59
@WunderBart WunderBart force-pushed the refactor/perf-test-results-path branch 5 times, most recently from b4eb3c0 to 1d8219e Compare March 7, 2023 14:45
@WunderBart WunderBart changed the title Refactor performance test results path handling Refactor performance tests artifacts handling Mar 10, 2023
metricsPrefix: 'block-theme-',
},
{
file: 'front-end-classic-theme-performance-results.json',
file: 'front-end-classic-theme.results.json',
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is there a need to remove this? it was my assumption that we might end up with more test results in the future and this could help make it obvious which result belongs to which test.

it's fine to change it, and we can handle it later of course, but it has been helpful to me at least seeing all the information in the filename

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The idea here was that if CI zips it into performance-results folder, then it doesn't need a .performance-results.js suffix. On second thought, though, there's no zipping when running locally, and everything lands in the artifacts folder, so it would be better to keep that suffix AND add it to the raw result jsons as well. Will make an update.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

`packages/e2e-tests/specs/performance/${ testSuite }.test.results.json`
)
await runShellScript(
`npm run test:performance -- ${ testSuite } --wordpress-artifacts-path=${ artifactsPath } --results-filename=${ resultsFilename }`,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

if we're relying on the ENV is it necessary to pass this here or could the performance test read the same ENV?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I thought it would, but it looks like only the top-level process (node cli.js perf) receives the ENV from CI. The await runShellScript( "npm run ... spawns a separate process that doesn't receive the same ENV so I needed to figure out a way to pass it down. Maybe there's a better way to share the env, but at my current level of understanding, this made the most sense.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we can manually pass through any ENV we want in runShellScript

/**
* Utility to run a child script
*
* @typedef {NodeJS.ProcessEnv} Env
*
* @param {string} script Script to run.
* @param {string=} cwd Working directory.
* @param {Env=} env Additional environment variables to pass to the script.
*/
function runShellScript( script, cwd, env = {} ) {

@youknowriad would it make sense to pass env: { ...process.env, …, ...env } as a default? instead of env: { …, ...env }? would that be dangerous?

Copy link
Member Author

@WunderBart WunderBart Mar 20, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we can manually pass through any ENV we want in runShellScript

Yeah I've just noticed that! 😄 Made already some changes in that regard in a53065b. I'm not passing the whole parent env though - just the relevant bits. I'm curious though if passing the whole thing would be safe.

@@ -225,6 +210,7 @@ async function runTestSuite( testSuite, performanceTestDirectory, runKey ) {
async function runPerformanceTests( branches, options ) {
const runningInCI = !! process.env.CI || !! options.ci;
const TEST_ROUNDS = options.rounds || 1;
const artifactsPath = process.env.WP_ARTIFACTS_PATH || '';
Copy link
Member

@dmsnell dmsnell Mar 10, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

to double-check, if this is empty, it will print to the current directory? or should we pass './' as the default? or something like __dirname?

Copy link
Member Author

@WunderBart WunderBart Mar 13, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If this is empty, it will be resolved down the stream (per job) when the puppeteer env is set up (when running npm run test:performance):

const root = process.env.GITHUB_WORKSPACE || process.cwd();
const ARTIFACTS_PATH = path.resolve(
root,
process.env.WP_ARTIFACTS_PATH || 'artifacts'
);

Note that the GITHUB_WORKSPACE var is not available here when running in CI (as mentioned here) so it will always resolve to the cwd. This was causing the artifacts to be saved along the env folders that we create during the perf tests setup and the necessity to copy them later on to the workspace's ./__test-results folder, so that they're available for the CI.

Copy link
Member Author

@WunderBart WunderBart Mar 20, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note that the GITHUB_WORKSPACE var is not available here when running in CI (...)

⚠️ To be clear, this is (currently) true only for the performance tests because they're run via child processes. For the E2E tests that run directly from the CI, the parent process env variables are accessible here. I've made some changes in a53065b to hopefully streamline this bit.

'/.wp-env.json'
// Create the config file for the current env.
fs.writeFileSync(
path.join( environmentDirectory, '.wp-env.json' ),
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

similar question here about removing performance from the name. when I download these files they lose some inherent linkage to the performance CI job that created them.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It was the same before - we were copying the .wp-env.performance.json and renaming it to .wp-env.json, so in this regard nothing has changed. I assumed the default .wp-env.json name is used for the wp-env start script to pick it up without any extra arguments.

rawResults[ i ] = {};
for ( const branch of branches ) {
const runKey = `${ branch }_${ testSuite }_run-${ i }`;
const runKey = `${ testSuite }_${ branch }_run-${ i }`;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is there context behind this swap? not the most important thing, but it stands out in this PR where we're working on the path changes.

I guess there's a claim that grouping by test suite is more favorable than grouping by branch?

Copy link
Member Author

@WunderBart WunderBart Mar 13, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've swapped it because it's the actual order in which we iterate through those tests and how we compare the results (suite A: branch X vs. branch Y, suite B: branch X vs. branch Y, etc.). So both the log info and results filenames will now be compiled in this fashion:

runner info (current) runner info (new)
Screenshot 2023-03-10 at 11 47 34 Screenshot 2023-03-10 at 11 47 55
results upload (current) results upload (new)
Screenshot 2023-03-10 at 11 44 42 Screenshot 2023-03-10 at 11 45 03

Copy link
Member Author

@WunderBart WunderBart Mar 13, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As a side note, I think we should also replace trunk in the reference measurements with commit SHA, because trunk alone doesn't say much. Also, the summary should point to what is the reference and what is subject of the comparison, and display a 3rd column (delta) to show whether we're slowing down or speeding up, for example:

>> Comparing trunk (13a14ca) with the current branch (32b4c12)

┌──────────────────────┬─────────────────────────┬────────────────────────┬─────────────────┐
│       (index)        │    13a14ca (reference)  │    32b4c12 (subject)   │        Δ        │
├──────────────────────┼─────────────────────────┼────────────────────────┼─────────────────┤
│    serverResponse    │       '235.12 ms'       │      '233.46 ms'       │      -0.5%      │
│      firstPaint      │       '251.88 ms'       │      '186.88 ms'       │     +18.2%      │
│   domContentLoaded   │       '685.96 ms'       │      '651.18 ms'       │      +0.5%      │
│        loaded        │       '686.72 ms'       │      '652.04 ms'       │      x.xx%      │
│ firstContentfulPaint │       '8436.28 ms'      │       '8057 ms'        │      x.xx%      │
│      firstBlock      │       '9095.38 ms'      │      '8713.32 ms'      │      x.xx%      │
│         type         │        '42.98 ms'       │       '43.03 ms'       │      x.xx%      │
│       minType        │        '41.9 ms'        │       '41.92 ms'       │      x.xx%      │
│       maxType        │        '45.71 ms'       │       '46.52 ms'       │      x.xx%      │
│    typeContainer     │        '14.81 ms'       │       '14.99 ms'       │      x.xx%      │
│   minTypeContainer   │        '13.35 ms'       │       '13.69 ms'       │      x.xx%      │
│   maxTypeContainer   │        '18.87 ms'       │       '17.7 ms'        │      x.xx%      │
│        focus         │        '40.99 ms'       │       '44.78 ms'       │      x.xx%      │
│       minFocus       │        '35.64 ms'       │       '35.16 ms'       │      x.xx%      │
│       maxFocus       │        '64.35 ms'       │       '71.31 ms'       │      x.xx%      │
│     inserterOpen     │        '33.11 ms'       │       '32.39 ms'       │      x.xx%      │
│   minInserterOpen    │        '27.96 ms'       │       '27.73 ms'       │      x.xx%      │
│   maxInserterOpen    │        '52.56 ms'       │       '52.9 ms'        │      x.xx%      │
│    inserterSearch    │        '13.53 ms'       │       '12.29 ms'       │      x.xx%      │
│  minInserterSearch   │        '5.76 ms'        │       '6.11 ms'        │      x.xx%      │
│  maxInserterSearch   │        '19.31 ms'       │       '18.43 ms'       │      x.xx%      │
│    inserterHover     │        '26.31 ms'       │       '26.4 ms'        │      x.xx%      │
│   minInserterHover   │        '22.51 ms'       │       '22.38 ms'       │      x.xx%      │
│   maxInserterHover   │        '47.75 ms'       │       '46.44 ms'       │      x.xx%      │
│     listViewOpen     │       '157.32 ms'       │      '159.09 ms'       │      x.xx%      │
│   minListViewOpen    │       '142.89 ms'       │      '142.65 ms'       │      x.xx%      │
│   maxListViewOpen    │       '235.52 ms'       │      '253.54 ms'       │      x.xx%      │
└──────────────────────┴─────────────────────────┴────────────────────────┴─────────────────┘

What do you think?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should also replace trunk in the reference measurements with commit SHA

if we can display both that also has value. I think different needs use them differently. for most purposes trunk does a better job communicating than 13a14ca because we recognize that when a CI job runs it's running against the latest trunk at the time the CI job ran. we can always lookup what 13a14ca is, but that involves additional steps when trunk is pretty bare and simple.

what is the reference and what is subject of the comparison

for most runs this matters, but the script supports multiple "branches" beyond two, and in those cases I'm not sure one is the reference. if we include this we will be making a decision that one of the "branches" is the reference, likely the first one. (branches is in quotes because it's really just a git ref).

and display a 3rd column (delta) to show whether we're slowing down or speeding up, for example:

no strong opinion here on what we should do, but I can share my personal hesitancy with this due to the fact that we're still hitting some wild variation across tests (even with the recent fixes), and percentages always make changes look simpler than they are. I like making the easy-to-misuse things harder to use, whereas simply reporting ms is just that, a fact reported without any interpretation, because the interpretation is itself extremely difficult to automate.

@WunderBart WunderBart requested a review from dmsnell March 16, 2023 11:40
'test/emptytheme'
),
'https://downloads.wordpress.org/theme/twentytwentyone.1.7.zip',
'https://downloads.wordpress.org/theme/twentytwentythree.1.0.zip',
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are these things we want to hard-code here in this seemingly independent function? I could easily imagine someone wanting to update the themes and then overlooking these lines…

Copy link
Member Author

@WunderBart WunderBart Mar 21, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is only carried over from the .wp-env.performance.json boilerplate file. Hard-coding those here was a solution to an issue described in #49063. In my mind, it was an intermediate solution because those themes should be installed within the tests that use them so that when running tests in isolation (e.g. via npm run test:performance theme-tests), same themes are used as in CI or when we're running locally via the CLI. I would consider this a blocker - I plan to address this in a follow-up PR.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would consider this a blocker - I plan to address this in a follow-up PR.

if you meant "I would not consider this a blocker" then that sounds good. if not, then I'm confused 😄

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah, good catch! Missed the important 'not' there 😅

@dmsnell
Copy link
Member

dmsnell commented Mar 20, 2023

Really love where you've taken this.

What do you see as continued blockers to get this in, or do you think it's ready to go?

@WunderBart
Copy link
Member Author

WunderBart commented Mar 21, 2023

What do you see as continued blockers to get this in, or do you think it's ready to go?

@dmsnell, I don't see any blockers at this point. In a follow-up PR which I'm already working on, I plan to make the builds reusable for subsequent jobs (at least locally) and parallelize them (builds) to make things faster.

Copy link
Member

@dmsnell dmsnell left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the work. I don't see any reason to block this.

@WunderBart WunderBart merged commit 2cf161f into trunk Mar 28, 2023
@WunderBart WunderBart deleted the refactor/perf-test-results-path branch March 28, 2023 14:00
@github-actions github-actions bot added this to the Gutenberg 15.5 milestone Mar 28, 2023
jeherve added a commit to Automattic/jetpack that referenced this pull request Apr 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
GitHub Actions Pull requests that update GitHub Actions code [Type] Performance Related to performance efforts
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants