Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Automate Graceful Recovery NFR #1832

Merged
merged 36 commits into from
May 3, 2024
Merged

Conversation

bjee19
Copy link
Contributor

@bjee19 bjee19 commented Apr 17, 2024

Problem: We want to automate the Graceful Recovery NFR.

Solution: Extend existing automation to cover the following test cases in the Graceful Recovery NFR: "restart nginx-gateway container" and "restart NGINX container".

Testing: Test correctly matches results of manual run of graceful-recovery NFR test.

Closes #1365

Output of test w/o Skip:

Will run 2 of 12 specs
------------------------------
[BeforeSuite] 
/home/username/nginx-gateway-fabric/tests/suite/system_suite_test.go:238
[BeforeSuite] PASSED [73.895 seconds]
------------------------------
SSSSSSSSSS
------------------------------
Graceful Recovery test recovers when NGF container is restarted [nfr, graceful-recovery]
/home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:70
• [45.174 seconds]
------------------------------
Graceful Recovery test recovers when nginx container is restarted [nfr, graceful-recovery]
/home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:74
  [FAILED] in [It] - /home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:233 @ 04/30/24 18:39:04.063
• [FAILED] [3.790 seconds]
Graceful Recovery test [It] recovers when nginx container is restarted [nfr, graceful-recovery]
/home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:74

  [FAILED] /docker-entrypoint.sh: /docker-entrypoint.d/ is not empty, will attempt to perform configuration
  /docker-entrypoint.sh: Looking for shell scripts in /docker-entrypoint.d/
  /docker-entrypoint.sh: Launching /docker-entrypoint.d/10-listen-on-ipv6-by-default.sh
  10-listen-on-ipv6-by-default.sh: info: /etc/nginx/conf.d/default.conf is not a file or does not exist
  /docker-entrypoint.sh: Sourcing /docker-entrypoint.d/15-local-resolvers.envsh
  /docker-entrypoint.sh: Launching /docker-entrypoint.d/20-envsubst-on-templates.sh
  /docker-entrypoint.sh: Launching /docker-entrypoint.d/30-tune-worker-processes.sh
  /docker-entrypoint.sh: Configuration complete; ready for start up
  2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/run/nginx/nginx-config-version.sock failed (98: Address in use)
  2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/lib/nginx/nginx-502-server.sock failed (98: Address in use)
  2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/lib/nginx/nginx-500-server.sock failed (98: Address in use)
  2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/run/nginx/nginx-status.sock failed (98: Address in use)
  2024/04/30 18:39:03 [notice] 92#92: try again to bind() after 500ms
  2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/run/nginx/nginx-config-version.sock failed (98: Address in use)
  2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/lib/nginx/nginx-502-server.sock failed (98: Address in use)
  2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/lib/nginx/nginx-500-server.sock failed (98: Address in use)
  2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/run/nginx/nginx-status.sock failed (98: Address in use)
  2024/04/30 18:39:03 [notice] 92#92: try again to bind() after 500ms

  Expected
      <string>: /docker-entrypoint.sh: /docker-entrypoint.d/ is not empty, will attempt to perform configuration
      /docker-entrypoint.sh: Looking for shell scripts in /docker-entrypoint.d/
      /docker-entrypoint.sh: Launching /docker-entrypoint.d/10-listen-on-ipv6-by-default.sh
      10-listen-on-ipv6-by-default.sh: info: /etc/nginx/conf.d/default.conf is not a file or does not exist
      /docker-entrypoint.sh: Sourcing /docker-entrypoint.d/15-local-resolvers.envsh
      /docker-entrypoint.sh: Launching /docker-entrypoint.d/20-envsubst-on-templates.sh
      /docker-entrypoint.sh: Launching /docker-entrypoint.d/30-tune-worker-processes.sh
      /docker-entrypoint.sh: Configuration complete; ready for start up
      2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/run/nginx/nginx-config-version.sock failed (98: Address in use)
      2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/lib/nginx/nginx-502-server.sock failed (98: Address in use)
      2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/lib/nginx/nginx-500-server.sock failed (98: Address in use)
      2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/run/nginx/nginx-status.sock failed (98: Address in use)
      2024/04/30 18:39:03 [notice] 92#92: try again to bind() after 500ms
      2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/run/nginx/nginx-config-version.sock failed (98: Address in use)
      2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/lib/nginx/nginx-502-server.sock failed (98: Address in use)
      2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/lib/nginx/nginx-500-server.sock failed (98: Address in use)
      2024/04/30 18:39:03 [emerg] 92#92: bind() to unix:/var/run/nginx/nginx-status.sock failed (98: Address in use)
      2024/04/30 18:39:03 [notice] 92#92: try again to bind() after 500ms
      
  not to contain substring
      <string>: emerg
  In [It] at: /home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:233 @ 04/30/24 18:39:04.063

  Full Stack Trace
    github.com/nginxinc/nginx-gateway-fabric/tests/suite.checkContainerLogsForErrors({0xc0005d6090, 0x2e})
        /home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:233 +0x1ba
    github.com/nginxinc/nginx-gateway-fabric/tests/suite.runRecoveryTest({0x1bc67de, 0x1c}, {0x1bc9929, 0x1e}, {0xc0005d6090, 0x2e}, {0x1ba931f, 0x5}, {0xc000228800, 0x4, ...}, ...)
        /home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:92 +0x112
    github.com/nginxinc/nginx-gateway-fabric/tests/suite.init.func3.5()
        /home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:75 +0x59
------------------------------
[AfterSuite] 
/home/username/nginx-gateway-fabric/tests/suite/system_suite_test.go:264
[AfterSuite] PASSED [56.574 seconds]
------------------------------

Summarizing 1 Failure:
  [FAIL] Graceful Recovery test [It] recovers when nginx container is restarted [nfr, graceful-recovery]
  /home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:233

Ran 2 of 12 Specs in 179.434 seconds
FAIL! -- 1 Passed | 1 Failed | 0 Pending | 10 Skipped
--- FAIL: TestNGF (179.44s)
FAIL

Output of test with Skip:

• [26.809 seconds]
------------------------------
Graceful Recovery test recovers when nginx container is restarted [nfr, graceful-recovery]
/home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:81
  [SKIPPED] in [It] - /home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:84 @ 05/01/24 15:59:36.41
S [SKIPPED] [0.872 seconds]
Graceful Recovery test [It] recovers when nginx container is restarted [nfr, graceful-recovery]
/home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:81

  [SKIPPED] Test currently fails due to this issue: https://github.com/nginxinc/nginx-gateway-fabric/issues/1108
  In [It] at: /home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:84 @ 05/01/24 15:59:36.41

  Full Stack Trace
    github.com/nginxinc/nginx-gateway-fabric/tests/suite.init.func3.5()
        /home/username/nginx-gateway-fabric/tests/suite/graceful_recovery_test.go:84 +0x92
------------------------------
SS
------------------------------
[AfterSuite] 
/home/username/nginx-gateway-fabric/tests/suite/system_suite_test.go:264
[AfterSuite] PASSED [63.160 seconds]
------------------------------

Ran 1 of 12 Specs in 163.326 seconds
SUCCESS! -- 1 Passed | 0 Failed | 0 Pending | 11 Skipped
PASS

Checklist

Before creating a PR, run through this checklist and mark each as complete.

  • I have read the CONTRIBUTING doc
  • I have added tests that prove my fix is effective or that my feature works
  • I have checked that all unit tests pass after adding my changes
  • I have updated necessary documentation
  • I have rebased my branch onto main
  • I will ensure my PR is targeting the main branch and pulling from my branch from my own fork

Release notes

If this PR introduces a change that affects users and needs to be mentioned in the release notes,
please add a brief note that summarizes the change.

NONE

@github-actions github-actions bot added tests Pull requests that update tests helm-chart Relates to helm chart labels Apr 17, 2024
tests/framework/request.go Fixed Show resolved Hide resolved
deploy/helm-chart/templates/deployment.yaml Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
@bjee19 bjee19 marked this pull request as ready for review April 25, 2024 21:36
@bjee19 bjee19 requested a review from a team as a code owner April 25, 2024 21:36
@bjee19 bjee19 requested review from pleshakov and sjberman April 25, 2024 22:06
Copy link
Collaborator

@sjberman sjberman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Be sure to move all existing results for this test to the results/ directory, and you can delete the graceful-recovery/ directory.

tests/framework/request.go Fixed Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Show resolved Hide resolved
@bjee19 bjee19 force-pushed the tests/automate-graceful branch from 9bf650c to 92d4ebe Compare April 29, 2024 20:27
tests/framework/request.go Dismissed Show dismissed Hide dismissed
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Show resolved Hide resolved
@bjee19
Copy link
Contributor Author

bjee19 commented May 1, 2024

Be sure to move all existing results for this test to the results/ directory, and you can delete the graceful-recovery/ directory.

Should I put this on hold until #1910 and #1901 go through?

@sjberman
Copy link
Collaborator

sjberman commented May 1, 2024

@bjee19

Should I put this on hold until #1910 and #1901 go through?

Yep

tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Outdated Show resolved Hide resolved
tests/suite/graceful_recovery_test.go Show resolved Hide resolved
@bjee19 bjee19 requested a review from pleshakov May 1, 2024 21:41
@bjee19 bjee19 requested a review from pleshakov May 2, 2024 22:17
@bjee19 bjee19 requested a review from sjberman May 3, 2024 15:37
@bjee19 bjee19 force-pushed the tests/automate-graceful branch from 478f5b1 to e326e96 Compare May 3, 2024 17:49
@bjee19 bjee19 merged commit fdde6ba into nginx:main May 3, 2024
40 checks passed
@bjee19 bjee19 deleted the tests/automate-graceful branch May 3, 2024 18:05
miledxz added a commit to miledxz/nginx-gateway-fabric that referenced this pull request Jan 14, 2025
Problem: We want to automate the Graceful Recovery NFR.

Solution: Extend existing automation to cover the following test cases in the Graceful Recovery NFR: "restart nginx-gateway container" and "restart NGINX container".

Testing: Test correctly matches results of manual run of graceful-recovery NFR test.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
tests Pull requests that update tests
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Automate Graceful Recovery from Restarts Test
3 participants