-
Notifications
You must be signed in to change notification settings - Fork 364
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
e2e: Add backend health check e2e case via active http #3677
Merged
arkodg
merged 12 commits into
envoyproxy:main
from
aoledk:backend-http-active-health-check-e2e
Jul 3, 2024
Merged
Changes from 3 commits
Commits
Show all changes
12 commits
Select commit
Hold shift + click to select a range
3509d8b
Add backend health check e2e case via active http
aoledk 85079f7
Merge branch 'main' into backend-http-active-health-check-e2e
aoledk db1e92a
Use membership_healthy stats to check backend health check
aoledk c0f0c79
Merge branch 'main' into backend-http-active-health-check-e2e
aoledk 5486deb
Use non-zero cluster health check stats to verify backend health check
aoledk 62f8088
Merge branch 'main' into backend-http-active-health-check-e2e
aoledk 14e09dd
Merge branch 'main' into backend-http-active-health-check-e2e
aoledk d7e66aa
Merge branch 'main' into backend-http-active-health-check-e2e
aoledk 344a380
Merge branch 'main' into backend-http-active-health-check-e2e
aoledk 1fd4c33
Merge branch 'main' into backend-http-active-health-check-e2e
aoledk fb9b7a1
Merge branch 'main' into backend-http-active-health-check-e2e
zirain bf9cabf
Merge branch 'main' into backend-http-active-health-check-e2e
aoledk File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
110 changes: 110 additions & 0 deletions
110
test/e2e/testdata/backend-health-check-active-http.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,110 @@ | ||
apiVersion: gateway.envoyproxy.io/v1alpha1 | ||
kind: BackendTrafficPolicy | ||
metadata: | ||
name: health-check-active-http-pass-btp | ||
namespace: gateway-conformance-infra | ||
spec: | ||
targetRefs: | ||
- group: gateway.networking.k8s.io | ||
kind: HTTPRoute | ||
name: http-with-health-check-active-http-pass | ||
namespace: gateway-conformance-infra | ||
healthCheck: | ||
active: | ||
timeout: 3s | ||
interval: 5s | ||
unhealthyThreshold: 3 | ||
healthyThreshold: 1 | ||
type: HTTP | ||
http: | ||
path: "/status/200" | ||
expectedStatuses: | ||
- 200 | ||
--- | ||
apiVersion: gateway.envoyproxy.io/v1alpha1 | ||
kind: BackendTrafficPolicy | ||
metadata: | ||
name: health-check-active-http-fail-btp | ||
namespace: gateway-conformance-infra | ||
spec: | ||
targetRefs: | ||
- group: gateway.networking.k8s.io | ||
kind: HTTPRoute | ||
name: http-with-health-check-active-http-fail | ||
namespace: gateway-conformance-infra | ||
healthCheck: | ||
active: | ||
timeout: 3s | ||
interval: 5s | ||
unhealthyThreshold: 3 | ||
healthyThreshold: 1 | ||
type: HTTP | ||
http: | ||
path: "/status/418" | ||
expectedStatuses: | ||
- 200 | ||
--- | ||
apiVersion: gateway.networking.k8s.io/v1 | ||
kind: HTTPRoute | ||
metadata: | ||
name: http-with-health-check-active-http-pass | ||
namespace: gateway-conformance-infra | ||
spec: | ||
parentRefs: | ||
- name: same-namespace | ||
rules: | ||
- matches: | ||
- path: | ||
type: PathPrefix | ||
value: /health-check-active-http-pass | ||
backendRefs: | ||
- name: infra-backend-v1 | ||
port: 8080 | ||
--- | ||
apiVersion: gateway.networking.k8s.io/v1 | ||
kind: HTTPRoute | ||
metadata: | ||
name: http-with-health-check-active-http-fail | ||
namespace: gateway-conformance-infra | ||
spec: | ||
parentRefs: | ||
- name: same-namespace | ||
rules: | ||
- matches: | ||
- path: | ||
type: PathPrefix | ||
value: /health-check-active-http-fail | ||
backendRefs: | ||
- name: infra-backend-v1 | ||
port: 8080 | ||
--- | ||
# cluster panic threshold should be set to 0.0 to disable panic, to check whether health check works. | ||
apiVersion: gateway.envoyproxy.io/v1alpha1 | ||
kind: EnvoyPatchPolicy | ||
metadata: | ||
name: disable-cluster-panic-patch-policy | ||
namespace: gateway-conformance-infra | ||
spec: | ||
targetRef: | ||
group: gateway.networking.k8s.io | ||
kind: Gateway | ||
name: same-namespace | ||
namespace: gateway-conformance-infra | ||
type: JSONPatch | ||
jsonPatches: | ||
- type: "type.googleapis.com/envoy.config.cluster.v3.Cluster" | ||
name: "httproute/gateway-conformance-infra/http-with-health-check-active-http-pass/rule/0" | ||
operation: | ||
op: add | ||
path: "/common_lb_config" | ||
value: | ||
healthy_panic_threshold: | ||
value: 0.0 | ||
- type: "type.googleapis.com/envoy.config.cluster.v3.Cluster" | ||
name: "httproute/gateway-conformance-infra/http-with-health-check-active-http-fail/rule/0" | ||
operation: | ||
op: add | ||
path: "/common_lb_config" | ||
value: | ||
healthy_panic_threshold: | ||
value: 0.0 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,142 @@ | ||
// Copyright Envoy Gateway Authors | ||
// SPDX-License-Identifier: Apache-2.0 | ||
// The full text of the Apache license is available in the LICENSE file at | ||
// the root of the repo. | ||
|
||
//go:build e2e | ||
// +build e2e | ||
|
||
package tests | ||
|
||
import ( | ||
"fmt" | ||
"testing" | ||
"time" | ||
|
||
"github.com/stretchr/testify/require" | ||
"k8s.io/apimachinery/pkg/types" | ||
gwapiv1 "sigs.k8s.io/gateway-api/apis/v1" | ||
gwapiv1a2 "sigs.k8s.io/gateway-api/apis/v1alpha2" | ||
"sigs.k8s.io/gateway-api/conformance/utils/http" | ||
"sigs.k8s.io/gateway-api/conformance/utils/kubernetes" | ||
"sigs.k8s.io/gateway-api/conformance/utils/suite" | ||
|
||
"github.com/envoyproxy/gateway/internal/gatewayapi" | ||
"github.com/envoyproxy/gateway/test/e2e/utils/prometheus" | ||
) | ||
|
||
func init() { | ||
ConformanceTests = append(ConformanceTests, BackendHealthCheckActiveHTTPTest) | ||
} | ||
|
||
var BackendHealthCheckActiveHTTPTest = suite.ConformanceTest{ | ||
ShortName: "BackendHealthCheckActiveHTTP", | ||
Description: "Resource with BackendHealthCheckActiveHTTP enabled", | ||
Manifests: []string{"testdata/backend-health-check-active-http.yaml"}, | ||
Test: func(t *testing.T, suite *suite.ConformanceTestSuite) { | ||
t.Run("active http", func(t *testing.T) { | ||
ns := "gateway-conformance-infra" | ||
passRouteNN := types.NamespacedName{Name: "http-with-health-check-active-http-pass", Namespace: ns} | ||
failRouteNN := types.NamespacedName{Name: "http-with-health-check-active-http-fail", Namespace: ns} | ||
gwNN := types.NamespacedName{Name: "same-namespace", Namespace: ns} | ||
gwAddr := kubernetes.GatewayAndHTTPRoutesMustBeAccepted(t, suite.Client, suite.TimeoutConfig, suite.ControllerName, kubernetes.NewGatewayRef(gwNN), passRouteNN, failRouteNN) | ||
|
||
ancestorRef := gwapiv1a2.ParentReference{ | ||
Group: gatewayapi.GroupPtr(gwapiv1.GroupName), | ||
Kind: gatewayapi.KindPtr(gatewayapi.KindGateway), | ||
Namespace: gatewayapi.NamespacePtr(gwNN.Namespace), | ||
Name: gwapiv1.ObjectName(gwNN.Name), | ||
} | ||
BackendTrafficPolicyMustBeAccepted(t, suite.Client, types.NamespacedName{Name: "health-check-active-http-pass-btp", Namespace: ns}, suite.ControllerName, ancestorRef) | ||
BackendTrafficPolicyMustBeAccepted(t, suite.Client, types.NamespacedName{Name: "health-check-active-http-fail-btp", Namespace: ns}, suite.ControllerName, ancestorRef) | ||
|
||
promAddr, err := prometheus.Address(suite.Client, | ||
types.NamespacedName{Name: "prometheus", Namespace: "monitoring"}, | ||
) | ||
require.NoError(t, err) | ||
|
||
passClusterName := fmt.Sprintf("httproute/%s/%s/rule/0", ns, passRouteNN.Name) | ||
failClusterName := fmt.Sprintf("httproute/%s/%s/rule/0", ns, failRouteNN.Name) | ||
gtwName := "same-namespace" | ||
|
||
// health check requests will be distributed to the cluster with configured path. | ||
// we can use membership_healthy stats to check whether health check works as expected. | ||
passPromQL := fmt.Sprintf(`envoy_cluster_membership_healthy{envoy_cluster_name="%s",gateway_envoyproxy_io_owning_gateway_name="%s"}`, passClusterName, gtwName) | ||
failPromQL := fmt.Sprintf(`envoy_cluster_membership_healthy{envoy_cluster_name="%s",gateway_envoyproxy_io_owning_gateway_name="%s"}`, failClusterName, gtwName) | ||
|
||
http.AwaitConvergence( | ||
t, | ||
suite.TimeoutConfig.RequiredConsecutiveSuccesses, | ||
suite.TimeoutConfig.MaxTimeToConsistency, | ||
func(_ time.Duration) bool { | ||
// check membership_healthy stats from Prometheus | ||
v, err := prometheus.QuerySum(promAddr, passPromQL) | ||
if err != nil { | ||
// wait until Prometheus sync stats | ||
return false | ||
} | ||
t.Logf("cluster pass health check: membership_healthy stats query count: %v", v) | ||
|
||
if v == 0 { | ||
t.Error("healthy membership is not the same as expected") | ||
} else { | ||
t.Log("healthy membership is the same as expected") | ||
} | ||
|
||
return true | ||
}, | ||
) | ||
|
||
http.AwaitConvergence( | ||
t, | ||
suite.TimeoutConfig.RequiredConsecutiveSuccesses, | ||
suite.TimeoutConfig.MaxTimeToConsistency, | ||
func(_ time.Duration) bool { | ||
// check membership_healthy stats from Prometheus | ||
v, err := prometheus.QuerySum(promAddr, failPromQL) | ||
if err != nil { | ||
// wait until Prometheus sync stats | ||
return false | ||
} | ||
t.Logf("cluster fail health check: membership_healthy stats query count: %v", v) | ||
|
||
if v == 0 { | ||
t.Log("healthy membership is same as expected") | ||
} else { | ||
t.Error("healthy membership is not same as expected") | ||
} | ||
|
||
return true | ||
}, | ||
) | ||
|
||
t.Run("health check pass", func(t *testing.T) { | ||
expectedResponse := http.ExpectedResponse{ | ||
Request: http.Request{ | ||
Path: "/health-check-active-http-pass", | ||
}, | ||
Response: http.Response{ | ||
StatusCode: 200, | ||
}, | ||
Namespace: ns, | ||
} | ||
|
||
http.MakeRequestAndExpectEventuallyConsistentResponse(t, suite.RoundTripper, suite.TimeoutConfig, gwAddr, expectedResponse) | ||
}) | ||
|
||
t.Run("health check fail", func(t *testing.T) { | ||
expectedResponse := http.ExpectedResponse{ | ||
Request: http.Request{ | ||
Path: "/health-check-active-http-fail", | ||
}, | ||
Response: http.Response{ | ||
StatusCode: 503, | ||
}, | ||
Namespace: ns, | ||
} | ||
|
||
http.MakeRequestAndExpectEventuallyConsistentResponse(t, suite.RoundTripper, suite.TimeoutConfig, gwAddr, expectedResponse) | ||
}) | ||
}) | ||
}, | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
shouldn't this be non zero ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
you prob need to wait until non zero is reached
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've turned to use non-zero cluster health check stats to verify whether backend health check works, for both pass and fail cases. And undo changes in
e2e/utils/prometheus.sum
to restore the behavior of only returnnil
when non-zero value is queried.