Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
bugtool: Dump envoy metrics for troubleshooting
[ upstream commit 0307add ] Users might not have prometheus metrics endpoint enabled as part of existing Cilium installation. This commit is to add the capability to dump envoy metrics without the need of re-installation with additional helm flag, or updating existing cilium config map. One common use case is to check if there is any connectivity issue (e.g. 503, timeout, etc) for egress traffic. For example, the below metrics are part of the dump, these two metrics clearly signal some configuration issues with TLS egress. ```bash envoy_cluster_upstream_rq{envoy_response_code="503",envoy_cluster_name="egress-cluster-tls"} 100 envoy_cluster_upstream_cx_connect_fail{envoy_cluster_name="egress-cluster-tls"} 300 ``` Testing was done locally by running curl command in pod manually ```bash $ kubectl exec -n kube-system ds/cilium -- curl --unix-socket /var/run/cilium/envoy-admin.sock http:/admin/stats/prometheus > metrics_dump.txt $ cat metrics_dump.txt | wc -l 2753 ``` Signed-off-by: Tam Mach <[email protected]> Signed-off-by: Paul Chaignon <[email protected]>
- Loading branch information