-
Notifications
You must be signed in to change notification settings - Fork 146
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Flaky Test]: TestLongRunningAgentForLeaks/TestHandleLeak – Metricbeat input status reporting makes Windows agent permanently degraded #5300
Comments
Pinging @elastic/elastic-agent (Team:Elastic-Agent) |
This is the regression test for elastic/beats#37142 so we need to get this working again as soon as we can. As long as it is failing this way, the test isn't testing what it is supposed to so I'm going to skip it to give us time to sort that out. |
I am going to relax the check in the test to allow the degraded state while we figure out how to handle it. #5301 We do not want all Windows Metricbeat instances to report degraded by default. There is nothing unique about this test (except perhaps the increased metrics collection interval). |
This has been mitigated by #5301 so the test can still detect memory leaks but the underlying problems causing the degraded state still remain. There appear to be two separate errors happening: units:
input-beat/metrics-monitoring-metrics-monitoring-beats:
message: 'Error fetching data for metricset beat.stats: error making http request: Get "http://unix/stats": dial unix /opt/Elastic/Agent/data/tmp/iThI_df0cBKC6YUNGGlKscMkOfz3FBH3.sock: connect: no such file or directory'
payload:
streams:
metrics-monitoring-filebeat:
error: ""
status: HEALTHY
metrics-monitoring-metricbeat:
error: 'Error fetching data for metricset beat.stats: error making http request: Get "http://unix/stats": dial unix /opt/Elastic/Agent/data/tmp/iThI_df0cBKC6YUNGGlKscMkOfz3FBH3.sock: connect: no such file or directory'
status: DEGRADED
state: 3 and input-system/metrics-default-system/metrics-system-5f5e65eb-2fd6-41e1-8c29-f24d57e66509:
message: |-
Error fetching data for metricset system.process_summary: Not enough privileges to fetch information: Not enough privileges to fetch information: GetInfoForPid: could not get all information for PID 0: error fetching name: OpenProcess failed for pid=0: The parameter is incorrect.
error fetching status: OpenProcess failed for pid=0: The parameter is incorrect.
GetInfoForPid: could not get all information for PID 4: error fetching name: GetProcessImageFileName failed for pid=4: GetProcessImageFileName failed: invalid argument
payload:
streams:
system/metrics-system.process-5f5e65eb-2fd6-41e1-8c29-f24d57e66509:
error: |-
Error fetching data for metricset system.process: Not enough privileges to fetch information: Not enough privileges to fetch information: GetInfoForPid: could not get all information for PID 0: error fetching name: OpenProcess failed for pid=0: The parameter is incorrect.
error fetching status: OpenProcess failed for pid=0: The parameter is incorrect.
GetInfoForPid: could not get all information for PID 4: error fetching name: GetProcessImageFileName failed for pid=4: GetProcessImageFileName failed: invalid argument
non fatal error fetching PID some info for 116, metrics are valid, but partial: FillMetricsRequiringMoreAccess: error fetching process args: Not enough privileges to fetch information: OpenProcess failed: Access is denied.
non fatal error fetching PID some info for 360, metrics are valid, but partial: FillMetricsRequiringMoreAccess: error fetching process args: Not enough privileges to fetch information: OpenProcess failed: Access is denied.
non fatal error fetching PID some info for 472, metrics are valid, but partial: FillMetricsRequiringMoreAccess: error fetching process args: Not enough privileges to fetch information: OpenProcess failed: Access is denied.
non fatal error fetching PID some info for 556, metrics are valid, but partial: FillMetricsRequiringMoreAccess: error fetching process args: Not enough privileges to fetch information: OpenProcess failed: Access is denied.
non fatal error fetching PID some info for 564, metrics are valid, but partial: FillMetricsRequiringMoreAccess: error fetching process args: Not enough privileges to fetch information: OpenProcess failed: Access is denied.
non fatal error fetching PID some info for 696, metrics are valid, but partial: FillMetricsRequiringMoreAccess: error fetching process args: Not enough privileges to fetch information: OpenProcess failed: Access is denied.
non fatal error fetching PID some info for 4304, metrics are valid, but partial: FillMetricsRequiringMoreAccess: error fetching process args: Not enough privileges to fetch information: OpenProcess failed: Access is denied.
non fatal error fetching PID some info for 3108, metrics are valid, but partial: FillMetricsRequiringMoreAccess: error fetching process args: Not enough privileges to fetch information: OpenProcess failed: Access is denied.
non fatal error fetching PID some info for 2116, metrics are valid, but partial: FillMetricsRequiringMoreAccess: error fetching process args: Not enough privileges to fetch information: OpenProcess failed: Access is denied.
non fatal error fetching PID some info for 1716, metrics are valid, but partial: FillMetricsRequiringMoreAccess: error fetching process args: Not enough privileges to fetch information: OpenProcess failed: Access is denied.
non fatal error fetching PID some info for 4856, metrics are valid, but partial: FillMetricsRequiringMoreAccess: error fetching process args: Not enough privileges to fetch information: OpenProcess failed: Access is denied.
status: DEGRADED
system/metrics-system.process.summary-5f5e65eb-2fd6-41e1-8c29-f24d57e66509:
error: |-
Error fetching data for metricset system.process_summary: Not enough privileges to fetch information: Not enough privileges to fetch information: GetInfoForPid: could not get all information for PID 0: error fetching name: OpenProcess failed for pid=0: The parameter is incorrect.
error fetching status: OpenProcess failed for pid=0: The parameter is incorrect.
GetInfoForPid: could not get all information for PID 4: error fetching name: GetProcessImageFileName failed for pid=4: GetProcessImageFileName failed: invalid argument
status: DEGRADED |
A way to permanently mitigate the second error is elastic/beats#40542 which would let us revert #5301 |
@ycombinator @pierrehilbert this is fixed via elastic/beats#40565. Passing extended runtime leak tests. I'm closing this as of now. |
Failing test case
TestLongRunningAgentForLeaks/TestHandleLeak
Error message
agent isn't healthy, current status: DEGRADED
Build
OS
Linux, Windows
Stacktrace and notes
The text was updated successfully, but these errors were encountered: