-
Notifications
You must be signed in to change notification settings - Fork 4.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
System.Diagnostics.Tests.EventLogSourceCreationTests failing on PRs #36135
Comments
Tagging subscribers to this area: @ViktorHofer |
Presumably this was caused by #35911 ? |
Hmm, yeah looks suspicious. These tests intention was to not run them on CI. According to: 48a222c but given this is potential break, should we reconsider that? |
@danmosemsft |
Also, another point: 48a222c -- was in February 18th, and I don't think these tests that were left enabled have failed ever since until now. |
cc: @tarekgh |
I'll revert #35911. it is good we have a test catch such issues. |
Unfortunately that test is being made to not run in CI in #36138. How often are these tests run locally? |
could someone approve #36143 and merge to unblock? |
I have no idea about these tests. @Anipik @maryamariyan may know better. |
how often this test used to fail before 48a222c? |
Looking at the kusto data now and I’ll update the issue. |
It seems like it has had its hicups, some failures on 4/26, some on 2/13-2/16 and the ones today. Will keep looking at the data to make sure the revert fixed it, let's leave it enabled for now and see how it behaves. Will leave the issue open to track the data. |
Actually closing the issue and if we see it again we can reopen. |
I found an interesting pattern. It seems like there is a test that is corrupting a machine to get in a weird state and when this test runs in that particular machine after that it fails every time. So until that Machine is recycled to a new one if the test happens to run again on that machine it will fail. This test has had 4 bad days since February, and all days that we have had a failure it shows the same pattern, it always fails only on the same machine. cc: @Anipik |
I re-opened: #36138 |
@Anipik @safern I saw this failure in a couple one-off runs I'm doing: https://helix.dot.net/api/2019-06-17/jobs/9b2efc67-36d9-4a8e-9e98-0b969f156ce4/workitems/System.Diagnostics.EventLog.Tests/console Both look to be on DDARM64S-026 |
So @Anipik it seems like your fix didn’t mitigate the issue and the machine is somehow still getting busted and subsequent runs fail in that machine? |
@MattGal would it be possible to get hold of this machine ? |
It's possible but something we generally avoid. I'd prefer we have someone from DDFUN or the First Responders team check it out to ensure the Helix client isn't affected as part of investigation. You'll also need access to corpnet as the only way onto these machines is over a local KVM device. If you think you know what's wrong with the machine, we should probably just have DDFUN fix that. Otherwise ping me on Teams and we can coordinate something. |
@Anipik happened again: Will go ahead and revert the change that re-enabled the tests. |
I don't suppose anyone happened to capture the logged failure? All the links in this issue are now dead. |
Can confirm all the Helix-side stuff is long deleted. |
No failures in last 21 days. Should probably close. Can re-open if it happens again. |
No failures because the test is active-issued. :-/ |
I suspect this is file-in use on the log file, failure to delete, then future tests fail to recreate. These tests are writing to a custom log name which will result in a new file. I might see if I can refactor the tests a bit to avoid this (like using a unique log name). |
Console Log Summary
Builds
Configurations
Helix Logs
I will put up a PR to disable the test.
cc: @dotnet/runtime-infrastructure
The text was updated successfully, but these errors were encountered: