Spurious failure in coverage report test #83262

RalfJung · 2021-03-18T15:07:19Z

In #83257 (comment), run-make-fulldeps\coverage-reports failed on x86_64-msvc-1 even though the PR just bumped the Miri submodule -- so this has to be a spurious failure, right?

@rust-lang/infra maybe you have a good idea for whom to ping about this.

RalfJung · 2021-03-18T15:08:43Z

Cc'ing some people who worked on that test: @richkadel @tmiasko

richkadel · 2021-03-18T15:42:48Z

The MIR pass produced from -Zinstrument-coverage did produce different results, on Windows.

--- expected_show_coverage.doctest.txt	2021-03-18 13:45:52.989271700 +0000
+++ /d/a/rust/rust/build/x86_64-pc-windows-msvc/test/run-make-fulldeps/coverage-reports/coverage-reports/actual_show_coverage.doctest.txt	2021-03-18 14:40:14.025645000 +0000
@@ -74,13 +74,13 @@
    70|      1|/// doctest_crate::fn_run_in_doctests(3);
    71|      1|/// ```
    72|       |///
-   73|      1|fn main() {
-   74|      1|    if true {
-   75|      1|        assert_eq!(1, 1);
+   73|       |fn main() {
+   74|       |    if true {
+   75|       |        assert_eq!(1, 1);
    76|       |    } else {
    77|       |        assert_eq!(1, 2);
    78|       |    }
-   79|      1|}
+   79|       |}

richkadel · 2021-03-18T15:45:32Z

Submitted that comment prematurely.

I'm putting more comments in #83257 because I think it's just as likely the MIRI changed the MIR that the coverage pass needs to generate coverage.

But it's not clear.

It's also from a doctest, which is something @Swatinem worked on in the last couple of months.

Maybe something changed affecting how doctest works.

RalfJung · 2021-03-18T15:47:07Z

The MIR pass produced from -Zinstrument-coverage did produce different results, on Windows.

Indeed, but my PR does not change rustc at all, so this cannot be caused by my PR.

richkadel · 2021-03-18T15:53:06Z

@tmiasko - The timing is very coincidental with your PR landing last night. Based on Ralf's last comment, I would expect other PRs to fail similarly. I don't know how your PR passed, or how this could be spurious, but it is suspect.

If we see more of these failures, you may need to look at reverting the PR. But I think we should make sure this is repeatable first.

@wesleywiser FYI

tmiasko · 2021-03-18T20:14:33Z

Timing indeed makes that PR suspect. That being said, I don't see how it could introduce any functional changes for builds without MIR inlining, like in this test case.

richkadel · 2021-03-18T20:33:08Z

Also, @RalfJung's PR succeeded on @bors retry.

Hard to understand how that could have happened, but for now let's assume it was a random and hopefully uncommon issue.

Mark-Simulacrum · 2021-05-13T23:22:25Z

@richkadel we've seen this several times since then, and it's likely that we'll want to address it either by removing the test or repeatedly running it if it fails, etc. -- do you have any thoughts on how it may have occurred?

The next step is likely trying to run builds in a loop locally, though unfortunately it looks like the issue is somehow tied to MSVC which will make many of the infra team members not have immediate access to machines on that platform I believe...

richkadel · 2021-05-13T23:33:35Z

Is it always the doc test?

Fyi, @Swatinem if it is.

Swatinem · 2021-05-14T08:53:48Z

Hm, this might be related to this comment here:

rust/src/test/run-make-fulldeps/coverage-reports/expected_show_coverage.doctest.txt

Lines 70 to 75 in 91f2e2d

    
           60|       |//! // this `main` is not shown as covered, as it clashes with all the other 
        
           61|       |//! // `main` functions that were automatically generated for doctests 
        
           62|       |//! fn main() -> Result<(), SomeError> { 
        
           63|       |//!     doctest_main() 
        
           64|       |//! } 
        
           65|       |//! ```

LLVM assumes functions are unique by name. But every doctest has its own main functions which clash. And it seems like they clash with the main function defined in the crate as well.

Not sure on what this depends. But the .profraw files have the pid in their name, so the ordering is pretty random. Might be a reason for these spurious failures.

rust/src/test/run-make-fulldeps/coverage-reports/Makefile

Lines 108 to 110 in 91f2e2d

	"$(LLVM_BIN_DIR)"/llvm-profdata merge --sparse \
	"$(TMPDIR)"/$@-*.profraw \
	-o "$(TMPDIR)"/[email protected]

rust/src/test/run-make-fulldeps/coverage-reports/Makefile

Lines 119 to 125 in 91f2e2d

    
           $(call BIN,"$(TMPDIR)"/$@) \ 
        
           $$( \ 
        
           	for file in $(TMPDIR)/rustdoc-$@/*/rust_out; \ 
        
           	do \ 
        
           	[ -x "$$file" ] && printf "%s %s " -object $$file; \ 
        
           	done \ 
        
           ) \

I think the action item here is:

rename the main function in the crate so it does not clash
maybe add an explicit coverage-ignore attribute to the main inside of the one doctest, so it does not accidentally have coverage.

Swatinem · 2021-05-14T09:02:07Z

Or, I think I named the function main specifically to make sure that it overrides any main defined in the doctests. (and because its a binary crate so it needs a main 🙄)
In that case it might be better to just name the .profraw of the main coverage run differently and explicitly order it as the first argument to the merge.

RalfJung · 2021-05-14T09:02:51Z

That sounds more like a bug in coverage collection/reporting then? Surely that should work properly also when users have multiple functions with the same name (to the extend that Rust allows that).

Swatinem · 2021-05-14T10:33:07Z

@RalfJung I think that is some kind of logic in llvm that uses only one functions data if multiple functions with the same mangled (C) name are present.

I have been playing around with ordering of CLI parameters and file naming, but wasn’t able to reproduce this.

My hunch is that maybe it is a pid reuse issue, in which case the earlier coverage file is overwritten. I have a PR that tries to reduce the chance of this.

Or maybe llvm is using some kind of timestamp to decide which record to keep and which to discard. I have too little insight to know.

richkadel · 2021-05-14T17:14:10Z

It looks like @Aaron1011 just hit this same spurious failure, in #85211.

I hadn't heard anything about this issue since @RalfJung first reported it on March 18. I don't know if the rate is accelerating, but I see there were other reports mentioned here, and agree we should do something.

@Swatinem - I don't think MSVC testing of the coverage of doctests adds a lot of additional value. It probably tests the variability in Makefile and shell scripting more than any variability in coverage of doc tests on Windows.

I suggest just disabling this test on MSVC, at least for now, until we someday migrate coverage tests off of Makefiles.

I am out of office today through next Monday (my daughter is graduating from University of Illinois!) and don't have much time to work (and I'm slow to respond, sorry).

@Swatinem - I don't know if you have time, but it may not be hard to add a directive to ignore MSVC for specific coverage tests. I think we have one or two similar directives in the Makefile.

richkadel · 2021-05-14T17:29:27Z

My hunch is that maybe it is a pid reuse issue, in which case the earlier coverage file is overwritten. I have a PR that tries to reduce the chance of this.

@Swatinem - I think my hunch is similar or the same. I suspect it might have something to do with the OS/shell environment in Windows for MSVC builds. PID reuse, or timing, or file lock, or something that we take for granted on a Unixy environment but doesn't work as consistently on Windows (or at least not how we might expect, coming from more of a Unix background).

That seems more likely than an LLVM bug, to me.

I think migrating off of Makefiles would most likely help (#85009) if our hunches are right.

FYI: @tmandry @wesleywiser

Avoid possible filename collision in coverage tests Previously, coverage tests were writing profiler data to files based on their pid. As rustdoc spawns each doctest as its own process, it might be possible in rare cases that a pid is being reused which would cause a file to be overwritten, leading to incorrect coverage results. should help with rust-lang#83262 r? `@tmandry`

Enselic · 2023-12-17T12:59:22Z

Triage: Has anyone of you seen this spurious failure in the last say two years? Or can we close this issue as obsolete?

RalfJung changed the title ~~Spurious failure in coverage report~~ Spurious failure in coverage report test Mar 18, 2021

jonas-schievink added A-code-coverage Area: Source-based code coverage (-Cinstrument-coverage) A-spurious Area: Spurious failures in builds (spuriously == for no apparent reason) labels Mar 18, 2021

ghost mentioned this issue Apr 26, 2021

Fix 'const-stable since' of reverse_bits #84543

Merged

Mark-Simulacrum mentioned this issue May 3, 2021

[beta] remove assert_matches #84759

Closed

the8472 mentioned this issue May 6, 2021

lazify backtrace formatting for delayed diagnostics #84965

Merged

Mark-Simulacrum mentioned this issue May 10, 2021

Fix tidy platform-specific code check #84677

Merged

ssomers mentioned this issue May 11, 2021

BTree: no longer copy keys and values before dropping them #84904

Merged

Swatinem mentioned this issue May 14, 2021

Avoid possible filename collision in coverage tests #85283

Merged

Aaron1011 mentioned this issue May 14, 2021

Preserve SyntaxContext for invalid/dummy spans in crate metadata #85211

Merged

jhpratt mentioned this issue May 29, 2021

Make Step trait safe to implement #83772

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spurious failure in coverage report test #83262

Spurious failure in coverage report test #83262

RalfJung commented Mar 18, 2021

RalfJung commented Mar 18, 2021

richkadel commented Mar 18, 2021

richkadel commented Mar 18, 2021

RalfJung commented Mar 18, 2021

richkadel commented Mar 18, 2021

tmiasko commented Mar 18, 2021

richkadel commented Mar 18, 2021

Mark-Simulacrum commented May 13, 2021

richkadel commented May 13, 2021

Swatinem commented May 14, 2021

Swatinem commented May 14, 2021

RalfJung commented May 14, 2021

Swatinem commented May 14, 2021

richkadel commented May 14, 2021

richkadel commented May 14, 2021

Enselic commented Dec 17, 2023

Spurious failure in coverage report test #83262

Spurious failure in coverage report test #83262

Comments

RalfJung commented Mar 18, 2021

RalfJung commented Mar 18, 2021

richkadel commented Mar 18, 2021

richkadel commented Mar 18, 2021

RalfJung commented Mar 18, 2021

richkadel commented Mar 18, 2021

tmiasko commented Mar 18, 2021

richkadel commented Mar 18, 2021

Mark-Simulacrum commented May 13, 2021

richkadel commented May 13, 2021

Swatinem commented May 14, 2021

Swatinem commented May 14, 2021

RalfJung commented May 14, 2021

Swatinem commented May 14, 2021

richkadel commented May 14, 2021

richkadel commented May 14, 2021

Enselic commented Dec 17, 2023