Stabilize unit tests with non-`()` return type #48854

nikomatsakis · 2018-03-08T22:54:55Z

CURRENT STATUS:

Awaiting someone to write the stabilization PR! Mentoring instructions here.

This is a sub-issue of the larger tracking issue devoted to the ? in main RFC. This issue corresponds to stabilizing the use of unit tests whose return type is something other than (). It is basically an extension of #48453, which discussed the same thing for the main function.

What would be stabilized

As before, unit tests that return ():

Succeed unless they panic
Can be annotated with #[should_panic]

However, unit tests can now have other return types:

The return type must implement the Termination trait
- Test and expected error
The test passes if the return value is considered a "success" (e.g., an Ok from Result)
- Test
If the type is not (), then #[should_panic] is disallowed
- Test and expected error
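As a minimal sketch of the rules above (not taken from the linked tests), a unit test can return a Result and use ? in its body. The helper `parse_sum` and the test name are hypothetical and exist only for illustration. The test passes because it returns Ok(()); had either parse failed, the Err returned through ? would have counted as a test failure.

```rust
use std::num::ParseIntError;

// Hypothetical helper, used only for illustration.
fn parse_sum(a: &str, b: &str) -> Result<u32, ParseIntError> {
    Ok(a.parse::<u32>()? + b.parse::<u32>()?)
}

// The return type implements Termination, so `?` can be used in the body.
// Returning Ok(()) is treated as success; an Err would fail the test.
#[test]
fn sums_two_numbers() -> Result<(), ParseIntError> {
    assert_eq!(parse_sum("2", "40")?, 42);
    Ok(())
}
```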

Unknowns

Question: what's up with #[should_panic] and #[bench] anyway? (Example)
Can we use this with rustdoc yet?

Older proposal

Possible changes needed before stabilizing

Adjust libtest runner to distinguish #[should_panic] from #[should_error], as suggested by @scottmcm?
Successful examples of using with rustdoc

What would be stabilized

Unit tests (and #[bench] functions) would join the main function in being allowed to return any value that implements the Termination trait.

This commits us to the following (each link is to a test):

tests that return Result and other Termination types. In the case of Result:
- Ok means test passes
- Err causes test failure
- more generally, success or failure is determined by invoking report(), although that detail is not being stabilized
benchmarks work the same way (Ok, Err)
- note that #[bench] is still effectively unstable though in general

What remains unstable

The library details: where the trait is and its methods
- Which methods the test runner invokes on the trait -- this is unstable as the trait itself is unstable


nikomatsakis · 2018-03-08T22:58:02Z

@rfcbot fcp merge

I propose that we further stabilize unit tests etc that return non-() return types.

rfcbot · 2018-03-08T22:58:02Z

Team member @nikomatsakis has proposed to merge this. The next step is review by the rest of the tagged teams:

Concerns:

experience (Stabilize unit tests with non-() return type #48854 (comment))

Once a majority of reviewers approve (and none object), this will enter its final comment period. If you spot a major issue that hasn't been raised at any point in this process, please speak up!

See this document for info about what commands tagged team members can give me.

scottmcm · 2018-03-08T23:25:36Z

Thanks for including the test links! Super helpful.

I think letting Ok(()) (and ExitCode::SUCCESS 🙂) be test pass is great.

This one surprised me, though:

#[test]
#[should_panic]
fn not_a_num() -> Result<(), ParseIntError> {
    let _: u32 = "abc".parse()?;
    Ok(())
}

I get cognitive dissonance from that, because my brain says "what do you mean parse should panic? I thought the point was that it doesn't panic!", and I think it would be unfortunate if the test were to continue to pass should it actually start panicking.

I think I was expecting

(unannotated): test pass is .report()==SUCCESS, everything else is test fail
[should_panic]: catch_panic caught a panic is test pass, everything else is test fail
[should_error] (hypothetical): test pass is .report()!=SUCCESS, everything else is test fail
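The three expectations listed above can be written down as a small decision table. This is only a sketch of the proposal as described in this comment, not libtest code: the `Expectation` and `Outcome` enums and the `test_passes` function are hypothetical names, and `ShouldError` corresponds to the hypothetical #[should_error] attribute.

```rust
// What actually happened when the test body ran.
#[derive(Clone, Copy, Debug, PartialEq)]
enum Outcome {
    Panicked,
    ReportedSuccess, // .report() == SUCCESS
    ReportedFailure, // .report() != SUCCESS
}

// How the test was annotated (hypothetical model of the attributes).
#[derive(Clone, Copy, Debug, PartialEq)]
enum Expectation {
    Plain,       // unannotated
    ShouldPanic, // #[should_panic]
    ShouldError, // #[should_error], which does not exist today
}

// Each annotation accepts exactly one outcome; everything else is a failure.
fn test_passes(expectation: Expectation, outcome: Outcome) -> bool {
    matches!(
        (expectation, outcome),
        (Expectation::Plain, Outcome::ReportedSuccess)
            | (Expectation::ShouldPanic, Outcome::Panicked)
            | (Expectation::ShouldError, Outcome::ReportedFailure)
    )
}
```

Under this table, a #[should_panic] test that merely returns an Err fails, which is the behavior argued for above.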

(Unimportant thought: a -> ! test seems fundamentally useless. But probably not worth prohibiting?)

cramertj · 2018-03-09T00:07:50Z

Unimportant thought: a -> ! test seems fundamentally useless. But probably not worth prohibiting?

#[test]
#[should_panic]
fn panic_panics() -> ! { panic!() }

😛

sgrif · 2018-03-09T00:08:35Z

I think this definitely warrants eventually adding an alias for should_panic (perhaps should_error or should_fail), but that seems like it's strictly orthogonal to stabilizing this

aturon · 2018-03-09T00:12:07Z

cc @jonhoo re: test frameworks

scottmcm · 2018-03-09T00:14:50Z

@sgrif Note that I'm explicitly not suggesting that it should be an alias: I would like should_panic to only result in a test pass if there was an actual panic, which is something that I don't think we'd want to change after stabilization if there were a bunch of tests using should_panic to check for Err.

sgrif · 2018-03-09T00:17:28Z

@scottmcm Fair point. I agree.

jonhoo · 2018-03-09T00:30:44Z

In the context of custom test frameworks, this seems fine. The path we're taking going forward is that what ships with Rust by default is the same thing that libtest does at the time of the change, so committing to this now just means that the libtest we build using ctf will also have to support it (which seems fine).

Note that with ctfs, new test frameworks can implement entirely new testing annotations, and are not bound to the same annotations, signatures, and semantics as the current built-in test support.

Also cc @Manishearth .

nrc · 2018-03-09T03:55:56Z

@rfcbot concern experience

Has this had much use? It's the first I knew about it even being a thing (I've not been following the main return types stuff) so I wouldn't be surprised if this has not had much usage. In which case I'm wary of stabilising without getting some experience of it in practice.

jonhoo · 2018-03-09T05:46:25Z

Yeah, I didn't even know this had landed on nightly yet? Super excited about the feature though!

nikomatsakis · 2018-03-12T14:00:03Z

I don't think it's had much experience. I agree with @scottmcm that #[should_panic] surprised me a bit -- so perhaps I was premature in moving to stabilize. (I'd welcome a PR to adjust that, though.)

I'm ok with waiting a bit, although it makes stabilizing the "main" case a bit harder, since we have to split the two. But I would encourage those who have doubts about the feature to experiment -- there really isn't much more to it, I don't think.

In other words: under this feature, there are really three (and a half) distinct possible outcomes:

Test panics
Test returns a "successful" result (libc::EXIT_SUCCESS)
Test returns a "failed" result
- This could be subdivided to depend on the specific return code

I'm personally not inclined to subdivide the final case: I feel like if you want to test for a specific return code, you ought to just add an assert into your test.
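To make the three outcomes concrete, here is a rough sketch of how a runner could tell them apart using std::panic::catch_unwind. It is an illustration under simplifying assumptions, not how libtest is implemented: `Outcome` and `classify` are hypothetical names, and the test body is modeled as a closure returning Result<(), E> rather than an arbitrary Termination type.

```rust
use std::panic::{self, UnwindSafe};

#[derive(Debug, PartialEq)]
enum Outcome {
    Panicked,
    Success,
    Failure,
}

// Runs the test body and sorts the result into one of the three outcomes.
fn classify<E, F>(test: F) -> Outcome
where
    F: FnOnce() -> Result<(), E> + UnwindSafe,
{
    match panic::catch_unwind(test) {
        Err(_) => Outcome::Panicked,    // the body unwound
        Ok(Ok(())) => Outcome::Success, // the body returned a "successful" value
        Ok(Err(_)) => Outcome::Failure, // the body returned a "failed" value
    }
}
```

Note that the default panic hook still prints the panic message to stderr when the body panics; catch_unwind only stops the unwinding.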

(One other reason for holding back on this feature: a major motivation -- perhaps the motivation -- was using it from within rustdoc, so that those examples can show idiomatic code -- I'm not sure if that works yet or what it takes to make it work.)

nikomatsakis · 2018-03-12T14:03:50Z

@rfcbot fcp cancel

I'm going to cancel the FCP. I added @scottmcm's concern to the header, as well as the desire to see a rustdoc example in practice. =)

It might however be hard to make #[should_panic] not cover the case of a failed return. Or, well, maybe not hard, but that will require deeper changes to libtest.

Thinking more about it, I'm not sure how desirable it is. I agree #[should_panic] is confusing but I wonder if it makes sense to introduce more than one sense of unit test "failure". Is that making things overly complex? (We could rename #[should_panic])

rfcbot · 2018-03-12T14:03:51Z

@nikomatsakis proposal cancelled.

shepmaster · 2018-03-12T15:05:25Z

I agree #[should_panic] is confusing

I'd use stronger language than "confusing". If the only test framework we can use (and will forever be the default test framework) says that "returning a Result is the same as a panic", that's going to work against any kind of teaching that we have ever done that they are different methods of error handling with different use cases.

Tests are something that should check specific conditions. #[should_panic] is already fairly loose in that you can't use it to identify a specific line of code that should generate a panic:

#[test]
#[should_panic(expected = "index out of bounds")]
fn e() {
    let a = [1, 2, 100];
    let b = a[3]; // oops
    let c = t(&a, b);
}

fn t(a: &[usize], idx: usize) {
    a[idx];
}

It's my belief that should_panic was basically a workaround for the fact that we didn't have a lightweight, stable way of catching panics at Rust 1.0. In a magical world of test frameworks, I'd want such an assertion to be part of the test, not just metadata:

#[test]
fn e() {
    let a = [1, 2, 100];
    let b = a[3]; // oops
    assert_that(|| {
        let c = t(&a, b);
    }).panics()
        .with("index out of bounds")
}
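The `assert_that(...).panics()` API above is imagined, but something in the same spirit can be approximated today with std::panic::catch_unwind. The following is a minimal sketch, assuming a hypothetical helper named `panic_message`; `t` is adapted from the example above so that it returns the indexed value. Only the closure passed to the helper is allowed to panic, so the assertion is tied to one specific call.

```rust
use std::panic::{self, UnwindSafe};

// Hypothetical helper: runs `f` and returns the panic message if it panicked,
// or None if it completed normally.
fn panic_message<F: FnOnce() + UnwindSafe>(f: F) -> Option<String> {
    let payload = panic::catch_unwind(f).err()?;
    // Panic payloads are usually a &'static str or a String.
    if let Some(s) = payload.downcast_ref::<&str>() {
        Some((*s).to_string())
    } else if let Some(s) = payload.downcast_ref::<String>() {
        Some(s.clone())
    } else {
        Some(String::new())
    }
}

fn t(a: &[usize], idx: usize) -> usize {
    a[idx]
}

#[test]
fn only_the_call_to_t_may_panic() {
    let a = [1, 2, 100];
    let message = panic_message(|| {
        t(&a, 3);
    })
    .expect("t should have panicked");
    assert!(message.contains("index out of bounds"));
}
```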

IMO, returning a Result::Err from a test should always cause the test to fail. If it's expected that the operation fails, that should be part of the test:

assert!(f.is_err());

nikomatsakis · 2018-03-13T15:28:06Z

IMO, returning a Result::Err from a test should always cause the test to fail. If it's expected that the operation fails, that should be part of the test:

Hmm. Plausible. Maybe that's how we should refactor the test runner's internal workings, actually. That is, we could refactor #[should_panic] tests so that they catch the panic (and make the test runner consider any uncaught panic as a failure).

nikomatsakis · 2018-03-22T15:41:56Z

I've been thinking about this a bit. Here is one simple possible design:

#[should_panic] can only be applied to tests with () return type and continues to have the same meaning.
For non-should-panic tests, the return type T defines the error result using the Termination trait.

Under this version, if we ignore #[should_panic] then:

A panic is always a failure.
The "non-Ok" variant of the return is also a failure.
- If you want that to be considered success, you should write assert!(foo.is_err());
This seems nicely biased towards "all the code you see in the test executed without panic'ing or returning an error".

With #[should_panic], the test either returns () (fails) or panics (succeeds).
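A short sketch of what this design looks like from the test author's side. The functions `checked_div` and `first` are hypothetical and exist only for illustration. The #[should_panic] test returns (), while the Result-returning test checks an expected error with an explicit assert rather than by returning it.

```rust
// Hypothetical fallible function.
fn checked_div(a: u32, b: u32) -> Result<u32, String> {
    if b == 0 {
        Err("division by zero".to_string())
    } else {
        Ok(a / b)
    }
}

// Hypothetical function that panics on an empty slice.
fn first(items: &[u32]) -> u32 {
    items[0]
}

// #[should_panic] stays on a test with a () return type.
#[test]
#[should_panic(expected = "index out of bounds")]
fn first_of_empty_panics() {
    first(&[]);
}

// An expected error is asserted inside the test; any Err that escapes
// through `?` would make the test fail.
#[test]
fn div_by_zero_is_an_error() -> Result<(), String> {
    assert!(checked_div(1, 0).is_err());
    assert_eq!(checked_div(8, 2)?, 4);
    Ok(())
}
```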

Thoughts?

shepmaster · 2018-03-23T14:55:07Z

all the code you see in the test executed without panic'ing or returning an error

I think this is the right mindset. I've always envisioned ? in tests to be a shorthand for what is currently an unwrap/expect: "theoretically this can fail, but it shouldn't in this test".

scottmcm · 2018-03-25T08:05:29Z

That sounds great, @nikomatsakis; it resolves all my concerns about #[should_panic].

And it allows the great "replace .unwrap() with ?" crusade to continue 🙂 I added unit tests as another place a "whatever, I just want to use ?" type would be nice (rust-lang/rfcs#2367).

pnkfelix · 2018-04-30T21:52:31Z

@rfcbot reviewed

rfcbot · 2018-04-30T21:52:35Z

🔔 This is now entering its final comment period, as per the review above. 🔔

rfcbot · 2018-05-10T21:55:34Z

The final comment period, with a disposition to merge, as per the review above, is now complete.

nikomatsakis · 2018-05-24T18:50:42Z

The final comment period is up, let's do this! There are general instructions on how to stabilize a feature given here:

https://forge.rust-lang.org/stabilization-guide.html

In this case, the feature name in question is termination_trait_test.

Dylan-DPC-zz · 2018-05-24T19:16:24Z

hi @nikomatsakis I can work on this


Dylan-DPC-zz · 2018-06-11T19:11:25Z

Since #51298 is merged, we can close this issue?

frewsxcv · 2018-07-03T01:20:28Z

Pretty sure this can be closed now that this is stabilized, thanks @Dylan-DPC!

shepmaster · 2018-07-25T02:41:57Z

I'm missing something. I went to use this, but what's all this junk about 1 and 0 that has nothing to do with my test? This is what we stabilized?

#[test]
fn example() -> Result<(), Box<dyn std::error::Error>> {
    Err(String::from("boom"))?;
    Ok(())
}

(Playground)

Output:

running 1 test
test example ... FAILED

failures:

---- example stdout ----
Error: StringError("boom")
thread 'example' panicked at 'assertion failed: `(left == right)`
  left: `1`,
 right: `0`', libtest/lib.rs:326:5
note: Run with `RUST_BACKTRACE=1` for a backtrace.


failures:
    example

test result: FAILED. 0 passed; 1 failed; 0 ignored; 0 measured; 0 filtered out
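The `left: 1, right: 0` in that output is consistent with the runner converting the return value to an exit code and asserting that it equals 0. The sketch below models that with a simplified, hypothetical `Report` trait standing in for the unstable details of Termination; it is an assumption-laden illustration of why the numbers show up, not the actual libtest source.

```rust
use std::fmt::Debug;

// Simplified stand-in for the Termination trait (hypothetical).
trait Report {
    fn report(self) -> i32;
}

impl Report for () {
    fn report(self) -> i32 {
        0
    }
}

impl<E: Debug> Report for Result<(), E> {
    fn report(self) -> i32 {
        match self {
            Ok(()) => 0,
            Err(err) => {
                // Mirrors the "Error: ..." line in the output above.
                eprintln!("Error: {:?}", err);
                1
            }
        }
    }
}

// A runner that checks the code with assert_eq! produces the
// "left: 1, right: 0" message when the test returns an Err.
fn run_test<T: Report>(test: fn() -> T) {
    assert_eq!(test().report(), 0);
}
```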

ehuss · 2018-07-25T02:58:16Z

@shepmaster I think there is a PR at #52453 to fix that.

shepmaster · 2018-07-25T12:56:25Z

Thanks. It's a shame this will go out in the suboptimal form in 1.28, and presumably 1.29, but hopefully we will see it in 1.30!

nalply · 2020-12-03T08:41:16Z

Very late to the party! Sorry!

@nikomatsakis wrote at the top:

If the type is not (), then #[should_panic] is disallowed

Why?

I wrote a test runner for a complicated test setup and tear-down use case using catch_unwind() and resume_unwind(). I would like to test where a test returns the Err case and check whether the test runner panics.

In other words, I wrote a test where I return Result together with #[should_panic]:

  #[test]
  #[should_panic]
  fn return_err() -> Result<()> {
    let _ = Test::init().run_silent(|_| display("TEST ERROR")?);
    Ok(())
  }

Meanwhile, I commented out this test because it does not compile. I have to admit that I am confused and probably what I am trying is based on faulty assumptions, however the question remains:

Why is #[should_panic] disallowed on non-unit returning tests?
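One possible workaround under the stabilized rules, sketched with a hypothetical function `run_and_report` standing in for the test runner described above: keep the test's return type as () and turn the Err into a panic explicitly, for example with unwrap. The test then panics on the error path, which #[should_panic] accepts.

```rust
// Hypothetical stand-in for code that returns an error.
fn run_and_report() -> Result<(), String> {
    Err("TEST ERROR".to_string())
}

// The test returns (), so #[should_panic] is allowed. The unwrap converts
// the Err into a panic whose message contains the error text.
#[test]
#[should_panic(expected = "TEST ERROR")]
fn error_turns_into_panic() {
    run_and_report().unwrap();
}
```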

dralley · 2021-06-30T03:25:04Z

@nalply I think the answer is #48854 (comment) but I'd love clarification on this also.

nikomatsakis added T-lang Relevant to the language team, which will review and decide on the PR/issue. C-tracking-issue Category: An issue tracking the progress of sth. like the implementation of an RFC labels Mar 8, 2018

rfcbot added the proposed-final-comment-period Proposed to merge/close by relevant subteam, see T-<team> label. Will enter FCP once signed off. label Mar 8, 2018

This was referenced Mar 8, 2018

Tracking issue for RFC 1937: ? in main #43301

Closed

Stabilize main with non-() return types #48453

Closed

nrc added the T-dev-tools Relevant to the dev-tools subteam, which will review and decide on the PR/issue. label Mar 9, 2018

rfcbot removed the proposed-final-comment-period Proposed to merge/close by relevant subteam, see T-<team> label. Will enter FCP once signed off. label Mar 12, 2018

scottmcm mentioned this issue Mar 25, 2018

Make an "I just want to use ?" type for use as a main return type rust-lang/rfcs#2367

Open

This was referenced Apr 3, 2018

update the book for unit tests with a non-() return type rust-lang/book#1284

Closed

update for unit tests with a non-() return type rust-lang/rust-by-example#1067

Closed

nikomatsakis mentioned this issue Apr 12, 2018

Adjust design of unit tests with non-() type #49909

Closed

rfcbot added final-comment-period In the final comment period and will be merged soon unless new substantive objections are raised. and removed proposed-final-comment-period Proposed to merge/close by relevant subteam, see T-<team> label. Will enter FCP once signed off. labels Apr 30, 2018

rfcbot added finished-final-comment-period The final comment period is finished for this PR / Issue. and removed final-comment-period In the final comment period and will be merged soon unless new substantive objections are raised. labels May 10, 2018

nikomatsakis added E-easy Call for participation: Easy difficulty. Experience needed to fix: Not much. Good first issue. E-mentor Call for participation: This issue has a mentor. Use #t-compiler/help on Zulip for discussion. labels May 24, 2018

Dylan-DPC-zz mentioned this issue Jun 2, 2018

Stabilize unit tests with non-() return type #51298

Merged

Mark-Simulacrum added a commit to Mark-Simulacrum/rust that referenced this issue Jun 6, 2018

Rollup merge of rust-lang#51298 - Dylan-DPC:stabilise/termination-tes…

879254b

…t, r=nikomatsakis Stabilize unit tests with non-`()` return type References rust-lang#48854

Mark-Simulacrum added a commit to Mark-Simulacrum/rust that referenced this issue Jun 7, 2018

Rollup merge of rust-lang#51298 - Dylan-DPC:stabilise/termination-tes…

b252339

…t, r=nikomatsakis Stabilize unit tests with non-`()` return type References rust-lang#48854

Mark-Simulacrum added a commit to Mark-Simulacrum/rust that referenced this issue Jun 8, 2018

Rollup merge of rust-lang#51298 - Dylan-DPC:stabilise/termination-tes…

d68098a

…t, r=nikomatsakis Stabilize unit tests with non-`()` return type References rust-lang#48854

frewsxcv closed this as completed Jul 3, 2018

nikomatsakis mentioned this issue Nov 2, 2018

Tracking issue for RFC #2175: or-patterns in if / while let expressions #48215

Closed

3 tasks

scottmcm mentioned this issue Feb 18, 2019

Modify doctest's auto-fn main() to allow Results #56470

Merged

meiomorphism mentioned this issue Dec 18, 2020

#[should_panic]'s return-type check ignores procedural-macro transformations #80143

Open

carols10cents mentioned this issue Nov 28, 2021

Chapter 11.1 - Explain that it's not possible to expect a test to return an error when the return type is Result rust-lang/book#2085

Closed
