Document (or fix) limitations of stateful testing #1300

DRMacIver · 2018-05-22T10:11:04Z

Because stateful testing is currently built on find, it had a number of features supported by @given that do not work. These include:

Coverage guided testing (this is a particularly glaring omission because stateful testing is where this would be most useful!)
Statistics
(Most) health checks
Deadlines

Currently this isn't documented anywhere. We should document this at a minimum, though better yet would be rebuilding stateful testing and given to have more of a common mechanism.

The text was updated successfully, but these errors were encountered:

Zac-HD · 2018-05-22T10:21:40Z

Related to #1213, #1190, #164.

DRMacIver · 2018-05-22T10:23:23Z

I'm unsure how I feel about #164 these days. I think the reality may be that nobody dislikes the stateful API as much as I do and everyone hates my attempts to improve it more. Agreed those tickets are related, but I think #1213 is the only one that's directly related.

Zac-HD · 2018-05-22T10:44:43Z

I find bundles to be endlessly confusing, and would ~~just rip them out~~ deprecate them in favor of instance variables and the data() strategy.

Other than that, the inconsistencies you note above, and a trivial sense that I would have named things differently, it's not bad! The structure of a class with methods makes sense, and while I would probably use a class decorator (ala Attrs) to transform it and put a test in the global namespace IIRC that's hard to do compatibly.

Uh, that's a "hate your tools" rant, but I actually do quite like stateful testing despite seeing the flaws. ❤️

DRMacIver · 2018-05-22T11:10:15Z

I find bundles to be endlessly confusing, and would just rip them out deprecate them in favor of instance variables and the data() strategy.

I think that's more evidence that they need better documentation, because instance variables and the data strategy don't come close to replacing their functionality!

DRMacIver · 2018-05-22T11:11:46Z

(But I agree that most places where you can replace a bundle with an instance variable, you probably should)

lennax · 2018-05-24T16:43:51Z

This is a minor limitation, but @rule and @pytest.mark.parametrize are not compatible (unlike @given). I didn't see this reported in a separate issue.

MWE:

import pytest

from hypothesis import stateful
from hypothesis.strategies import booleans


class PytestMachine(stateful.RuleBasedStateMachine):

    @stateful.rule(i=booleans())
    @pytest.mark.parametrize('hi', (1, 2, 3))
    def parametrize_after_rule(self, hi, i):
        pass


TestPytestMachine = PytestMachine.TestCase

Failure:

 test_compat.py::TestPytestMachine::runTest <- [...]/python3.6/site-packages/hypothesis/stateful.py  
[...]/python3.6/site-packages/hypothesis/stateful.py:192: in runTest
    run_state_machine_as_test(state_machine_class)
[...]/python3.6/site-packages/hypothesis/stateful.py:110: in run_state_machine_as_test
    breaker.run(state_machine_factory(), print_steps=True)
[...]/python3.6/site-packages/hypothesis/stateful.py:248: in run
    state_machine.execute_step(value)
[...]/python3.6/site-packages/hypothesis/stateful.py:589: in execute_step
    result = rule.function(self, **data)
E   TypeError: parametrize_after_rule() missing 1 required positional argument: 'hi'
---------------------------------- Hypothesis ----------------------------------
Step #1: parametrize_after_rule(i=False)

Zac-HD · 2018-06-29T13:32:12Z

Some notes on rule-based stateful API friendliness, prompted by #1370:

It would be nice to have @invariant work (you actually need @invariant())
Similarly, @rule has a terrible error if used without calling it (TypeError trying to iterate over the decorated method as targets), which is an obvious mistake if you try to use it @given(...) to supply inputs and @rule to mark the method as a rule. I'm not inclined to do runtime checking for mixing with @given but a better message for decorate-time misuse seems reasonable.

Zac-HD · 2018-07-05T07:33:12Z

We would also like to provide the parts required to implement stateful testing downstream as public API, for use by e.g. async libraries. See #1371 for a proof-of-concept of how this might work.

Technically speaking you can just inherit from GenericStateMachine and replace the run method, but in practice I'm not sure how well this would work, mostly because cu.many(...) is crucial to have nice looping behavior. On the one hand a public equivalent would be nice; on the other just shrinking magically would be even better!

njsmith · 2018-07-06T08:03:20Z

Regarding async stateful testing, @Zac-HD wrote over here, on @touilleMan's PR:

Async stateful tests pose some harder problems in general though, because the order of the steps is no longer the only factor in the ordering - we might need a way for Hypothesis to take charge of the scheduler. In Trio's case this is actually OK as it's mostly-fair and we already seize the global PRNG so the random portion is reproducible. However, adversarial scheduling of coroutines and fault injection (timeouts, cancellations, mock network issues, etc.) would be very powerful for testing - as well as demanding library- and domain-specific implementation and user-facing controls

I'm replying here so my comment doesn't get lost on a random closed PR :-).

There are a lot of sources of possible non-determinism in your average Trio program (or any other program that uses concurrency or I/O). Providing ways to control this for testing is really important, which is why Trio provides things like a deterministic clock and deterministic "network connections". I'd love to have Hypothesis strategies like "a network connection with random chunking and delays", or "scheduler that randomly reassigned priorities to catch race conditions". However, I think all this is mostly orthogonal to stateful testing? All of these things are useful/necessary with regular @given-based tests (or even old-school non-Hypothesis-based tests), right?

Zac-HD · 2018-07-06T15:09:13Z

I agree on all counts - including both usefulness and orthogonality to stateful testing - but the implementation and more detailed discussion of that really belongs on a Trio issue tracker.

(Personally I'd consider having an optional dependency on Hypothesis within trio.testing, or split it out into a new package hypothesis-trio that could provide all those strategies. Getting further off-topic though...)

touilleMan · 2018-07-07T21:33:01Z

@Zac-HD @njsmith so what would be the next step now ?

I'm thinking to divide my previous POC into two PRs:

one for hypothesis containing only the GenericStateMachine._custom_runner hook (btw I think _custom_runner is not a good name so suggestions wanted here ^^)
one for trio adding hypothesis stateful support in trio.testing

Zac-HD · 2018-07-08T14:57:26Z

I don't think we (collectively) have enough experience with this kind of testing to write a useful API on the first attempt, so I'm feeling very cautious about adding something that would have to be supported for a long time.

Instead, I'd rework your proof-of-concept into a standalone library hypothesis-trio, with a class TrioGenericStateMachine that overrides the .run method directly. This can also be our basis for e.g. Hypothesis-based mock network objects, scheduler experiments, etc.

I'll be basically unavailable for the next week, but keen to be involved once my schedule calms down a little.

touilleMan · 2018-07-16T18:08:16Z

@Zac-HD @njsmith I've created the hypothesis-trio project with my POC in it.

Currently there is a monkey patch of hypothesis.stateful.StateMachineRunner.run that kicks in when hypothesis_trio.stateful is importer, it's not something I'm happy about but the alternative was to provide a custom version of StateMachineRunner, StateMachineSearchStrategy, find_breaking_runner and run_state_machine_as_test given they all make reference of each others...

Anyway, the good news is from now on we have hypothesis stateful working with trio \o/

Zac-HD added the enhancement it's not broken, but we want it to be better label May 22, 2018

This was referenced Jul 1, 2018

Lots of small documentation improvements #1375

Merged

RuleBasedStateMachine doesn't use example on initialize #1384

Closed

Zac-HD mentioned this issue Jul 5, 2018

Stateful async POC #1371

Closed

Zac-HD self-assigned this Sep 29, 2018

Zac-HD mentioned this issue Sep 29, 2018

New given-based backend for stateful tests #1609

Merged

Zac-HD added the docs documentation could *always* be better label Dec 5, 2018

Zac-HD mentioned this issue Dec 5, 2018

Confusion about mutliple targets in rule #1681

Closed

Zac-HD closed this as completed in #1609 Dec 18, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Document (or fix) limitations of stateful testing #1300

Document (or fix) limitations of stateful testing #1300

DRMacIver commented May 22, 2018

Zac-HD commented May 22, 2018

DRMacIver commented May 22, 2018

Zac-HD commented May 22, 2018

DRMacIver commented May 22, 2018

DRMacIver commented May 22, 2018

lennax commented May 24, 2018

Zac-HD commented Jun 29, 2018 •

edited

Loading

Zac-HD commented Jul 5, 2018

njsmith commented Jul 6, 2018 •

edited

Loading

Zac-HD commented Jul 6, 2018

touilleMan commented Jul 7, 2018

Zac-HD commented Jul 8, 2018

touilleMan commented Jul 16, 2018

Document (or fix) limitations of stateful testing #1300

Document (or fix) limitations of stateful testing #1300

Comments

DRMacIver commented May 22, 2018

Zac-HD commented May 22, 2018

DRMacIver commented May 22, 2018

Zac-HD commented May 22, 2018

DRMacIver commented May 22, 2018

DRMacIver commented May 22, 2018

lennax commented May 24, 2018

Zac-HD commented Jun 29, 2018 • edited Loading

Zac-HD commented Jul 5, 2018

njsmith commented Jul 6, 2018 • edited Loading

Zac-HD commented Jul 6, 2018

touilleMan commented Jul 7, 2018

Zac-HD commented Jul 8, 2018

touilleMan commented Jul 16, 2018

Zac-HD commented Jun 29, 2018 •

edited

Loading

njsmith commented Jul 6, 2018 •

edited

Loading