implicitly fuzz RNG-dependent doctests with a random random seed #29935

dimpase · 2020-06-22T13:02:46Z

Our documentation and tests include a variety of
examples and tests involving randomness.

Those depend on a randomness seed, and up to Sage 9.1
they always used the same one: 0. Thus every run
of sage -t or make [p]test[all][long] would run
those "random" doctests deterministically.
In many cases, the output of those tests even relied
on that. As a result, random examples and tests
were actually testing that they were run non-randomly!

This is reminiscent of
an xkcd comic illustration of random number generators.

Based on these considerations and the
related sage-devel discussion,
we propose to:

allow specifying a randomness when running tests (Introduce random-seed option to allow fuzzing of doctests #29962),
adapt tests involving randomness, making sure they
test mathematical functionality independent of
the randomness seed used to run them (see roadmap),
default to a random randomness seed when none is specified
(present ticket),

thus making those tests more robust and more useful,
by becoming more likely to reveal bugs in a variety
of cases including corner cases.

The first step (see #29962, merged in Sage 9.2)
adds a --random-seed flag to sage -t, allowing:

sage -t --long --random-seed=9876543210 src/sage/

Still, when no randomness seed is specified,
the default seed 0 is used. This means most testers
test with the same randomness seed, making "random"
doctests still mostly deterministic in practice.

Here is a way to pick a random randomness seed
and run tests with it (can be used to work on
the tickets in the roadmap):

$ randseed() {
    sage -c "import sage.misc.randstate as randstate; \
             randstate.set_random_seed(); \
             print(randstate.initial_seed())";
    }
$ SEED=$(randseed)
$ DIR=src/sage
$ echo "$ sage -t --long --random-seed=${SEED} ${DIR}" \
  && ./sage -t --long --random-seed=${SEED} ${DIR}

Once examples and tests involving randomness
have been adapted, the present ticket puts
the final touch by making it so that
when running tests with no seed specified,
a random one will be used:

sage -t src/sage/all.py 
Running doctests with ID 2020-06-23-23-19-03-8003eea5.
...
sage -t --warn-long 89.5 --random-seed=273987373473433732996760115183658447263 src/sage/all.py
    [16 tests, 0.73 s]
----------------------------------------------------------------------
All tests passed!
----------------------------------------------------------------------
Total time for all tests: 0.8 seconds
    cpu time: 0.7 seconds
    cumulative wall time: 0.7 seconds

Being displayed in the output, the seed used
can be used again if needed:

sage -t --warn-long 89.5 --random-seed=273987373473433732996760115183658447263 src/sage/all.py

allowing to explore any problematic case
revealed by running the tests with that seed.

Roadmap:

Allow fuzzing: Introduce random-seed option to allow fuzzing of doctests #29962
Make all parts of sage ready for default fuzzing:
- Make coding doctests ready for random seeds #29945: coding
- Make geometry doctests ready for random seeds #29963: geometry
- Make libs doctests ready for random seeds #29964: libs
- Make graphs doctests ready for random seeds #29965: graphs
- Make groups doctests ready for random seeds #32107: groups
- Make interfaces doctests ready for random seeds #29967: interfaces
- Make algebras doctests ready for random seeds #29968: algebras
- Make misc doctests ready for random seeds #29969: misc
- Make arith doctests ready for random seeds #29970: arith
- Make categories doctests ready for random seeds #29971: categories
- Make stats doctests ready for random seeds #29972: stats
- Make sets doctests ready for random seeds #29973: sets
- Make combinat doctests ready for random seeds #29974: combinat
- Make numerical and probability doctests ready for random seeds #29975: numerical, probability
- Make matrix doctests ready for random seeds #29976: matrix
- Make modular doctests ready for random seeds #29977: modular
- Make modules doctests ready for random seeds #29978: modules
- Make rings doctests ready for random seeds #29979: rings
- Make crypto doctests ready for random seeds #29980: crypto
- Make documentation doctests ready for random seeds #29981: documentation
- Make dynamics doctests ready for random seeds #29982: dynamics
- Make finance doctests ready for random seeds #29983: finance
- Make symbolic doctests ready for random seeds #29984: symbolic
- Make schemes doctests ready for random seeds #29985: schemes
- Make plot doctests ready for random seeds #29986: plot
- Fix some doctests that fail for various random seeds #32188: various missed tests along the way
Update the developers guide for implicitly fuzzing doctests #32216: Update the developers guide for implicitly fuzzing doctests
Finally make fuzzing default with this ticket.

Follow-up:

Meta-ticket: Fix unstable random doctests detected after #29935 #32544: Meta-ticket: Fix unstable doctests detected after implicitly fuzz RNG-dependent doctests with a random random seed #29935
Remove set_random_seed in doctests where possible.

Errors discovered by this ticket:

Fix moebius_transform, midpoint and perpendicular_bisector #29936: Hyperbolic geometry bugs revealed by fuzzing
Make coding doctests ready for random seeds #29945: Failing doctest in src/sage/coding/linear_code.py
(just a wrong doctest, will be fixed here)
Unstable plotting #29954: Unstable plotting
Bug in KlyachkoBundle_class.random_deformation #29956: Bug in KlyachkoBundle_class.random_deformation
Bug in ContinuedFraction rounding #29957: Bug in ContinuedFraction rounding
Too many strong articulation points #29958: Too many strong articulation points
random symbolic expression is completely unstable #29961: Random symbolic expression is completely unstable
Bug in Reed-Solomon encoder and error-erasure decoder #30045: Bug in Reed-Solomon encoder and error-erasure decoder
Index error with random derangement (fixed in Make combinat doctests ready for random seeds #29974)
simplify_hypergeometric is unstable #31890: simplify_hypergeometric is unstable
ZeroDivisonError when creating polynomial system #31891: ZeroDivisonError when creating polynomial system
Conic parametrization broken #31892: Conic parametrization broken
Polynomial generic power trunk broken #32075: Polynomial generic power trunk broken
Various errors with polybori including segmentation fault #32083: Various errors with polybori including segmentation fault
_nth_root_naive fails for integer mod #32084 _nth_root_naive fails for integer mod
Errors when computing norms of padic elements #32085: Errors when computing norms of padic elements
apply_homography unstable for continued fraction #32086: apply_homography unstable for continue fraction
DiFUB algorithm fails on some random graph #32095: DiFUB algorithm fails on some random graph
Fix random tree on one or less vertices #32108: Fix random tree on one or less vertices
Fix 0/0 in ore function field #32109: Fix 0/0 in ore function field
Unstable minimal polynomial for element of 2-adic Eisenstein Extension Field in pi defined by x^4 - 2*a #32111: Unstable minimal polynomial for element of 2-adic Eisenstein Extension Field in pi defined by x^4 - 2*a
Random relative number field checks only irreducibility over QQ #32117: Random relative number field checks only irreducibility over QQ
AlgebraicForm checks invariance with random matrix that can be the identity #32118: AlgebraicForm checks invariance with random matrix that can be the identity
SL2Z.random_element unstable, ZZ.random_element does not ignore bounds not needed for distribution #32124: SL2Z.random_element unstable
random Ore polynomials do not respect minimum degree bound #32125: random Ore polynomials do not respect minimum degree bound
padic QpLC.random_element is broken #32126: padic QpLC.random_element is broken
gosper_iterator of continued fractions is unstable #32127: gosper_iterator of continued fractions is unstable
sage_input is unreliable for elements of ComplexField #32129: sage_input is unreliable for elements of ComplexField
Cut width of graph with one edge incorrect #32131: Cut width of graph with one edge incorrect
Wrong gyration orbit length #32132: Wrong gyration orbit length
is_groebner fails over fraction fields #32138: is_groebner fails over fraction fields
Unstable doctest involving permutation groups #32141: Unstable doctest involving permutation groups
Bug in edge disjoint spanning trees #32169: Bug in edge disjoint spanning trees
Failing weak order assertion on random symbolic expression #32185: Failing weak order assertion on random symbolic expression
Random bounded tolerance graph #32186: Random bounded tolerance graph
permutation group generated by list perms in L of degree n incorrect when compared to GAP #32187: permutation group generated by list perms in L of degree n incorrect when compared to GAP
plot_vector_field unstable #32657: plot_vector_field unstable

Depends on #32667

CC: @kliem @orlitzky @DaveWitteMorris @slel @mantepse

Component: doctest framework

Keywords: fuzz, random, seed

Author: Jonathan Kliem

Branch/Commit: 047379c

Reviewer: Michael Orlitzky

Issue created by migration from https://trac.sagemath.org/ticket/29935

The text was updated successfully, but these errors were encountered:

kliem · 2020-06-22T13:19:50Z

comment:3

Just so that we don't forget that we will have to update

https://doc.sagemath.org/html/en/developer/coding_basics.html

kliem · 2020-06-22T13:19:50Z

Work Issues: modify coding conventions

kliem · 2020-06-22T13:40:39Z

Dependencies: #29904

kliem · 2020-06-22T13:53:07Z

comment:5

geometry/hyperbolic_space/hyperbolic_geodesic.py makes some claims on how good approximations work, which doesn't seem to work. I had this thing run 7 times now and not one time did all tests pass. E.g. there is one test that gives me absolute errors of 0.7 when running this in the shell, but the test claims it is below 10**-9.

There is also #29904. Other than that the geometry module appears to ready.

Edit: And there is some place where you need to add set_random_seed(0) now or similar.

dimpase · 2020-06-22T14:09:39Z

comment:6

Replying to @kliem:

geometry/hyperbolic_space/hyperbolic_geodesic.py makes some claims on how good approximations work, which doesn't seem to work. I had this thing run 7 times now and not one time did all tests pass. E.g. there is one test that gives me absolute errors of 0.7 when running this in the shell, but the test claims it is below 10**-9.

this is a great example of fuzzing catching an apparent error, it seems.

There is also #29904. Other than that the geometry module appears to ready.

Edit: And there is some place where you need to add set_random_seed(0) now or similar.

kliem · 2020-06-22T14:49:58Z

New commits:

`d5fc5be`	`start from a "random" random seed for doctesting`
`5c7e562`	`fix double description of hypercube`
`6b41bdb`	`Merge branch 'public/29904' of git://trac.sagemath.org/sage into public/29935`
`b2954ce`	`fix random test in geometry/linear_expression`

kliem · 2020-06-22T14:49:58Z

Branch: public/29935

kliem · 2020-06-22T14:49:58Z

Commit: b2954ce

kliem · 2020-06-22T14:54:23Z

comment:8

Even in the developer guide there is an example, where it is confusing:

sage: M = matrix.identity(3) + random_matrix(RR,3,3)/10^3
sage: M^2 # abs tol 1e-2
[1 0 0]
[0 1 0]
[0 0 1]

There is no mentioning there that only one particular matrix is tested.

sagetrac-git · 2020-06-22T14:57:35Z

Branch pushed to git repo; I updated commit sha1. New commits:

`dc49dd0`	`update developer guide`

sagetrac-git · 2020-06-22T14:57:35Z

Changed commit from b2954ce to dc49dd0

kliem · 2020-06-22T14:57:48Z

Changed work issues from modify coding conventions to none

dimpase · 2020-06-22T15:14:16Z

comment:11

Replying to @kliem:

Even in the developer guide there is an example, where it is confusing:
sage: M = matrix.identity(3) + random_matrix(RR,3,3)/10^3
sage: M^2 # abs tol 1e-2
[1 0 0]
[0 1 0]
[0 0 1]
There is no mentioning there that only one particular matrix is tested.

indeed it assumes (?) that the entries of the random matrix are small in abs value

kliem · 2020-06-22T15:16:22Z

comment:12

Nothing to assume there. random_matrix(RR,3,3) has entries between -1 and 1. After dividing we are good.

kliem · 2020-06-22T15:21:25Z

Changed dependencies from #29904 to #29904, #29936

mkoeppe · 2020-06-22T16:14:14Z

comment:14

How about making the random seed an option to sage-runtests. Then people can fuzz the doctests if they want.

kliem · 2020-06-22T16:27:43Z

comment:15

Will people do that? Especially enough to find incorrect claims. If you look into #29936 we have been claiming a preciseness that is far from true. Even if you consider the relative error instead of the absolute error. fuzzed doctests would have easily detected that. Even worse. Maybe the situation was better at some point and we never caught on about the regression.

dimpase · 2020-06-22T16:39:26Z

comment:16

it is not a "real" (time-consuming) fuzzing, it is making the main random seed nondeterministic in tests. making it optional is akin to making tests optional.

we already found a couple of examples which show usefulness of this change in detecting bugs in Sage.

mkoeppe · 2020-06-22T16:40:57Z

comment:17

The option that I propose to add needs to be added in any case, so that when a failure is revealed by a random random seeds, we can reproduce the failure with that seed.

dimpase · 2020-06-22T16:43:35Z

comment:18

it is a good point - and the non-det. seed needs to be logged - if this is not yet in the branch it should get there.

mkoeppe · 2020-06-22T16:46:28Z

comment:19

So I would suggest to use this ticket to introduce the option, not change the default.

dimpase · 2020-06-22T16:51:26Z

comment:20

no, I don't agree, it makes no sense to test less if one can test more, and also not having it default would not force people to make their tests robust.

mkoeppe · 2020-06-22T16:51:45Z

comment:21

By the way I don't think most doctests have the ambition to demonstrate correctness of mathematical claims. In my opinion, if a doctest intends to do that, it should run an explicit loop of several tests. Still deterministically.

mkoeppe · 2020-06-22T16:53:21Z

comment:22

Why would creating the option and changing the default have to be done on the same ticket? I think it's best practices to separate the two steps on two tickets.

kliem · 2020-06-22T16:54:21Z

comment:23

Feel free to do with the branch whatever you think is reasonable.

kliem · 2021-10-13T12:20:37Z

comment:175

Thank you.

orlitzky · 2021-10-13T12:22:35Z

comment:176

No problem, you'll have to rebase this onto the typo-fix too I think.

sagetrac-git · 2021-10-13T12:25:48Z

Changed commit from 5739ef7 to 2c478d1

sagetrac-git · 2021-10-13T12:25:48Z

Branch pushed to git repo; I updated commit sha1 and set ticket back to needs_review. This was a forced push. Last 10 new commits:

`3cfe235`	`implicitly fuzz RNG-dependent doctests`
`1980561`	`reduce vertices for edge disjoint spanning trees`
`e6dd027`	`do not remove fixed doctest`
`22c2b66`	`simplify doctest`
`e32986c`	`fix unstable doctests`
`7b1bd36`	`fixed some doctests for disjoint spanning trees`
`812b555`	`fix unstable doctest in book_stein_ent`
`74e505b`	`edge disjoint spanning tree not as fast as claimed, see #32169`
`44cd7ae`	`fix doctest failure for random matrix`
`2c478d1`	`one more unstable doctest`

kliem · 2021-10-13T12:26:41Z

comment:179

Replying to @orlitzky:

No problem, you'll have to rebase this onto the typo-fix too I think.

I don't know, if this is necessary, but it sure doesn't hurt.

vbraun · 2021-10-13T19:27:44Z

comment:180

Merge conflict

kliem · 2021-10-13T19:39:04Z

comment:181

This is the merge conflict, which permits a trivial solution (at least trivial for a human being):

diff --cc src/bin/sage-runtests
index d1fe6567e7,4fc2062b15..0000000000
--- a/src/bin/sage-runtests
+++ b/src/bin/sage-runtests
@@@ -64,59 -55,60 +55,97 @@@ if __name__ == "__main__"
               'if "external" is listed, will also run tests for available external software; '
               'if "build" is listed, will also run tests specific to Sage\'s build/packaging system; '
               'if set to "all", then all tests will be run')
++<<<<<<< HEAD
 +    parser.add_option("--randorder", type=int, metavar="SEED", help="randomize order of tests")
 +    parser.add_option("--random-seed", dest="random_seed", type=int, metavar="SEED", help="random seed (integer) for fuzzing doctests")
 +    parser.add_option("--global-iterations", "--global_iterations", type=int, default=0, help="repeat the whole testing process this many times")
 +    parser.add_option("--file-iterations", "--file_iterations", type=int, default=0, help="repeat each file this many times, stopping on the first failure")
 +    parser.add_option("--environment", type=str, default="sage.repl.ipython_kernel.all_jupyter", help="name of a module that provides the global environment for tests")
 +
 +    parser.add_option("-i", "--initial", action="store_true", default=False, help="only show the first failure in each file")
 +    parser.add_option("--exitfirst", action="store_true", default=False, help="end the test run immediately after the first failure or unexpected exception")
 +    parser.add_option("--force_lib", "--force-lib", action="store_true", default=False, help="do not import anything from the tested file(s)")
 +    parser.add_option("--abspath", action="store_true", default=False, help="print absolute paths rather than relative paths")
 +    parser.add_option("--verbose", action="store_true", default=False, help="print debugging output during the test")
 +    parser.add_option("-d", "--debug", action="store_true", default=False, help="drop into a python debugger when an unexpected error is raised")
 +    parser.add_option("--only-errors", action="store_true", default=False, help="only output failures, not test successes")
 +
 +    parser.add_option("--gdb", action="store_true", default=False, help="run doctests under the control of gdb")
 +    parser.add_option("--valgrind", "--memcheck", action="store_true", default=False,
 +                      help="run doctests using Valgrind's memcheck tool.  The log "
 +                         "files are named sage-memcheck.PID and can be found in " +
 +                         os.path.join(DOT_SAGE, "valgrind"))
 +    parser.add_option("--massif", action="store_true", default=False,
 +                      help="run doctests using Valgrind's massif tool.  The log "
 +                         "files are named sage-massif.PID and can be found in " +
 +                         os.path.join(DOT_SAGE, "valgrind"))
 +    parser.add_option("--cachegrind", action="store_true", default=False,
 +                      help="run doctests using Valgrind's cachegrind tool.  The log "
 +                         "files are named sage-cachegrind.PID and can be found in " +
 +                         os.path.join(DOT_SAGE, "valgrind"))
 +    parser.add_option("--omega", action="store_true", default=False,
 +                      help="run doctests using Valgrind's omega tool.  The log "
 +                         "files are named sage-omega.PID and can be found in " +
 +                         os.path.join(DOT_SAGE, "valgrind"))
 +
 +    parser.add_option("-f", "--failed", action="store_true", default=False,
++=======
+     parser.add_argument("--randorder", type=int, metavar="SEED", help="randomize order of tests")
+     parser.add_argument("--random-seed", dest="random_seed", type=int, metavar="SEED", help="random seed for fuzzing doctests")
+     parser.add_argument("--global-iterations", "--global_iterations", type=int, default=0, help="repeat the whole testing process this many times")
+     parser.add_argument("--file-iterations", "--file_iterations", type=int, default=0, help="repeat each file this many times, stopping on the first failure")
+     parser.add_argument("--environment", type=str, default="sage.repl.ipython_kernel.all_jupyter", help="name of a module that provides the global environment for tests")
+ 
+     parser.add_argument("-i", "--initial", action="store_true", default=False, help="only show the first failure in each file")
+     parser.add_argument("--exitfirst", action="store_true", default=False, help="end the test run immediately after the first failure or unexpected exception")
+     parser.add_argument("--force_lib", "--force-lib", action="store_true", default=False, help="do not import anything from the test
...

sagetrac-git · 2021-10-13T19:40:57Z

Changed commit from 2c478d1 to 047379c

sagetrac-git · 2021-10-13T19:40:57Z

Branch pushed to git repo; I updated commit sha1. New commits:

`047379c`	`fix merge conflict`

vbraun · 2021-10-19T20:35:13Z

Changed branch from public/29935-reb to 047379c

dimpase added this to the sage-9.2 milestone Jun 22, 2020

dimpase added c: doctest framework labels Jun 22, 2020

This comment has been minimized.

Sign in to view

orlitzky added s: positive review and removed s: needs review labels Oct 13, 2021

sagetrac-git mannequin added s: needs review and removed s: positive review labels Oct 13, 2021

kliem added s: positive review and removed s: needs review labels Oct 13, 2021

vbraun added s: needs work and removed s: positive review labels Oct 13, 2021

kliem added s: positive review and removed s: needs work labels Oct 13, 2021

vbraun removed the s: positive review label Oct 19, 2021

vbraun closed this as completed in c6268d1 Oct 19, 2021

roed314 mentioned this issue Feb 22, 2022

Add seed parameter to GF #33348

Closed

implicitly fuzz RNG-dependent doctests with a random random seed #29935

implicitly fuzz RNG-dependent doctests with a random random seed #29935

Comments

dimpase commented Jun 22, 2020

This comment has been minimized.

kliem commented Jun 22, 2020

kliem commented Jun 22, 2020

kliem commented Jun 22, 2020

kliem commented Jun 22, 2020

dimpase commented Jun 22, 2020

kliem commented Jun 22, 2020

kliem commented Jun 22, 2020

kliem commented Jun 22, 2020

kliem commented Jun 22, 2020

sagetrac-git mannequin commented Jun 22, 2020

sagetrac-git mannequin commented Jun 22, 2020

kliem commented Jun 22, 2020

dimpase commented Jun 22, 2020

kliem commented Jun 22, 2020

kliem commented Jun 22, 2020

mkoeppe commented Jun 22, 2020

kliem commented Jun 22, 2020

dimpase commented Jun 22, 2020

mkoeppe commented Jun 22, 2020

dimpase commented Jun 22, 2020

mkoeppe commented Jun 22, 2020

dimpase commented Jun 22, 2020

mkoeppe commented Jun 22, 2020

mkoeppe commented Jun 22, 2020

kliem commented Jun 22, 2020

kliem commented Oct 13, 2021

orlitzky commented Oct 13, 2021

sagetrac-git mannequin commented Oct 13, 2021

sagetrac-git mannequin commented Oct 13, 2021

kliem commented Oct 13, 2021

vbraun commented Oct 13, 2021

kliem commented Oct 13, 2021

sagetrac-git mannequin commented Oct 13, 2021

sagetrac-git mannequin commented Oct 13, 2021

vbraun commented Oct 19, 2021