mandate tests for additions to MDAnalysis.analysis and introduction of analysis.legacy #743

orbeckst · 2016-02-26T22:15:47Z

tl;dr: Spring cleaning for MDAnalysis.analysis.

With recent discussions about the analysis module (#666, #719) and the feeling that we need to get it up to the same standards as the core, I think an important step is better tests and identifying to the user code that is not well maintained (it's sad having to admit that we have under-maintained code but this is a fact in open source and we simply have a responsibility towards our users).

Mandatory tests for new analysis code: P1
MDAnalysis.analysis.legacy module for unmaintained code: P2

What say ye, @MDAnalysis/coredevs ?

(P1) Mandatory tests for new analysis code

I propose to require new code for MDAnalysis.analysis to come with unit tests. Currently, the Style Guide only encourages unit tests.

Tests for analysis classes and functions should at a minimum perform regression tests, i.e., run on input and compare to values generated when when the code was added so that we know when the output changes in the future. (Even better are tests that test for absolute correctness of results but regression tests are the minimum requirement.)

If we adopt (P1), the Style Guide will be changed to reflect this policy and new analysis code will not be merged until tests are provided and pass.

(P2) `MDAnalysis.analysis.legacy` for unmaintained code

I am also proposing that any code in MDAnalysis.analysis that does not have substantial testing (at least 70% coverage... we can debate this number!) will be moved to a special MDAnalysis.analysis.legacy module that will come with its own warning that this is essentially unmaintained functionality that is still provided because there's no alternative. Legacy packages that receive sufficient upgrades can come back to the normal MDAnalysis.analysis name space.

If we adopt P2 then modules can be moved to legacy as soon as we decide on its creation but until release 1.0 we leave stubs in place so that no old code breaks (and issue deprecation warnings for the import). Once 1.0 is released, the stubs will be removed and the modules will only be available from legacy.

History

2016-02-26 initial proposal
2016-03-12 added @richardjgowers 's suggestion to require regression tests at a minimum
2016-03-24 P1 and P2 accepted

The text was updated successfully, but these errors were encountered:

richardjgowers · 2016-02-27T11:32:32Z

I like this, with the twist that we should just require a regression test on each function/method.

I can't remember who said this (@dotsdl probably?), but we could(?) move analysis/vis into their own repo, but still package it all together as one package (so pip install MDAnalysis gets you everything). Then PRs like #708 #735 could go to the analysis repo, with this repo being the "main" stuff (something like a numpy/scipy split). Then we could monitor coverage of analysis+vis separately to core.

tylerjereddy · 2016-03-09T15:57:56Z

Sounds sensible. I guess my MDAnalysis.visualization stuff won't be subject
to this just yet though. I think a bit of creativity would be needed to get
sensible tests in there.

On 27 February 2016 at 11:32, Richard Gowers [email protected]
wrote:

I like this, with the twist that we should just require a regression test
on each function/method.

I can't remember who said this (@dotsdl https://github.com/dotsdl
probably?), but we could(?) move analysis/vis into their own repo, but
still package it all together as one package (so pip install MDAnalysis
gets you everything). Then PRs like #708
#708 #735
#735 could go to the
analysis repo, with this repo being the "main" stuff (something like a
numpy/scipy split). Then we could monitor coverage of analys is+vis
separately to core.

—
Reply to this email directly or view it on GitHub
#743 (comment)
.

richardjgowers · 2016-03-09T16:00:02Z

It might be possible just to assert that a matplotlib object gets returned
for analysis.... So at least the code didn't explode. Beyond that it's
tricky to inspect images.

On Wed, 9 Mar 2016 15:57 Tyler Reddy, [email protected] wrote:

Sounds sensible. I guess my MDAnalysis.visualization stuff won't be subject
to this just yet though. I think a bit of creativity would be needed to get
sensible tests in there.

On 27 February 2016 at 11:32, Richard Gowers [email protected]
wrote:

I like this, with the twist that we should just require a regression test
on each function/method.

I can't remember who said this (@dotsdl https://github.com/dotsdl
probably?), but we could(?) move analysis/vis into their own repo, but
still package it all together as one package (so pip install MDAnalysis
gets you everything). Then PRs like #708
#708 #735
#735 could go to the
analysis repo, with this repo being the "main" stuff (something like a
numpy/scipy split). Then we could monitor coverage of analys is+vis
separately to core.

—
Reply to this email directly or view it on GitHub
<
#743 (comment)

.

—
Reply to this email directly or view it on GitHub
#743 (comment)
.

kain88-de · 2016-03-09T16:12:07Z

prettyplotlib compares to some baseline images to test the plotting routines. But other then that I'm also not sure what to do to test images.

tylerjereddy · 2016-03-09T17:02:34Z

That's true. Although the streamline code does do some mathematics before producing the image so I could at least provide tests for that, or the data before it is plotted. I think any tests would likely just prevent regression / changing of behaviour rather than proving that the code is correct, but that is likely better than the current state (no tests). I'll make a note of this...

dotsdl · 2016-03-09T17:14:44Z

Since matplotlib figures are just python objects (they're only made into images when you do Figure.savefig or use one of the backends to display them), one could just inspect the Figure and its Axes objects to make sure everything looks right. Everything is in the Figure object, basically.

orbeckst · 2016-03-12T19:45:41Z

Added @richardjgowers 's suggestion to require regression tests (at a minimum).

orbeckst · 2016-03-12T19:52:42Z

Quick comments on the discussion:

Maybe we open a separate proposal for MDAnalysis.visualization --- I skirted this issue here and focused on MDAnalysis.analysis.
Let's discuss the feasibility of splitting the library in a different issue. Right now this sounds like opening up yet another big construction site... let's get Move from Atom based topology to array based #363 done before starting anything else of similar magnitude.

Do we have any other voices? Or can we consider this proposal, consisting of P1 and P2, consider accepted? (GitHub does not have proper voting but you can use the thumbs up/down emoticons on the top comment.)

kain88-de · 2016-03-14T14:00:33Z

I'm for both solutions. @orbeckst should we just count the like to your post that people agree with P1 and P2

richardjgowers · 2016-03-14T16:00:48Z

Rather than play around with a legacy submodule and stubs, couldn't we just write the regression tests for each? It'd probably be a similar amount of work either way. If anything isn't well documented enough to write tests for, get rid of it anyway.

orbeckst · 2016-03-15T22:45:00Z

Having regression tests for everything is clearly the preferred solution. However, I hate to throw away code because at some point someone put quite a bit of thought into it and it can still be useful.

In any case, it doesn't really look too daunting:

Analysis modules with no associated tests

gnm
nuclinfo
x3dna

Analysis modules with incomplete testing

hole
contacts

... but we can simply look at coverage to figure out where we need more tests.

Modules with difficult dependencies

The following modules require 3rd party code, sometimes not even free:

hole (hole)
x3dna (x3dna)
align (clustalw)

orbeckst · 2016-03-24T22:51:54Z

I consider P1 and P2 accepted.

I won't set up MDAnalysis.analysis.legacy right away, maybe we get everything tested, especially since @richardjgowers has already made valiant efforts in this direction. Any other steps in a similar direction very welcome.

The next steps here are to

document (wiki)
communicate on developer list

orbeckst · 2016-03-26T00:24:57Z

Updated the wiki page Style Guide: Tests and sent email to the developer list tests are now required for all of MDAnalysis.analysis.

- created new MDAnalysis.analysis.legacy package - directory - docs - x3dna is now considered legacy code (no/minimal testing, no maintenance) - see #906 for reasons (mainly because the x3dna license does not allow us to just install it for testing on travis, or rather, licensing and access to x3dna is unknown/too complicated) - closes #906 - see #743 on background for legacy code and also https://github.com/MDAnalysis/mdanalysis/wiki/Style-Guide#tests-for-mdanalysisanalysis - added stub with deprecation warning: until release 1.0, MDAnalysis.analysis.x3dna is still accessible - added test case that the stub is there

- legacy module (#743) - x3dna is now legacy code (#906) - updated upcoming release number to 0.16.0 [skip ci]

* moved analysis.x3dna to analysis.legacy.x3dna - created new MDAnalysis.analysis.legacy package - directory - docs - x3dna is now considered legacy code (no/minimal testing, no maintenance) - see #906 for reasons (mainly because the x3dna license does not allow us to just install it for testing on travis, or rather, licensing and access to x3dna is unknown/too complicated) - closes #906 - see #743 on background for legacy code and also https://github.com/MDAnalysis/mdanalysis/wiki/Style-Guide#tests-for-mdanalysisanalysis - added stub with deprecation warning: until release 1.0, MDAnalysis.analysis.x3dna is still accessible - added test case that the stub is there * updated CHANGELOG - legacy module (#743) - x3dna is now legacy code (#906) - updated upcoming release number to 0.16.0 [skip ci] * Removed legacy submodule from coverage

* moved analysis.x3dna to analysis.legacy.x3dna - created new MDAnalysis.analysis.legacy package - directory - docs - x3dna is now considered legacy code (no/minimal testing, no maintenance) - see MDAnalysis#906 for reasons (mainly because the x3dna license does not allow us to just install it for testing on travis, or rather, licensing and access to x3dna is unknown/too complicated) - closes MDAnalysis#906 - see MDAnalysis#743 on background for legacy code and also https://github.com/MDAnalysis/mdanalysis/wiki/Style-Guide#tests-for-mdanalysisanalysis - added stub with deprecation warning: until release 1.0, MDAnalysis.analysis.x3dna is still accessible - added test case that the stub is there * updated CHANGELOG - legacy module (MDAnalysis#743) - x3dna is now legacy code (MDAnalysis#906) - updated upcoming release number to 0.16.0 [skip ci] * Removed legacy submodule from coverage

…Analysis#743)

orbeckst added maintainability testing Component-Analysis proposal labels Feb 26, 2016

orbeckst added this to the 1.0 milestone Feb 26, 2016

orbeckst self-assigned this Mar 12, 2016

richardjgowers mentioned this issue Mar 20, 2016

Write regression tests for nuclinfo #790

Closed

orbeckst added Component-Docs policy and removed proposal labels Mar 24, 2016

orbeckst mentioned this issue Mar 26, 2016

formalize proposal/decision making process #802

Closed

3 tasks

orbeckst closed this as completed Mar 26, 2016

Endle mentioned this issue Mar 26, 2016

Started tests for gnm. #803

Merged

4 tasks

jbarnoud mentioned this issue Mar 27, 2016

Coverage is not reported for the analyses #804

Closed

This was referenced Jul 20, 2016

install external code on travis for complete analysis testing? #898

Closed

move analysis.x3dna to new analysis.legacy module #906

Closed

orbeckst mentioned this issue Sep 15, 2016

moved analysis.x3dna to analysis.legacy.x3dna #987

Merged

4 tasks

orbeckst added a commit that referenced this issue Sep 15, 2016

updated CHANGELOG

5106992

- legacy module (#743) - x3dna is now legacy code (#906) - updated upcoming release number to 0.16.0 [skip ci]

lohani2280 pushed a commit to lohani2280/mdanalysis that referenced this issue Feb 17, 2017

implemented new requirements for MDAnalysis.analysis tests (closes MD…

2ccef54

…Analysis#743)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mandate tests for additions to MDAnalysis.analysis and introduction of analysis.legacy #743

mandate tests for additions to MDAnalysis.analysis and introduction of analysis.legacy #743

orbeckst commented Feb 26, 2016

richardjgowers commented Feb 27, 2016

tylerjereddy commented Mar 9, 2016

richardjgowers commented Mar 9, 2016

kain88-de commented Mar 9, 2016

tylerjereddy commented Mar 9, 2016

dotsdl commented Mar 9, 2016

orbeckst commented Mar 12, 2016

orbeckst commented Mar 12, 2016

kain88-de commented Mar 14, 2016

richardjgowers commented Mar 14, 2016

orbeckst commented Mar 15, 2016

orbeckst commented Mar 24, 2016

orbeckst commented Mar 26, 2016

mandate tests for additions to MDAnalysis.analysis and introduction of analysis.legacy #743

mandate tests for additions to MDAnalysis.analysis and introduction of analysis.legacy #743

Comments

orbeckst commented Feb 26, 2016

(P1) Mandatory tests for new analysis code

(P2) MDAnalysis.analysis.legacy for unmaintained code

History

richardjgowers commented Feb 27, 2016

tylerjereddy commented Mar 9, 2016

richardjgowers commented Mar 9, 2016

kain88-de commented Mar 9, 2016

tylerjereddy commented Mar 9, 2016

dotsdl commented Mar 9, 2016

orbeckst commented Mar 12, 2016

orbeckst commented Mar 12, 2016

kain88-de commented Mar 14, 2016

richardjgowers commented Mar 14, 2016

orbeckst commented Mar 15, 2016

Analysis modules with no associated tests

Analysis modules with incomplete testing

Modules with difficult dependencies

orbeckst commented Mar 24, 2016

orbeckst commented Mar 26, 2016

(P2) `MDAnalysis.analysis.legacy` for unmaintained code