Split up tests? #5409

max-sixty · 2021-05-31T21:07:53Z

Currently a large share of our tests are in test_dataset.py and test_dataarray.py, each of which is around 7k lines.

There's a case for splitting these up:

- Many of the tests are somewhat duplicated between the files (and test_variable.py in some cases), i.e. we're running the same test over a Dataset & DataArray, but putting them far away from each other in separate files. Should we instead split them by "function", e.g. test_rolling.py for all rolling tests?
- My editor takes 5-20 seconds to run the linter and save the file. This is a very narrow complaint.
- Now that we're all on pytest, there's no need to have them in the same class.
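
As a rough illustration of the "split by function" idea, here is a minimal sketch of what a test_rolling.py module could look like, with one module-level test parametrized over both DataArray and Dataset instead of two near-identical methods in separate classes. The helper names `make_dataarray` and `make_dataset` and the test name are made up for this example and are not taken from the xarray test suite.

```python
# test_rolling.py (hypothetical layout): one module per feature
import numpy as np
import pytest
import xarray as xr


def make_dataarray():
    # Small 1-D array with a named dimension "x" (illustrative helper)
    return xr.DataArray(np.arange(6.0), dims="x", name="a")


def make_dataset():
    # Dataset wrapping the same data, so both cases share one expectation
    return make_dataarray().to_dataset()


@pytest.mark.parametrize(
    "make_obj", [make_dataarray, make_dataset], ids=["DataArray", "Dataset"]
)
def test_rolling_mean(make_obj):
    obj = make_obj()
    actual = obj.rolling(x=3).mean()

    # The result keeps the input's type and dimension sizes
    assert type(actual) is type(obj)
    assert dict(actual.sizes) == dict(obj.sizes)

    # Compare raw values; for the Dataset, look at its single variable "a"
    values = actual["a"].values if isinstance(actual, xr.Dataset) else actual.values
    # The first two windows are incomplete, so they are NaN by default
    expected = np.array([np.nan, np.nan, 1.0, 2.0, 3.0, 4.0])
    np.testing.assert_allclose(values, expected)
```

Because the test is a plain function, no shared test class is needed, and running `pytest test_rolling.py` would exercise both container types from one place.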

If we do this, we could start on the margin: new tests around some specific functionality, e.g. join / rolling / reindex / stack (just a few from browsing through), could go into a new respective test_{}.py file, rather than one big copy-and-paste commit.
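
To get a feel for how much would eventually move, one could take a crude inventory of the existing tests by feature keyword. The sketch below is only an assumption about how such an inventory might be done: `FEATURES`, `iter_tests`, `plan_moves`, and the sample source are all hypothetical, and matching on test names is a heuristic, not a policy.

```python
import ast
from collections import defaultdict

# Hypothetical feature keywords, taken from the examples named above
FEATURES = ("join", "rolling", "reindex", "stack")


def iter_tests(tree):
    """Yield qualified names of top-level test functions and test methods."""
    for node in tree.body:
        if isinstance(node, ast.FunctionDef) and node.name.startswith("test_"):
            yield node.name
        elif isinstance(node, ast.ClassDef):
            for item in node.body:
                if isinstance(item, ast.FunctionDef) and item.name.startswith("test_"):
                    yield f"{node.name}.{item.name}"


def plan_moves(source, features=FEATURES):
    """Map each proposed test_{feature}.py file to the tests naming that feature."""
    plan = defaultdict(list)
    for qualified in iter_tests(ast.parse(source)):
        words = qualified.rsplit(".", 1)[-1].split("_")
        for feature in features:
            # Match whole underscore-separated words; the first hit wins
            if feature in words:
                plan[f"test_{feature}.py"].append(qualified)
                break
    return dict(plan)


# Tiny stand-in for a large test module
SAMPLE = '''
class TestDataset:
    def test_rolling_mean(self): ...
    def test_reindex_like(self): ...
    def test_repr(self): ...

def test_stack_unstack(): ...
'''

plan = plan_moves(SAMPLE)
for target, tests in sorted(plan.items()):
    print(target, tests)
```

Tests that match no keyword (such as `test_repr` in the sample) stay where they are, which fits the gradual approach.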

keewis · 2021-05-31T22:21:47Z

That sounds like a good idea, and we'll definitely have to do this gradually because PRs with several thousand lines of changes are really difficult to review.

Probably unrelated, but I'd also like to refactor / reorder the tests in test_combine.py.

shoyer · 2021-06-02T03:28:34Z

Yes, splitting up these giant test files would definitely be a good idea. test_backends.py might be another good candidate.

I don't think there was ever a good reason why they needed to be in a single class. unittest works just fine with multiple test classes, too.

max-sixty · 2021-06-02T17:47:21Z

Great, it seems there's some backing for this. What's the best way to start?

I'm happy to do a PR to pull all the rolling tests into test_rolling.py as an example to start. I'm also happy to say any new rolling tests need to go into test_rolling.py, to minimize movement, as suggested above. But maybe that's hard to manage, and it wouldn't be obvious to new contributors that that's the policy.

TomNicholas · 2021-06-04T16:16:30Z

This is kind of a general question whenever we add new features: if I add to the API such that I can access my new computation newfunc via any of da.newfunc(), ds.newfunc() or xr.newfunc(), then should the tests live in test_dataarray.py, test_dataset.py, test_computation.py, all three of them, or in a new file test_newfunc.py? I agree we should move towards the latter, and try to just make test_dataarray.py & test_dataset.py test the core functionality of those objects, and not their extensive method APIs.
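
A small sketch of the "one file per feature" option, using `where` purely as a stand-in for newfunc because it is already reachable as a DataArray method, a Dataset method, and a top-level function. The file name test_where.py, the helpers, and the test name are assumptions for illustration; the point is only that all entry points can be checked against each other in one module.

```python
# test_where.py (hypothetical file; `where` stands in for "newfunc")
import numpy as np
import pytest
import xarray as xr


def make_dataarray():
    # Illustrative helper: 1-D data along dimension "x"
    return xr.DataArray(np.arange(5.0), dims="x", name="a")


def make_dataset():
    return make_dataarray().to_dataset()


@pytest.mark.parametrize(
    "make_obj", [make_dataarray, make_dataset], ids=["DataArray", "Dataset"]
)
def test_where_method_matches_function(make_obj):
    obj = make_obj()
    cond = make_dataarray() > 2

    from_method = obj.where(cond, 0.0)        # da.where(...) or ds.where(...)
    from_function = xr.where(cond, obj, 0.0)  # top-level xr.where(...)

    # Values, dimensions and coordinates must agree across entry points
    xr.testing.assert_equal(from_method, from_function)
```

With this layout, test_dataarray.py and test_dataset.py would not need their own copies of the feature's tests.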

dcherian added the topic-testing label Jun 16, 2021

This was referenced Jun 16, 2021

Refactor out coarsen tests #5474

Merged

Refactor dataset groupby tests #5506

Merged

dcherian added a commit to dcherian/xarray that referenced this issue Aug 14, 2021

Refactor more groupby and resample tests

5ff1c49

xref pydata#5409

dcherian mentioned this issue Aug 14, 2021

Refactor more groupby and resample tests #5707

Merged

dcherian mentioned this issue Jul 12, 2022

Move Rolling tests to their own testing module #6777

Merged
