Fix pytest backend #151

StefRe · 2020-04-27T16:31:08Z

Refactor pytest backend

Instead of writing events for each phase (setup, call, teardown) of a
test (and eventually displaying them in the datatree widget) we now
accumulate results over the individual test phases and only then write
the final result of the entire test upon test completion using the
pytest_runtest_logfinish hook.

This ensures the following:

Capturing of output from the teardown phase even if it passes
without an error. This fixes Teardown function's logs not captured #127 .
Differentiating between the outcome of the test itself (the call
phase) and the entire test as a whole. An error during teardown,
for instance, used to overwrite the (potentially passed) test result
of the call phase. Now we get a passed test with an error during
teardown. So in the datatree widget, the test status (first column)
is no longer 1:1 linked with the color of this line (see also 3)).
Better handling of xfailed and xpassed tests which now show the
correct status xfailed or xpassed and are colored green and
red respectively (instead of status skipped and gray color for
both of them). This fixes pytest statuses "expected-fail" and "unexpectedly passing" not yet reflected in Category #47.
Better messages: the first message, which is usually the most
important one, will be placed in the message column of the datatree
widget. Further messages will be appended to the extra_text field
(instead of overwriting messages from previous test phases). If the
first message spans over multiple lines then only its first line
will be displayed in the message column and the complete message
will be repeated in the extra_text field for better readability.
This improves the visual impression of the datatree widget so that
each (collapsed) row is exactly one line high.

Instead of writing events for each phase (setup, call, teardown) of a test (and eventually displaying them in the datatree widget) we now accumulate results over the individual test phases and only then write the final result of the entire test upon test completion using the pytest_runtest_logfinish hook. This ensures the following: 1) Capturing of output from the teardown phase even if it passes without an error. This fixes spyder-ide#127. 2) Differentiating between the outcome of the test itself (the call phase) and the entire test as a whole. An error during teardown, for instance, used to overwrite the (potentially passed) test result of the call phase. Now we get a passed test with an error during teardown. So in the datatree widget, the test status (first column) is no longer 1:1 linked with the color of this line (see also 3)). 3) Better handling of xfailed and xpassed tests which now show the correct status `xfailed` or `xpassed` and are colored green and red respectively (instead of status `skipped` and gray color for both of them). This fixes spyder-ide#47. 4) Better messages: the first message, which is usually the most important one, will be placed in the message column of the datatree widget. Further messages will be _appended_ to the extra_text field (instead of overwriting messages from previous test phases). If the first message spans over multiple lines then only its first line will be displayed in the message column and the complete message will be repeated in the extra_text field for better readability. This improves the visual impression of the datatree widget so that each (collapsed) row is exactly one line high.

StefRe · 2020-04-27T16:41:11Z

Below is a comparison of the output before (left side) and after (right side) this PR when running pytest on the following code (click on image to enlarge):

import pytest

@pytest.fixture(scope="function",
                params=['none', 'setup', 'call'])
def resource(request):
    param = request.param
    print('setup')
    if 'setup' in param:
        raise Exception('setup')
    yield param
    print('teardown')
    if 'teardown' in param:
        raise Exception('teardown')


def test_with_error_in(resource):
    print('call')
    assert 'call' not in resource


@pytest.mark.xfail(reason='this is expected to fail')
class TestNoErrorXfail:
    def setup_method(self):
        print('setup')

    def teardown_method(self):
        print('teardown')

    def test_xfail(self):
        print('call')
        assert False

    def test_xpass(self):
        print('call')


@pytest.mark.skip(reason="we don't want to test this now")
def test_skip(self):
    print('call')

The following numbers (in the picture blue in cirles) show the improvements:
(1) teardown output is captured
(2) there's a linebreak between error messages and captured output
(3) only 1st line of multiline messages is show in row, rest in extra_text
(4) correct status, message and color for xfailed tests
(5) correct status, message and color for xpassed tests
(6) message for skipped text shown in message column (instead of extra_text)

StefRe · 2020-04-27T20:02:00Z

I forgot to mention in the commit message that I deliberately meant 'wasxfail' to be a non-translatable string (i.e. I didn't forget to enclose it in _(...)) in the following code snippet from pytest_runtest_logreport. It's just a placeholder for the unlikely case that no reason was given in the @pytest.mark.xfail decorator. I think that without any message it would be unclear why the test got the status xfailed or xpassed then.

            self.longrepr.append(
                report.wasxfail if report.wasxfail else 'wasxfail')

jitseniesen · 2020-04-28T17:39:27Z

Thanks, this looks good.

I am wondering whether the output may be confusing if errors occur during setup or teardown. If you run pytest from the command line, the output has lines like "error at teardown of ". It is not as clearly indicated here. The user should know that a status of --- means that an error at setup, and a status of passed with a red line means an error at teardown, which is rather obscure. Or they should look at the details in the backtrace and see that the test function itself does not appear there.

What do you think about that? Perhaps out of scope for this PR (as the PR does not make it worse)?

the unlikely case that no reason was given in the @pytest.mark.xfail decorator

People are lazy. The spyder repository has xfail decorators without reason. I don't know whether the string "wasxfail" is better than an empty string, but let's leave it as you wrote it.

StefRe · 2020-04-29T16:57:24Z

If you run pytest from the command line, the output has lines like "error at teardown of ". It is not as clearly indicated here.

Yes, maybe it's indeed better to prepend some reference to the message field as to where the error occurred in this case, like so:

May only concern is that the backend usually shouldn't contain any UI strings (like _('ERROR at') in this case), but maybe we can make an exception here. (*) If so we could also change the 'wasxfail' to a more verbose _('WAS EXPECTED TO FAIL') (capitals to match the other message). I double-checked and confirmed that wasxfail is the only case where pytest doesn't supply any message whatsoever if none is given in the code.

What do you think about that?

EDIT: (*) I just saw that noserunner also uses translatable strings, so it should be OK.

jitseniesen · 2020-04-29T18:50:29Z

My only concern is that the backend usually shouldn't contain any UI strings

Yes, I do not particularly like it, but I'm happy to ignore it for the moment. If you want to add the "ERROR at setup/teardown" as you suggest, please go ahead. Otherwise, I'll merge it as is. The only part of the PR that I have questions about is the part that you copied from what I wrote myself (from what I remember, one problem is that the interface of pytest is not well documented).

StefRe · 2020-04-29T18:57:25Z

If you want to add the "ERROR at setup/teardown" as you suggest, please go ahead.

I'll add it, but maybe it'll be tomorrow only.

StefRe · 2020-04-29T19:11:31Z

BTW, is it OK to use f-strings? CI for the plugin tests with python 3.6, 3.7 and 3.8 only, so it should be OK. Spyder itself, however, seems to support 2.7 too, as per README.md.

goanpeca · 2020-04-29T19:14:22Z

@StefRe for the time being it is best to not use f-strings (yet).

If an error occurs in the setup phase of a test, then the test itself (call phase) will not be executed and the status remains '---'. If an error occurs in the teardown phase after a passed test, then the status will be 'passed'. In both cases, the test category will be FAIL (red). In order to make this more easily comprehensible we prepend 'ERROR at setup: ' or 'ERROR at teardown: ' to the message in these cases.

if no reason was given in the test. Pytest gives no message whatsoever in this case (as opposed to skipped test without reason message). As we color xfailed tests green and xpassed tests red it is necessary to give some explanatory message as to why the test has the status xfailed or xpassed to avoid confusion about the color.

codecov-io · 2020-04-29T20:44:12Z

Codecov Report

Merging #151 into master will increase coverage by 0.03%.
The diff coverage is 100.00%.

Impacted Files	Coverage Δ
spyder_unittest/backend/pytestrunner.py	`97.75% <100.00%> (-0.15%)`	⬇️
spyder_unittest/backend/pytestworker.py	`94.73% <100.00%> (+4.73%)`	⬆️
spyder_unittest/widgets/datatree.py	`95.02% <0.00%> (-0.83%)`	⬇️

jitseniesen · 2020-04-30T09:03:40Z

Thanks very much!

StefRe added 2 commits April 27, 2020 16:24

Tests for refactoring pytest backend

88df15b

StefRe mentioned this pull request Apr 27, 2020

Fix test_pytestrunner_start #150

Merged

StefRe added 2 commits April 29, 2020 22:29

jitseniesen added type:bug v0.4.1 labels Apr 30, 2020

jitseniesen merged commit ff9999e into spyder-ide:master Apr 30, 2020

StefRe deleted the fix/pytest branch April 30, 2020 13:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix pytest backend #151

Fix pytest backend #151

StefRe commented Apr 27, 2020

StefRe commented Apr 27, 2020

StefRe commented Apr 27, 2020

jitseniesen commented Apr 28, 2020

StefRe commented Apr 29, 2020 •

edited

Loading

jitseniesen commented Apr 29, 2020

StefRe commented Apr 29, 2020

StefRe commented Apr 29, 2020

goanpeca commented Apr 29, 2020

codecov-io commented Apr 29, 2020

jitseniesen commented Apr 30, 2020

Fix pytest backend #151

Fix pytest backend #151

Conversation

StefRe commented Apr 27, 2020

StefRe commented Apr 27, 2020

StefRe commented Apr 27, 2020

jitseniesen commented Apr 28, 2020

StefRe commented Apr 29, 2020 • edited Loading

jitseniesen commented Apr 29, 2020

StefRe commented Apr 29, 2020

StefRe commented Apr 29, 2020

goanpeca commented Apr 29, 2020

codecov-io commented Apr 29, 2020

Codecov Report

jitseniesen commented Apr 30, 2020

StefRe commented Apr 29, 2020 •

edited

Loading