MST Test Suite integration #42

DavidBuchanan314 · 2024-12-11T18:13:30Z

The tests take ~30 seconds to run on my M1 Pro MacBook (with failing asserts commented out) - not too bad, but I anticipate adding a lot more tests to the suite.

Due to the slowness, I deliberately named the test file such that python -m unittest discover won't find it. It can be manually invoked via python -m unittest arroba.tests.mst_test_suite.
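For context, unittest discovery only picks up files matching the pattern test*.py by default, which is why a file named mst_test_suite.py is skipped. A minimal sketch of that matching, using fnmatch for illustration (test_mst.py is a made-up file name, and is_discovered is a hypothetical helper, not part of unittest):

```python
from fnmatch import fnmatch

# Default file pattern used by `python -m unittest discover`.
DEFAULT_PATTERN = "test*.py"


def is_discovered(filename, pattern=DEFAULT_PATTERN):
    """Return True if a file name matches the discovery pattern."""
    return fnmatch(filename, pattern)


# "test_mst.py" is a hypothetical file name, used only for contrast.
candidates = ["test_mst.py", "mst_test_suite.py"]
discovered = [name for name in candidates if is_discovered(name)]
print(discovered)
```

Passing a different pattern with -p (for example -p "mst_test*.py") should opt the file back in when that is wanted.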

The test suite itself is still in a state of flux, hence this is a draft PR for now.

Currently, it only tests basic MST diffing (detection of created, updated, and deleted records, and of new and deleted MST blocks).

Unfortunately the tests currently don't pass - Diff.new_cids is flaky (which you note in one of your existing diff tests). Everything else seems good though! (Due to the symmetry of diffing, you could in theory compute the reverse diff and take its Diff.removed_cids to get a correct result - but presumably you want to fix it properly heh)
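To illustrate the symmetry argument, here is a rough sketch that models each tree as a plain set of block CIDs. BlockDiff, diff_blocks and new_cids_via_reverse are hypothetical names for this example only; this is not arroba's Diff implementation.

```python
from dataclasses import dataclass


@dataclass
class BlockDiff:
    """Toy stand-in for a diff result; not arroba's Diff class."""
    new_cids: set
    removed_cids: set


def diff_blocks(a, b):
    """Diff two trees modelled as sets of block CIDs (a is old, b is new)."""
    return BlockDiff(new_cids=b - a, removed_cids=a - b)


def new_cids_via_reverse(a, b):
    """Blocks new in b are exactly the blocks removed when going from b to a."""
    return diff_blocks(b, a).removed_cids


old_blocks = {"cid-1", "cid-2"}
new_blocks = {"cid-2", "cid-3"}
forward = diff_blocks(old_blocks, new_blocks)
```

In this toy model the two routes agree by construction; the workaround only helps in a real implementation if removed_cids is computed correctly.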

DavidBuchanan314 · 2024-12-11T18:14:22Z

Note to self, I should probably add something to the README about how to init the submodule

DavidBuchanan314 · 2024-12-11T18:18:01Z

arroba/tests/mst_test_suite.py

+            return car_header["roots"][0]
+
+    def test_diffs(self):
+        for testname, testcase in tqdm(self.diff_testcases.items()):


oh, and I used tqdm for a progress bar - not super necessary. I should either take it out or add it as an optional dependency
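One way to treat tqdm as an optional dependency is a fallback import. This is only a sketch of the idea, not what the PR ended up doing:

```python
try:
    from tqdm import tqdm
except ImportError:
    def tqdm(iterable, **kwargs):
        """No-op fallback: iterate without a progress bar if tqdm is missing."""
        return iterable


# The loop body is the same whether or not tqdm is installed.
seen = [name for name in tqdm(["case_a", "case_b", "case_c"])]
```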

DavidBuchanan314 · 2024-12-11T19:52:10Z

arroba/tests/mst_test_suite.py

+            reference_cid_set = set(x[0] for x in reference_blocks) # just look at the cids from the car
+
+            self.assertEqual(root_b, reference_root, f"{testname} inverse: new root") # fails occasionally
+            self.assertEqual(diff.new_cids, reference_cid_set, f"{testname} inverse: new cid set") # basically always fails, I think I'm doing something wrong


the gist of this test case is: take mst_a, apply the list of ops to it, and compare the result to mst_b. The list of CIDs I'm getting from null_diff(mst).new_cids looks completely different to what I expect, so I suspect I'm doing something wrong here (or maybe null_diff() is just very broken heh)

Hah. Both are possible I guess! arroba's diff code is used in production, but definitely not as broadly as it could be, and not all parts, so big bugs are very possible.
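The shape of the test case described in this thread (start from one state, apply ops, compare with the expected state) can be sketched with plain dicts standing in for the trees. The op tuple format, the record keys and apply_ops are all hypothetical, chosen for this example rather than taken from the test suite or arroba:

```python
def apply_ops(records, ops):
    """Return a copy of records with (action, key, value) ops applied."""
    result = dict(records)
    for action, key, value in ops:
        if action in ("create", "update"):
            result[key] = value
        elif action == "delete":
            del result[key]
        else:
            raise ValueError(f"unknown action: {action}")
    return result


# Hypothetical record keys and CIDs, standing in for mst_a and mst_b.
records_a = {"app.example/1": "cid-a", "app.example/2": "cid-b"}
ops = [
    ("update", "app.example/1", "cid-c"),
    ("delete", "app.example/2", None),
    ("create", "app.example/3", "cid-d"),
]
records_b = {"app.example/1": "cid-c", "app.example/3": "cid-d"}

result = apply_ops(records_a, ops)
```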

snarfed

This is awesome! Really psyched for this test suite. Thank you so much for putting it together, and for this PR! Especially appreciate you seeing and using arroba-idiomatic parts like testutil.TestCase.

Minor nit, you'll probably want to use with self.subTest(...) for the individual test cases to get unittest to see and run and count them individually. Alternatively you can generate them programmatically, eg https://github.com/snarfed/granary/blob/c29dd723f8b82ca5874f2122de64b2e9cd2c1fd7/granary/tests/test_testdata.py#L144-L156 , but that's more complicated and less idiomatic.

I'll also mention making the test suite pip-installable just for the record here, even just from GitHub if not PyPI. A submodule is ok, a pip package is definitely nicer.

Regardless, this is exciting!

snarfed · 2024-12-11T19:55:35Z

Oh and I like separating it out from unittest discover, but I'd happily run it in CI, 30s is totally fine. Example past runs: https://app.circleci.com/pipelines/github/snarfed/arroba

DavidBuchanan314 · 2024-12-11T20:10:31Z

ah yes, subTest is what I was looking for. Thanks for the feedback!

Current stats:

Ran 2 tests in 79.933s

FAILED (failures=30596)

(some of these are plausibly bugs in the test cases themselves, since I generated them with my own code which has low test coverage itself!)

snarfed · 2024-12-12T16:50:42Z

Thanks! I'm happy to merge this whenever. Lots more to do, sure, but it's contained and harmless and a great start as is. Up to you though!

snarfed · 2024-12-12T16:50:48Z

Discord thread: https://discord.com/channels/1097580399187738645/1316355962633982015

DavidBuchanan314 · 2024-12-12T18:13:42Z

Cool, feel free to merge it now. When I add new tests it should just be a case of updating the submodule, and if I add new test types I can PR those in too.

I'll think about making it a pip-installable package in the future, but submodule definitely works for now.

I just bumped the submodule version; no changes to the tests themselves, but it now includes my visualisation script.

snarfed · 2024-12-12T19:10:48Z

Thanks again, this is so great!

snarfed · 2024-12-12T19:18:57Z

[cracks knuckles] time to get to work

mst-test-suite integration

51fb0f6

add 'inverse' diff test case

8f724e5

drop tqdm, use self.subTest()

5dd35e6

bump submodule

1ab91f6

DavidBuchanan314 marked this pull request as ready for review December 12, 2024 18:13

snarfed merged commit 507956c into snarfed:main Dec 12, 2024
2 checks passed
