
🏁 Windows Support #266

Merged
merged 15 commits into main on Sep 10, 2024

Conversation

burgholzer
Member

@burgholzer burgholzer commented Aug 26, 2024

Description

This PR adds continuous testing and deployment for Windows.
To this end, it switches to the reusable MQT workflows for CI and CD.
Along the way, it fixes a couple of errors that only revealed themselves on Windows.

Checklist:

  • The pull request only contains commits that are related to it.
  • I have added appropriate tests and documentation.
  • I have made sure that all CI jobs on GitHub pass.
  • The pull request introduces no new warnings and follows the project's style guidelines.

@burgholzer burgholzer added the enhancement (Enhancement to existing feature), dependencies (Pull requests that update a dependency file), continuous integration (Anything related to the CI setup), packaging (Anything related to Python packaging), usability (Anything related to usability), python (Pull requests that update Python code), minor (Minor version update), code quality (Code quality improvements), and refactor (Changes that refactor the code base) labels Aug 26, 2024
@burgholzer burgholzer self-assigned this Aug 26, 2024
@github-advanced-security

This pull request sets up GitHub code scanning for this repository. Once the scans have completed and the checks have passed, the analysis results for this pull request branch will appear on this overview. Once you merge this pull request, the 'Security' tab will show more code scanning analysis results (for example, for the default branch). Depending on your configuration and choice of analysis tool, future pull requests will be annotated with code scanning analysis results. For more information about GitHub code scanning, check out the documentation.


codecov bot commented Aug 26, 2024

Codecov Report

Attention: Patch coverage is 65.67164% with 23 lines in your changes missing coverage. Please review.

Project coverage is 83.6%. Comparing base (361c4ae) to head (ceea10c).
Report is 80 commits behind head on main.

Files with missing lines    Patch %   Lines
src/UFDecoder.cpp           45.9%     20 Missing ⚠️
include/Code.hpp            81.2%     3 Missing ⚠️

@@           Coverage Diff           @@
##            main    #266     +/-   ##
=======================================
- Coverage   83.6%   83.6%   -0.1%     
=======================================
  Files         49      49             
  Lines       4162    4159      -3     
  Branches     372     372             
=======================================
- Hits        3481    3477      -4     
- Misses       681     682      +1     
Flag     Coverage Δ
cpp      83.6% <56.6%> (-0.3%) ⬇️
python   83.5% <100.0%> (+0.1%) ⬆️

Files with missing lines                                 Coverage Δ
...mation_decoding/simulators/memory_experiment_v2.py    89.6% <ø> (+5.1%) ⬆️
...mation_decoding/simulators/quasi_single_shot_v2.py    69.9% <100.0%> (ø)
...alog_information_decoding/simulators/simulation.py    82.7% <100.0%> (ø)
src/mqt/qecc/cc_decoder/decoder.py                       98.2% <100.0%> (ø)
include/Code.hpp                                         79.5% <81.2%> (-0.4%) ⬇️
src/UFDecoder.cpp                                        61.2% <45.9%> (-2.6%) ⬇️

@burgholzer burgholzer linked an issue Aug 27, 2024 that may be closed by this pull request
@burgholzer burgholzer force-pushed the infrastructure-update branch from d334c28 to 8dda0b1 Compare August 31, 2024 11:24
@burgholzer burgholzer changed the title ✨ Infrastructure Update 🏁 Windows Support Aug 31, 2024
@burgholzer burgholzer removed the dependencies (Pull requests that update a dependency file), python (Pull requests that update Python code), code quality (Code quality improvements), refactor (Changes that refactor the code base), and packaging (Anything related to Python packaging) labels Aug 31, 2024
@burgholzer
Member Author

Alright. This is looking much better now already.

@pehamTom I just removed the timeouts from the state prep tests and that seems to have done it. Although some of these tests (sometimes) seem to take forever. Do you see any kind of way to reduce the tests or to speed them up?
Some of the (simulation) tests also sporadically fail because a certain threshold is not reached. Could you maybe fix the random seed for these kind of tests so that the results are reproducible and less flaky?
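Fixing the seed usually means threading an explicit RNG through the simulation. A minimal sketch, assuming a NumPy-based simulation and pytest-style tests (`run_simulation` and the threshold value are hypothetical stand-ins, not the project's actual API):

```python
import numpy as np


def run_simulation(rng: np.random.Generator, shots: int = 1000) -> float:
    """Toy stand-in for a decoding simulation: returns an observed rate."""
    samples = rng.random(shots)
    return float(np.mean(samples < 0.1))


def test_threshold_reproducible():
    # A fixed seed makes the sampled error rate deterministic, so the
    # threshold assertion can no longer fail sporadically across CI runs.
    rng = np.random.default_rng(seed=42)
    rate = run_simulation(rng)
    assert abs(rate - 0.1) < 0.05
```

With `np.random.default_rng(seed)`, the same seed always reproduces the same sample stream, so reruns of the test see identical results.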

@lucasberent could you please look at the change here: b58b604
This was necessary to fix one of the tests on Windows. However, the test now fails on Ubuntu and macOS. Do you see any reason for that?
I was somewhat suspicious that the following two lines are not the same despite having the same comment:

@pehamTom
Member

pehamTom commented Sep 2, 2024

> @pehamTom I just removed the timeouts from the state prep tests and that seems to have done it. Although some of these tests (sometimes) seem to take forever. Do you see any kind of way to reduce the tests or to speed them up? Some of the (simulation) tests also sporadically fail because a certain threshold is not reached. Could you maybe fix the random seed for these kind of tests so that the results are reproducible and less flaky?

Well, the timeout is there, so the SAT solver doesn't spend a long time on unsatisfiable instances. I can try to trim down the tests further. But again, there are not many small code instances one can meaningfully test.

@burgholzer
Member Author

> > @pehamTom I just removed the timeouts from the state prep tests and that seems to have done it. Although some of these tests (sometimes) seem to take forever. Do you see any kind of way to reduce the tests or to speed them up? Some of the (simulation) tests also sporadically fail because a certain threshold is not reached. Could you maybe fix the random seed for these kind of tests so that the results are reproducible and less flaky?
>
> Well, the timeout is there, so the SAT solver doesn't spend a long time on unsatisfiable instances. I can try to trim down the tests further. But again, there are not many small code instances one can meaningfully test.

At least on Windows that has led to basically none of the synthesis tasks returning a circuit (see the CI logs from a couple commits ago).
It's odd, but not completely unexpected, that performance on Windows is so different from the Unix systems.
Maybe relaxing the timeouts a little bit instead of removing them would be even better.
Just seeking a pragmatic solution here.
The current CI run shows that it works in principle given a long enough timeout.

@lucasberent
Collaborator

lucasberent commented Sep 3, 2024

> @lucasberent could you please look at the change here: b58b604 This was necessary to fix one of the tests on Windows. However, the test now fails on Ubuntu and macOS. Do you see any reason for that? I was somewhat suspicious that the following two lines are not the same despite having the same comment:

The test should be correct as is; the comment is wrong, and the estimate should be [1,0,0].

@burgholzer
Member Author

> > Alright. This is looking much better now already.
> >
> > @pehamTom I just removed the timeouts from the state prep tests and that seems to have done it. Although some of these tests (sometimes) seem to take forever. Do you see any kind of way to reduce the tests or to speed them up? Some of the (simulation) tests also sporadically fail because a certain threshold is not reached. Could you maybe fix the random seed for these kind of tests so that the results are reproducible and less flaky?
> >
> > @lucasberent could you please look at the change here: b58b604 This was necessary to fix one of the tests on Windows. However, the test now fails on Ubuntu and macOS. Do you see any reason for that? I was somewhat suspicious that the following two lines are not the same despite having the same comment:
>
> The test should be correct as is; the comment is wrong, and the estimate should be [1,0,0].

In that case, something is going wrong in the decoder on Windows as it always yields [0, 0, 0]. Is there anything that you could think of that could cause this?
I'll revert the test change for now.

@lucasberent
Collaborator

> In that case, something is going wrong in the decoder on Windows as it always yields [0, 0, 0]. Is there anything that you could think of that could cause this? I'll revert the test change for now.

Perhaps something with the np arrays? Maybe using 'astype' helps; otherwise it's unclear to me.
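One concrete platform difference that can bite here: with NumPy < 2, the default integer dtype of an array built from Python ints is 32-bit on Windows but 64-bit on Linux and macOS. A small sketch of pinning the dtype explicitly with `astype` (the array contents are illustrative, not the actual decoder data):

```python
import numpy as np

# With NumPy < 2, np.array([...]) of Python ints defaults to int32 on
# Windows and int64 on Linux/macOS, which can silently change behavior
# wherever dtypes are compared or values overflow.
syndrome = np.array([1, 0, 0])        # platform-dependent integer dtype
estimate = syndrome.astype(np.int64)  # pin an explicit dtype everywhere

assert estimate.dtype == np.int64
assert estimate.tolist() == [1, 0, 0]
```

Passing an explicit `dtype=` at array creation achieves the same thing without the extra copy that `astype` makes.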

@burgholzer burgholzer force-pushed the infrastructure-update branch from 9885cf5 to 02f604d Compare September 9, 2024 10:38
@burgholzer
Member Author

Ok. pymatching 2.2.1 already puts a pin on numpy < 2 so we don't have to.
Hopefully, all tests pass now and this can be merged. I'll then proceed to release a new version, which will be the first one to ship Windows wheels 🎉
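Because the pin arrives transitively, the project's own metadata can stay loose. A hypothetical pyproject.toml fragment illustrating the idea (not this repository's actual file):

```toml
[project]
dependencies = [
    # pymatching 2.2.1 already constrains numpy to < 2,
    # so an explicit "numpy<2" pin here would be redundant.
    "pymatching>=2.2.1",
]
```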

Also cleaned up the PR history a little bit so it becomes clearer what changed.

@burgholzer burgholzer force-pushed the infrastructure-update branch from 02f604d to 6b2ab17 Compare September 9, 2024 10:45
@burgholzer burgholzer force-pushed the infrastructure-update branch 2 times, most recently from fb9cbb8 to 069d624 Compare September 9, 2024 20:40
@burgholzer
Member Author

@pehamTom sorry that I have to bother you again, but I can't seem to get the ft simulation tests to work under Windows with Python 3.12, even after increasing the number of shots by a factor of 10.
The logical error rate seems to be quite off in the two ft simulation tests.
Windows with Python 3.9 seems to be fine and is not complaining.
Any idea what could be causing this?

@pehamTom
Member

> @pehamTom sorry that I have to bother you again, but I can't seem to get the ft simulation tests to work under Windows with Python 3.12, even after increasing the number of shots by a factor of 10. The logical error rate seems to be quite off in the two ft simulation tests. Windows with Python 3.9 seems to be fine and is not complaining. Any idea what could be causing this?

It was another timeout issue with the state preparation. Since some optimal circuits are synthesized for the simulation, they timed out and returned a non-verified circuit.

But now the time to run the tests is 77 minutes on Windows with Python 3.12. I don't think that is an acceptable amount of time. I think the only way to circumvent this is to either not test the SAT solution for the state preparation and verification circuit synthesis or test the individual SAT formulas used in the encoding separately.

@burgholzer
Member Author

> > @pehamTom sorry that I have to bother you again, but I can't seem to get the ft simulation tests to work under Windows with Python 3.12, even after increasing the number of shots by a factor of 10. The logical error rate seems to be quite off in the two ft simulation tests. Windows with Python 3.9 seems to be fine and is not complaining. Any idea what could be causing this?
>
> It was another timeout issue with the state preparation. Since some optimal circuits are synthesized for the simulation, they timed out and returned a non-verified circuit.
>
> But now the time to run the tests is 77 minutes on Windows with Python 3.12. I don't think that is an acceptable amount of time. I think the only way to circumvent this is to either not test the SAT solution for the state preparation and verification circuit synthesis or test the individual SAT formulas used in the encoding separately.

Yeah. That's somewhat rough.
What about the most pragmatic solution of not running these tests under Windows but running them under all other operating systems?
It's not ideal, but at least we don't completely give up testing the functionality.
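Pytest supports exactly this kind of platform-conditional skip. A sketch, assuming the suite uses pytest (the test name and reason string are hypothetical):

```python
import sys

import pytest

# Skip only on Windows; the test still runs on Linux and macOS.
skip_on_windows = pytest.mark.skipif(
    sys.platform == "win32",
    reason="SAT-based synthesis is too slow on Windows runners",
)


@skip_on_windows
def test_ft_simulation():
    # Placeholder body; the real test would run the ft simulation here.
    ...
```

The marker keeps the skip visible in the test report (reported as "skipped" with the reason), rather than silently dropping coverage on one platform.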

@pehamTom
Member

> > > @pehamTom sorry that I have to bother you again, but I can't seem to get the ft simulation tests to work under Windows with Python 3.12, even after increasing the number of shots by a factor of 10. The logical error rate seems to be quite off in the two ft simulation tests. Windows with Python 3.9 seems to be fine and is not complaining. Any idea what could be causing this?
> >
> > It was another timeout issue with the state preparation. Since some optimal circuits are synthesized for the simulation, they timed out and returned a non-verified circuit.
> > But now the time to run the tests is 77 minutes on Windows with Python 3.12. I don't think that is an acceptable amount of time. I think the only way to circumvent this is to either not test the SAT solution for the state preparation and verification circuit synthesis or test the individual SAT formulas used in the encoding separately.
>
> Yeah. That's somewhat rough. What about the most pragmatic solution of not running these tests under Windows but running them under all other operating systems? It's not ideal, but at least we don't completely give up testing the functionality.

That would be fine by me.

@burgholzer burgholzer force-pushed the infrastructure-update branch from 3088d85 to c3dc83a Compare September 10, 2024 20:09
@burgholzer burgholzer force-pushed the infrastructure-update branch from c3dc83a to 16e4a65 Compare September 10, 2024 20:32
@burgholzer burgholzer force-pushed the infrastructure-update branch from 16e4a65 to 25eb6b8 Compare September 10, 2024 20:45
@burgholzer burgholzer merged commit 3cf545e into main Sep 10, 2024
42 of 43 checks passed
@burgholzer burgholzer deleted the infrastructure-update branch September 10, 2024 21:52
@burgholzer
Member Author

Everything is passing and I am glad that this is finally in 🎉
Nice step forward!

Labels
continuous integration (Anything related to the CI setup), enhancement (Enhancement to existing feature), minor (Minor version update), usability (Anything related to usability)
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

Fix py wheels windows
3 participants