Fix `Optimize1qGatesDecomposition` length heuristic #6553

ecpeterson · 2021-06-09T23:44:27Z

Summary

The decision in Optimize1qGatesDecomposition whether to use the re-synthesized gate string or the original gate string is decided purely by length: we prefer whichever of the new string and the old string is shorter. This is a rough approximation to expected infidelity cost of the sequence when both strings are of native gates. An input string with non-native gates might be unnaturally short, hence persist into the output — a bug.

Details and comments

Before:

In [2]: from qiskit import QuantumCircuit
   ...: from qiskit.transpiler.passes import Optimize1qGatesDecomposition
   ...: import numpy as np
   ...: 
   ...: c = QuantumCircuit(1)
   ...: c.h(0)
   ...: c.ry(np.pi/2, 0)
   ...: print(Optimize1qGatesDecomposition(['sx', 'rz'])(c))
     ┌───┐┌─────────┐
q_0: ┤ H ├┤ RY(π/2) ├
     └───┘└─────────┘

After:

In [1]: from qiskit import QuantumCircuit
   ...: from qiskit.transpiler.passes import Optimize1qGatesDecomposition
   ...: import numpy as np
   ...: 
   ...: c = QuantumCircuit(1)
   ...: c.h(0)
   ...: c.ry(np.pi/2, 0)
   ...: print(Optimize1qGatesDecomposition(['sx', 'rz'])(c))
     ┌────┐┌────┐
q_0: ┤ √X ├┤ √X ├
     └────┘└────┘

I also removed some special casing on U3 and single gates that seemed to overlap with this oversight. Happy to reverse that.

ajavadia

looks good, two small comments

...tes/fixed-bug-in-Optimize1qGatesDecomposition-skipping-short-sequences-044a64740bf414a7.yaml

qiskit/transpiler/passes/optimization/optimize_1q_decomposition.py

levbishop · 2021-06-10T01:14:49Z

I wonder if we can't completely remove the check if the new run is shorter. With the changes in #5827 I'm hoping there should be no remaining cases where the resynthesized sequence is ever non-optimal (and if any do remain, I consider it a bug). Maybe for now get the check to raise an exception if the new sequence is longer?

ecpeterson · 2021-06-10T04:35:55Z

I added a branch for a warning, rather than an error. It would be obnoxious for a synthesis flaw to generate suboptimal programs, but probably worse to then prevent a user from running at all.

I'll check out the test failures tomorrow.

mtreinish

Overall this LGTM thanks for fixing this. I agree all that logic is out of date with #5827 now since the issues that single gate case were there for have been fixed. This will also probably improve performance a bit since we're not going to do an identity check on each standalone 1q gate. Just a few nits inline, but then I think this good to go.

Also can you update the release note to say you've also fixed #6473 because I believe this will fix that issue too. (basically just add a new bullet point to the release note yaml for the second fix).

qiskit/transpiler/passes/optimization/optimize_1q_decomposition.py

...tes/fixed-bug-in-Optimize1qGatesDecomposition-skipping-short-sequences-044a64740bf414a7.yaml

mtreinish · 2021-06-10T14:46:59Z

qiskit/transpiler/passes/optimization/optimize_1q_decomposition.py

@@ -102,7 +84,13 @@ def run(self, dag):
                new_circs.append(decomposer._decompose(operator))
            if new_circs:
                new_circ = min(new_circs, key=len)
-                if len(run) > len(new_circ) or (single_u3 and new_circ.data[0][0].name != "u3"):
+                if all(g.name in self.basis for g in run) and len(run) < len(new_circ):


Do you have an idea on what the runtime cost of this additional check is? If it's noticeable I'd rather not add this to just raise a warning because this will be called quite a lot during a transpile call.

I don't have data, but I imagine it's negligible relative to the nonnegotiable cost of constructing new_circ: self.basis tends not to be super large, and decomposer has to read all of run anyhow. I'll do a small amount of with/without benchmarking.

mtreinish · 2021-06-10T14:51:53Z

Oh I didn't see the test failures before reviewing and just assumed they all passed. We probably should get to the bottom of those. Most I think are fine and we just need to update the tests like test_short_string: https://dev.azure.com/qiskit-ci/qiskit-terra/_build/results?buildId=28674&view=logs&j=8eacdd59-c2e8-5617-75a5-8ed7c854a78f&t=5df005f4-d5f4-5d2d-0307-5b24b15488fc&l=12271 we can just update that to be X as the expected.

But test_euler_decomposition_worse might be an issue because it looks like the output is worse than what we had before:
https://dev.azure.com/qiskit-ci/qiskit-terra/_build/results?buildId=28674&view=logs&j=8eacdd59-c2e8-5617-75a5-8ed7c854a78f&t=5df005f4-d5f4-5d2d-0307-5b24b15488fc&l=12338

levbishop · 2021-06-10T17:47:40Z

But test_euler_decomposition_worse might be an issue because it looks like the output is worse than what we had before

Yeah this is tricky. The Euler Z(phi)X(theta)Z(lambda) decomposition should find an optimal expansion for 0<=theta<=pi. The input circuit has theta=-pi/2 outside that range, so the Euler decomposer includes the extra Z(). It wouldn't be crazy to change the Euler decomposer to check in case the negative theta := -theta allows such a simplification and emit that if its fewer gates.

On the other hand, both of these solutions are optimal in terms of X() gates which is what actually cost something for many hardware implementations.

ecpeterson · 2021-06-11T01:17:09Z

Some of the test problems were legitimate: the slot Optimize1qGatesDecomposition.basis apparently stores functions which map into the basis described by the basis init kwarg — and that kwarg itself gets tossed. So, ... in self.basis didn't do what I expected (or probably what people reading this PR expected).

Fixed that, updated some of the reference circuits in the test suite, but this still isn't quite good to go: test.python.compiler.test_transpiler.TestTranspile.test_transpile_calibrated_nonbasis_gate_on_diff_qubit checks that non-native gates do / don't get rewritten if they don't / do have calibration definitions, but this change makes rewriting more aggressive. Stay tuned.

mtreinish · 2021-06-11T11:18:25Z

Some of the test problems were legitimate: the slot Optimize1qGatesDecomposition.basis apparently stores functions which map into the basis described by the basis init kwarg — and that kwarg itself gets tossed. So, ... in self.basis didn't do what I expected (or probably what people reading this PR expected).

Ah good catch, yeah I missed that in my earlier review, self.basis is a list of the 1q decomposer instances for each target euler basis that is usable by the target basis. Feel free to change the attribute name to something more descriptive to avoid confusion in the future, it shouldn't be part of any public api as it's only used internally by the pass and we can probably just change it without worrying about backwards compatibility.

qiskit/transpiler/passes/optimization/optimize_1q_decomposition.py

mtreinish · 2021-06-12T17:37:04Z

I ran the quantum volume benchmarks from the asv benchmark suite on this just now and it causes a roughly 30% run time performance regression:

Benchmarks that have got worse:

       before           after         ratio
     [5f6db11d]       [75d1f0b7]
     <main>       <ecpeterson-bugfix/nonnative-1q-heuristic>
+        239±10ms         334±30ms     1.39  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(8, 'translator')
+       1.55±0.1s        2.05±0.2s     1.32  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(20, 'translator')
+      14.3±0.1ms      18.9±0.07ms     1.32  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(2, 'translator')
+        782±40ms         943±80ms     1.21  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(14, 'translator')
+      17.8±0.3ms       19.8±0.1ms     1.11  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(2, 'synthesis')

SOME BENCHMARKS HAVE CHANGED SIGNIFICANTLY.
PERFORMANCE DECREASED.

My assumption is that this is caused by all the times we're iterating over the gates in each run to check for calibrations now. I haven't profiled it yet but I can do that on monday. But until we get to the bottom of this I've removed the automerge label to prevent mergify from automerging this.

levbishop · 2021-07-03T13:29:57Z

It would probably be a good idea to add a couple of extra lines to ANGEXP_ZYZ and ANGEXP_PSX for TestOneQubitEulerSpecial to exercise the new code paths for simplifying by flipping the sign of theta.

Co-authored-by: Lev Bishop <[email protected]>

…pha)

ecpeterson · 2021-07-06T17:44:49Z

... ANGEXP_ZYZ and ANGEXP_PSX ...

Good idea. Done.

…pha)

…/qiskit-terra into bugfix/nonnative-1q-heuristic

mtreinish · 2021-07-06T18:49:47Z

I reran benchmarks just now, it looks like the performance regression from ealier has now been resolved and the performance is equivalent to the current main branch:

Benchmarks that have stayed the same:

       before           after         ratio
     [e9f06e45]       [2fdd628f]
     <main>           <ecpeterson-bugfix/nonnative-1q-heuristic>
           failed           failed      n/a  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(14, 'synthesis')
           failed           failed      n/a  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(20, 'synthesis')
           failed           failed      n/a  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(27, 'synthesis')
          5.83±1s          7.16±3s    ~1.23  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(8, 'synthesis')
          1.90±0s       2.23±0.04s    ~1.18  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(20, 'translator')
        3.60±0.2s       3.98±0.07s    ~1.10  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(27, 'translator')
       5.71±0.5ms       6.24±0.6ms     1.09  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(1, 'synthesis')
         33.7±3ms       35.1±0.5ms     1.04  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(3, 'translator')
         98.6±4ms          101±5ms     1.03  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(5, 'translator')
        720±100ms        739±500ms     1.03  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(5, 'synthesis')
          1.03±0s       1.05±0.06s     1.02  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(14, 'translator')
       23.5±0.2ms       23.8±0.2ms     1.02  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(2, 'synthesis')
         20.3±1ms      20.4±0.04ms     1.01  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(2, 'translator')
         344±20ms         319±20ms     0.93  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(8, 'translator')
       1.95±0.2ms      1.76±0.02ms    ~0.90  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(1, 'translator')
         130±80ms         106±60ms    ~0.82  quantum_volume.QuantumVolumeBenchmark.time_ibmq_backend_transpile(3, 'synthesis')

BENCHMARKS NOT SIGNIFICANTLY CHANGED.

I'm a bit concerned about that variance, especially as I ran this on my local benchmark system which is doesn't have anything else running on it. But it doesn't look outside the norm for the benchmark (my guess is that the seeding for the qv circuit generation isn't correct or something like that).

ecpeterson · 2021-07-06T18:59:45Z

I also see really strong variance across benchmarking runs, and it sounds worth looking into. I don't think the variance is something I introduced, but who knows what it might be covering up.

mtreinish

LGTM, one nit inline about docs formatting but it's not worth blocking over as it doesn't get published. I definitely appreciate all the comments added, it makes it easier to trace through things now.

mtreinish · 2021-07-06T20:10:43Z

qiskit/quantum_info/synthesis/one_qubit_decompose.py

+        """
+        Installs the angles phi, theta, and lam into a KAK-type decomposition of the form
+        K(phi) . A(theta) . K(lam) , where K and A are an orthogonal pair drawn from RZGate, RYGate,
+        and RXGate.
+
+        Behavior flags:
+            `simplify` indicates whether gates should be elided / coalesced where possible.
+            `allow_non_canonical` indicates whether we are permitted to reverse the sign of the
+                middle parameter, theta, in the output.  When this and `simplify` are both enabled,
+                we take the opportunity to commute half-rotations in the outer gates past the middle
+                gate, which permits us to coalesce them at the cost of reversing the sign of theta.
+
+        NOTE: The input value of `theta` is expected to lie in [0, pi).
+        """


Since this is a private method and neither the docs builds or linters care about docstring formatting on this isn't a big issue. But I wonder if it would be better to restructure this in the normal docstrig format for the napoleon plugin just to be consistent with the rest of the docs. If this were a built and published doc it wouldn't actually pass CI.

Comments addressed

a by-hand attempt at reformatting the docstring

levbishop

This was a marathon... thanks for sticking with it

ecpeterson added bug Something isn't working Changelog: Bugfix Include in the "Fixed" section of the changelog labels Jun 9, 2021

ecpeterson requested a review from mtreinish June 9, 2021 23:44

ecpeterson requested a review from a team as a code owner June 9, 2021 23:44

ajavadia reviewed Jun 10, 2021

View reviewed changes

...tes/fixed-bug-in-Optimize1qGatesDecomposition-skipping-short-sequences-044a64740bf414a7.yaml Show resolved Hide resolved

qiskit/transpiler/passes/optimization/optimize_1q_decomposition.py Outdated Show resolved Hide resolved

mtreinish reviewed Jun 10, 2021

View reviewed changes

mtreinish added this to the 0.18 milestone Jun 10, 2021

mtreinish self-assigned this Jun 10, 2021

ecpeterson requested review from DanPuzzuoli, eggerdj, lcapelluto, manoelmarques, nkanazawa1989, taalexander and woodsp-ibm as code owners June 11, 2021 01:17

ecpeterson requested a review from mtreinish June 11, 2021 21:51

ajavadia reviewed Jun 12, 2021

View reviewed changes

qiskit/transpiler/passes/optimization/optimize_1q_decomposition.py Outdated Show resolved Hide resolved

ajavadia previously approved these changes Jun 12, 2021

View reviewed changes

ajavadia added the automerge label Jun 12, 2021

mtreinish added on hold Can not fix yet and removed automerge on hold Can not fix yet labels Jun 12, 2021

ajavadia added this to the 0.18 milestone Jul 6, 2021

ecpeterson and others added 3 commits July 7, 2021 05:08

Update qiskit/quantum_info/synthesis/one_qubit_decompose.py

f8f5e5f

Co-authored-by: Lev Bishop <[email protected]>

Update qiskit/quantum_info/synthesis/one_qubit_decompose.py

a747b5a

Co-authored-by: Lev Bishop <[email protected]>

add some Euler special case tests for pushing a K(pi) through an A(al…

a5272ac

…pha)

ecpeterson and others added 3 commits July 7, 2021 05:45

Merge branch 'main' into bugfix/nonnative-1q-heuristic

4043322

add some Euler special case tests for pushing a K(pi) through an A(al…

d808985

…pha)

Merge branch 'bugfix/nonnative-1q-heuristic' of github.com:ecpeterson…

97b7f00

…/qiskit-terra into bugfix/nonnative-1q-heuristic

ajavadia previously approved these changes Jul 6, 2021

View reviewed changes

mtreinish previously approved these changes Jul 6, 2021

View reviewed changes

mtreinish added the automerge label Jul 6, 2021

Update one_qubit_decompose.py

c3e7d6c

a by-hand attempt at reformatting the docstring

ecpeterson dismissed stale reviews from mtreinish and ajavadia via c3e7d6c July 6, 2021 20:30

mtreinish previously approved these changes Jul 6, 2021

View reviewed changes

ok linter

75acef0

ecpeterson dismissed mtreinish’s stale review via 75acef0 July 6, 2021 21:30

mtreinish approved these changes Jul 6, 2021

View reviewed changes

levbishop approved these changes Jul 7, 2021

View reviewed changes

Merge branch 'main' into bugfix/nonnative-1q-heuristic

1ebb329

mergify bot merged commit b32a531 into Qiskit:main Jul 7, 2021

This was referenced Jul 8, 2021

Optimize1qGatesDecompositions does not optimize single (non-parameterized) epsilon rotations #6473

Closed

Absolute tolerance on ZXZ one qubit decomposition #6605

Closed

ecpeterson mentioned this pull request Aug 4, 2021

Tame 1Q simplification warning #6868

Merged

levbishop mentioned this pull request Aug 13, 2021

Test suite should error on unexpected warnings #6904

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix `Optimize1qGatesDecomposition` length heuristic #6553

Fix `Optimize1qGatesDecomposition` length heuristic #6553

ecpeterson commented Jun 9, 2021

ajavadia left a comment

levbishop commented Jun 10, 2021 •

edited

Loading

ecpeterson commented Jun 10, 2021 •

edited

Loading

mtreinish left a comment

mtreinish Jun 10, 2021

ecpeterson Jun 10, 2021

mtreinish commented Jun 10, 2021

levbishop commented Jun 10, 2021 •

edited

Loading

ecpeterson commented Jun 11, 2021

mtreinish commented Jun 11, 2021

mtreinish commented Jun 12, 2021 •

edited

Loading

levbishop commented Jul 3, 2021

ecpeterson commented Jul 6, 2021

mtreinish commented Jul 6, 2021

ecpeterson commented Jul 6, 2021

mtreinish left a comment

mtreinish Jul 6, 2021

levbishop left a comment

Fix Optimize1qGatesDecomposition length heuristic #6553

Fix Optimize1qGatesDecomposition length heuristic #6553

Conversation

ecpeterson commented Jun 9, 2021

Summary

Details and comments

ajavadia left a comment

Choose a reason for hiding this comment

levbishop commented Jun 10, 2021 • edited Loading

ecpeterson commented Jun 10, 2021 • edited Loading

mtreinish left a comment

Choose a reason for hiding this comment

mtreinish Jun 10, 2021

Choose a reason for hiding this comment

ecpeterson Jun 10, 2021

Choose a reason for hiding this comment

mtreinish commented Jun 10, 2021

levbishop commented Jun 10, 2021 • edited Loading

ecpeterson commented Jun 11, 2021

mtreinish commented Jun 11, 2021

mtreinish commented Jun 12, 2021 • edited Loading

levbishop commented Jul 3, 2021

ecpeterson commented Jul 6, 2021

mtreinish commented Jul 6, 2021

ecpeterson commented Jul 6, 2021

mtreinish left a comment

Choose a reason for hiding this comment

mtreinish Jul 6, 2021

Choose a reason for hiding this comment

levbishop left a comment

Choose a reason for hiding this comment

Fix `Optimize1qGatesDecomposition` length heuristic #6553

Fix `Optimize1qGatesDecomposition` length heuristic #6553

levbishop commented Jun 10, 2021 •

edited

Loading

ecpeterson commented Jun 10, 2021 •

edited

Loading

levbishop commented Jun 10, 2021 •

edited

Loading

mtreinish commented Jun 12, 2021 •

edited

Loading