Add `shape_unsafe` tag to rewrites that can hide shape errors #381

ricardoV94 · 2023-07-13T15:29:54Z

Remove BroadcastTo in favor of Alloc
Address spoken inconsistencies between Second / Alloc in rewrites
Add tag to easily exclude rewrites that can hide shape errors
Simplify such rewrites
Improve static shape of Alloc

Note that from a user standpoint, providing static shapes (via vector("x", shape=(5,)) or specify_shapes) will many times reveal shape errors immediately (this is the case for 99% of PyMC models). In this case users should feel pretty safe about "shape_unsafe" rewrites because they aren't really masking anything that wasn't checked before already.

Alloc.make_node now also raises early when it can see the provided shape is inconsistent. Alloc and Elemwise make up all of the tagged "shape_unsafe" rewrites so far.

With this PR, users can also do mode=get_default_mode().excluding("shape_unsafe") or add shape_unsafe to the excluding config to skip these rewrites at the cost of less optimizations.

Closes #367

ricardoV94 · 2023-07-13T15:43:06Z

pytensor/tensor/extra_ops.py

@@ -1757,7 +1616,19 @@ def broadcast_arrays(*args: TensorVariable) -> Tuple[TensorVariable, ...]:
        The arrays to broadcast.

    """
-    return tuple(broadcast_to(a, broadcast_shape(*args)) for a in args)
+
+    def broadcast_with_others(a, others):


We discussed with @aseyboldt that it may make sense to generalize Second so that it accepts arbitrary many inputs and returns every variable as output. This would become a flat broadcast_arrays once Elemwised, and make rewrites easier to read. By overriding the __str__ we can also make it much more readable in debug_print than the current nested Second

codecov-commenter · 2023-07-14T12:14:36Z

Codecov Report

Merging #381 (2a3adbe) into main (7218431) will decrease coverage by 0.03%.
The diff coverage is 90.90%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #381      +/-   ##
==========================================
- Coverage   80.44%   80.41%   -0.03%     
==========================================
  Files         156      156              
  Lines       45470    45413      -57     
  Branches    11136    11119      -17     
==========================================
- Hits        36578    36520      -58     
- Misses       6687     6693       +6     
+ Partials     2205     2200       -5

Impacted Files	Coverage Δ
pytensor/link/jax/dispatch/extra_ops.py	`72.58% <ø> (-3.48%)`	⬇️
pytensor/link/numba/dispatch/extra_ops.py	`91.78% <ø> (-0.47%)`	⬇️
pytensor/tensor/rewriting/elemwise.py	`88.99% <ø> (ø)`
pytensor/tensor/rewriting/math.py	`86.32% <72.72%> (-0.05%)`	⬇️
pytensor/tensor/rewriting/basic.py	`94.01% <95.74%> (+0.48%)`	⬆️
pytensor/configdefaults.py	`65.92% <100.00%> (-0.10%)`	⬇️
pytensor/tensor/basic.py	`90.82% <100.00%> (+0.05%)`	⬆️
pytensor/tensor/extra_ops.py	`88.53% <100.00%> (-0.48%)`	⬇️
pytensor/tensor/rewriting/extra_ops.py	`88.23% <100.00%> (-1.02%)`	⬇️

... and 9 files with indirect coverage changes

ricardoV94 · 2023-07-14T13:13:38Z

Fixing #379 should also help with the "unsafety" concerns

pytensor/tensor/basic.py

aseyboldt · 2023-08-04T16:55:35Z

pytensor/tensor/extra_ops.py

@@ -1561,141 +1561,6 @@ def broadcast_shape_iter(
    return tuple(result_dims)


-class BroadcastTo(COp):


BroadcastTo is imported in pymc a couple of times. Maybe we should leave an empty Op here, that is deprecated and doesn't do anything?

Some of the removed rewrites are also directly imported.

This shouldn't be a problem however. I marked this PR as a major release so we will bump the version above the upper-bound pinned by PyMC. When we update the pin on PyMC I'll address the changes. They require some manual review anyway to see if the logic that depended on BroadcastTo was valid per our new rules and can be transferred to Alloc.

This was all on the logprob inference module AFAICT so impact should be pretty contained.

sounds good

aseyboldt · 2023-08-04T17:06:46Z

Other than the two suggestions above this looks good :-)

ricardoV94 changed the title ~~Remove BroadcastTo~~ Remove BroadcastTo and add shape_unsafe tag to rewrites that make shape assumptions Jul 13, 2023

ricardoV94 changed the title ~~Remove BroadcastTo and add shape_unsafe tag to rewrites that make shape assumptions~~ Remove BroadcastTo and add shape_unsafe tag to rewrites that can hide shape errors Jul 13, 2023

ricardoV94 changed the title ~~Remove BroadcastTo and add shape_unsafe tag to rewrites that can hide shape errors~~ Add shape_unsafe tag to rewrites that can hide shape errors Jul 13, 2023

ricardoV94 commented Jul 13, 2023

View reviewed changes

ricardoV94 force-pushed the cleanup_broadcast branch 7 times, most recently from 1f614bd to f1d19f7 Compare July 14, 2023 11:16

ricardoV94 added enhancement New feature or request major graph rewriting labels Jul 14, 2023

ricardoV94 force-pushed the cleanup_broadcast branch from f1d19f7 to 2a3adbe Compare July 14, 2023 11:37

ricardoV94 marked this pull request as ready for review July 14, 2023 12:43

ricardoV94 requested a review from aseyboldt July 14, 2023 12:43

ricardoV94 mentioned this pull request Jul 19, 2023

Forbid runtime broadcasting by Alloc #390

Merged

aseyboldt reviewed Aug 4, 2023

View reviewed changes

pytensor/tensor/basic.py Outdated Show resolved Hide resolved

aseyboldt reviewed Aug 4, 2023

View reviewed changes

ricardoV94 added 7 commits August 7, 2023 10:13

Clarify behavior of Elemwise second

70033c9

Use second for broadcast_arrays and remove fill_chain helper

28037cd

Refactor encompasses_broadcastable to broadcasted_by

add8d5f

Rename broadcast_like to alloc_like

dd8462a

Be consistent about second vs alloc in rewrites

bd918db

Tag rewrites that make shape assumptions

548c14a

Simplify rewrites by assuming Elemwise / Alloc shapes are correct

2ac8774

ricardoV94 added 2 commits August 7, 2023 10:13

Remove BroadcastTo

84c46f1

Incorporate static shape of Alloc input

9f8ed94

ricardoV94 force-pushed the cleanup_broadcast branch from 2a3adbe to 9f8ed94 Compare August 7, 2023 08:14

ricardoV94 requested a review from aseyboldt August 7, 2023 10:07

aseyboldt approved these changes Aug 7, 2023

View reviewed changes

ricardoV94 merged commit c6b0858 into pymc-devs:main Aug 7, 2023
51 of 52 checks passed

ricardoV94 deleted the cleanup_broadcast branch August 7, 2023 12:45

ricardoV94 mentioned this pull request Jun 24, 2024

Rewrite determinant of diagonal matrix as product of diagonal #797

Merged

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `shape_unsafe` tag to rewrites that can hide shape errors #381

Add `shape_unsafe` tag to rewrites that can hide shape errors #381

ricardoV94 commented Jul 13, 2023 •

edited

Loading

ricardoV94 Jul 13, 2023 •

edited

Loading

codecov-commenter commented Jul 14, 2023

ricardoV94 commented Jul 14, 2023

aseyboldt Aug 4, 2023

ricardoV94 Aug 7, 2023 •

edited

Loading

aseyboldt Aug 7, 2023

aseyboldt commented Aug 4, 2023

		@@ -1561,141 +1561,6 @@ def broadcast_shape_iter(
		return tuple(result_dims)


		class BroadcastTo(COp):

Add shape_unsafe tag to rewrites that can hide shape errors #381

Add shape_unsafe tag to rewrites that can hide shape errors #381

Conversation

ricardoV94 commented Jul 13, 2023 • edited Loading

ricardoV94 Jul 13, 2023 • edited Loading

Choose a reason for hiding this comment

codecov-commenter commented Jul 14, 2023

Codecov Report

ricardoV94 commented Jul 14, 2023

aseyboldt Aug 4, 2023

Choose a reason for hiding this comment

ricardoV94 Aug 7, 2023 • edited Loading

Choose a reason for hiding this comment

aseyboldt Aug 7, 2023

Choose a reason for hiding this comment

aseyboldt commented Aug 4, 2023

Add `shape_unsafe` tag to rewrites that can hide shape errors #381

Add `shape_unsafe` tag to rewrites that can hide shape errors #381

ricardoV94 commented Jul 13, 2023 •

edited

Loading

ricardoV94 Jul 13, 2023 •

edited

Loading

ricardoV94 Aug 7, 2023 •

edited

Loading