Derive logprob of matmul #7542

Merged: 3 commits into pymc-devs:main on Oct 21, 2024

Conversation

ricardoV94 (Member) commented on Oct 18, 2024

This can be pretty useful for timeseries models as well as normalizing flows.

Here is a simple example:

```python
import numpy as np
import pytensor.tensor as pt

import pymc as pm

rng = np.random.default_rng(37)
x = pm.MvNormal.dist(cov=np.eye(2), size=(128,))

n_layers = 3
A = pt.tensor("A", shape=(n_layers, 2, 2))
b = pt.tensor("b", shape=(n_layers, 2))

# Repeated layers of Affine transform -> Tanh
y = x
for i in range(n_layers):
    # Affine layer applied to each of the 128 draws: (128, 2) @ (2, 2) -> (128, 2)
    y = y @ A[i] + b[i]
    # A parametrized leaky-relu would be nicer: https://github.com/pymc-devs/pymc/issues/7543
    # y = pt.switch(y > 0, y, c[i] * y)
    y = pt.tanh(y)

A_test = rng.normal(size=A.type.shape)
b_test = rng.normal(size=b.type.shape)
y_test = rng.uniform(-1, 1, size=y.type.shape)
pm.logp(y, y_test).sum().eval({A: A_test, b: b_test})  # array(-3.54498234)
```
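
On the time-series side, here is a minimal sketch (not from the PR; the helper shapes and values are illustrative) of writing a Gaussian random walk as a matmul with a lower-triangular summation matrix, so the new derivation can recover its logp. It assumes the rewrite also accepts iid normal innovations as the measurable input:

```python
# Sketch only: a random walk as L @ innovations, where L is the
# cumulative-sum matrix (lower-triangular ones, det = 1).
import numpy as np
import pytensor.tensor as pt

import pymc as pm

steps = 5
innovations = pm.Normal.dist(size=(steps,))
L = pt.tril(pt.ones((steps, steps)))  # cumulative-sum matrix
walk = L @ innovations                # walk[t] = innovations[: t + 1].sum()

walk_value = np.linspace(-1, 1, steps)
pm.logp(walk, walk_value).sum().eval()
```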

codecov bot commented on Oct 18, 2024

Codecov Report

Attention: Patch coverage is 87.77778% with 11 lines in your changes missing coverage. Please review.

Project coverage is 92.82%. Comparing base (5352798) to head (5e5e077).

| Files with missing lines | Patch % | Lines |
|--------------------------|---------|-------|
| pymc/logprob/tensor.py   | 79.41%  | 7 Missing ⚠️ |
| pymc/logprob/linalg.py   | 91.48%  | 4 Missing ⚠️ |

Additional details and impacted files:

```diff
@@            Coverage Diff             @@
##             main    #7542      +/-   ##
==========================================
- Coverage   92.85%   92.82%   -0.04%     
==========================================
  Files         105      106       +1     
  Lines       17591    17669      +78     
==========================================
+ Hits        16335    16402      +67     
- Misses       1256     1267      +11     
```

| Files with missing lines | Coverage Δ |
|--------------------------|------------|
| pymc/logprob/__init__.py | 100.00% <100.00%> (ø) |
| pymc/logprob/abstract.py | 94.28% <100.00%> (+0.16%) ⬆️ |
| pymc/logprob/basic.py | 94.28% <100.00%> (ø) |
| pymc/logprob/mixture.py | 95.70% <100.00%> (ø) |
| pymc/logprob/rewriting.py | 100.00% <100.00%> (ø) |
| pymc/logprob/scan.py | 94.90% <ø> (ø) |
| pymc/logprob/transform_value.py | 98.14% <100.00%> (ø) |
| pymc/logprob/utils.py | 92.46% <100.00%> (ø) |
| pymc/logprob/linalg.py | 91.48% <91.48%> (ø) |
| pymc/logprob/tensor.py | 94.48% <79.41%> (-5.52%) ⬇️ |

```diff
@@ -320,7 +320,7 @@ def find_negated_var(var):
     return None


-def get_related_valued_nodes(node: Apply, fgraph: FunctionGraph) -> list[Apply]:
+def get_related_valued_nodes(fgraph: FunctionGraph, node: Apply) -> list[Apply]:
```

ricardoV94 (Member, Author) commented:

This is the much more natural order, used in all sorts of pytensor utilities that require a node/variable and its fgraph.
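
As a rough illustration of that convention (a sketch with a made-up helper, not code from this PR), pytensor-style graph utilities take the graph first and the node second:

```python
# Sketch only: `count_node_inputs` is a hypothetical helper, shown just to
# illustrate the (fgraph, node) argument order used across pytensor utilities.
from pytensor.graph.basic import Apply
from pytensor.graph.fg import FunctionGraph

def count_node_inputs(fgraph: FunctionGraph, node: Apply) -> int:
    # Graph context first, then the node of interest within that graph
    return len(node.inputs)
```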

Comment on lines +277 to +285

```python
# In cases where DimShuffle transposes dimensions, we only apply this rewrite when only Elemwise
# operations separate it from the valued node. Further transformations likely need to know where
# the support axes are for a correct implementation (and thus assume they are the rightmost axes).
# TODO: When we include the support axis as meta information in each intermediate MeasurableVariable,
# we can lift this restriction (see https://github.com/pymc-devs/pymc/issues/6360)
if tuple(node.op.shuffle) != tuple(sorted(node.op.shuffle)) and not _elemwise_univariate_chain(
    fgraph, node
):
    return None
```

ricardoV94 (Member, Author) commented on Oct 18, 2024:

These DimShuffle changes were needed to naturally accommodate A @ x when x is a vector, which looks like:

```python
import pytensor.tensor as pt

A = pt.matrix("A")
x = pt.vector("x")
y = A @ x
y.dprint()
# DropDims{axis=1} [id A]
#  └─ Blockwise{dot, (m,k),(k,n)->(m,n)} [id B]
#     ├─ A [id C]
#     └─ ExpandDims{axis=1} [id D]
#        └─ x [id E]
```

It's also stricter / more correct than the limitation we had before, because the concerns are much more about what comes after the DimShuffle than about what comes before it.
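
As a hypothetical illustration of the allowed pattern (not from the PR; the value is arbitrary), a transposition separated from the valued variable by nothing but Elemwise operations should still admit a derived logprob:

```python
# Sketch only: a DimShuffle that transposes, followed exclusively by an
# Elemwise op (exp), so the restriction above does not block the rewrite.
import numpy as np
import pytensor.tensor as pt

import pymc as pm

x = pm.Normal.dist(size=(2, 3))
y = pt.exp(x.T)  # transpose -> Elemwise only
pm.logp(y, np.full((3, 2), 0.5)).sum().eval()
```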

jessegrabowski (Member) left a comment:

lgtm : )

ricardoV94 merged commit 1249c86 into pymc-devs:main on Oct 21, 2024; 18 of 20 checks passed.
ricardoV94 deleted the matmul branch on October 21, 2024 at 09:15.