
Add RecursiveLinearTransform for linear state space models. #1766

Merged: 10 commits merged into pyro-ppl:master on Mar 25, 2024

Conversation

@tillahoffmann (Contributor) commented Mar 20, 2024

This PR adds a RecursiveLinearTransform, which is a linear transformation applied recursively such that $y_t = A y_{t - 1} + x_t$ for $t > 0$, where $x_t$ and $y_t$ are $p$-vectors and $A$ is a $p\times p$ transition matrix. The series is initialized by $y_0 = 0$.
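
Unrolling the recursion for the first few steps (with $y_0 = 0$) makes the structure explicit: $y_1 = x_1$, $y_2 = A x_1 + x_2$, $y_3 = A^2 x_1 + A x_2 + x_3$, and in general $y_t = \sum_{s=1}^{t} A^{t - s} x_s$, so each state is a matrix-weighted sum of all inputs up to time $t$.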

This transform can be used to easily declare linear state space models, e.g., a Cauchy random walk is

>>> from jax import random
>>> from jax import numpy as jnp
>>> import numpyro
>>> from numpyro import distributions as dist
>>>
>>> def cauchy_random_walk():
...     return numpyro.sample(
...         "x",
...         dist.TransformedDistribution(
...             dist.Cauchy(0, 1).expand([10, 1]).to_event(1),
...             dist.transforms.RecursiveLinearTransform(jnp.eye(1)),
...         ),
...     )

A Kalman-style model for a rocket with state y = (position, velocity) is

>>> def rocket_trajectory():
...     scale = numpyro.sample(
...         "scale",
...         dist.HalfCauchy(1).expand([2]).to_event(1),
...     )
...     transition_matrix = jnp.array([[1, 1], [0, 1]])
...     return numpyro.sample(
...         "x",
...         dist.TransformedDistribution(
...             dist.Normal(0, scale).expand([10, 2]).to_event(1),
...             dist.transforms.RecursiveLinearTransform(transition_matrix),
...         ),
...     )
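
A minimal usage sketch (not part of the PR description; it assumes the model above and the standard seed handler) for drawing a single trajectory, which has shape (10, 2) given the expansion used in the example:

>>> with numpyro.handlers.seed(rng_seed=0):
...     trajectory = rocket_trajectory()
>>> trajectory.shape
(10, 2)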

This PR also makes a few minor changes (happy to factor these out if you prefer); they are reflected in the commit list below.

Review thread on the docstring excerpt:

    are vectors and :math:`A` is a transition matrix. The series is initialized by
    :math:`y_0 = 0`.

    :param transition_matrix: Transition matrix :math:`A` for successive states.

Member:

Maybe 'matrix' -> 'square matrix' for clarity. Currently, the bias x is time-dependent, but the matrix is constant. Do you plan to make the class name more verbose to reflect that?

Contributor (author):

> Maybe 'matrix' -> 'square matrix' for clarity.

👍

> Currently, the bias x is time-dependent, but the matrix is constant. Do you plan to make the class name more verbose to reflect that?

Sorry, I explained that poorly. The x here is the argument of the transform rather than a bias (as in AffineTransform, for example). Keeping the transition matrix constant means that the transform can be applied to sequences of arbitrary length.
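
For illustration (a sketch, not from the comment itself): because the matrix only acts on the trailing state dimension, one transform instance handles sequences of any length.

>>> transform = dist.transforms.RecursiveLinearTransform(jnp.eye(2))
>>> transform(jnp.ones((5, 2))).shape
(5, 2)
>>> transform(jnp.ones((20, 2))).shape
(20, 2)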

Member:

x is the (time-dependent) bias (or noise, if we place a normal distribution over it) of a linear dynamical model. Typically, people use other notation, something like x_t = A x_{t-1} + b_t.

Contributor (author):

Fair. I was using the x/y notation here to stick with the typical arguments of the Transform classes. Do you have an idea for a more explanatory name?

Member:

The name looks 👌 to me. :)

@@ -1347,26 +1347,25 @@ def __call__(self, x: jnp.ndarray) -> jnp.ndarray:
         x = jnp.moveaxis(x, -2, 0)

         def f(y, x):
-            y = (self.transition_matrix * y[..., None, :]).sum(axis=-1) + x
+            y = y @ self.transition_matrix.T + x

Member:

I think you might want jnp.swapaxes(self.transition_matrix, -1, -2)
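
(For context, a quick shape check, not part of the original comment: `.T` reverses all axes of a batched array, whereas jnp.swapaxes only exchanges the last two, which is what a batch of transition matrices needs.)

>>> A = jnp.ones((3, 4, 5))
>>> A.T.shape
(5, 4, 3)
>>> jnp.swapaxes(A, -1, -2).shape
(3, 5, 4)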

Contributor (author):

I've updated it to use einsum because we have shapes (..., p, p) for the transition matrices A and (..., n, p) for the states x. Inside the scan function, we are dealing with a state of shape (..., p) because we're scanning along the n dimension. There may very well be a better way to do this.
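
A rough sketch of that einsum-based scan (illustrative names, not the exact PR code); the leading batch dimensions of the transition matrix broadcast against the carried state:

from jax import lax
from jax import numpy as jnp

def forward(transition_matrix, x):
    # x has shape (..., n, p): move the time axis to the front so lax.scan
    # iterates over it, leaving a carry of shape (..., p).
    x = jnp.moveaxis(x, -2, 0)

    def f(y, xt):
        # transition_matrix has shape (..., p, p); contract its last axis with
        # the state while broadcasting any leading batch dimensions.
        y = jnp.einsum("...ij,...j->...i", transition_matrix, y) + xt
        return y, y

    _, ys = lax.scan(f, jnp.zeros_like(x[0]), x)
    return jnp.moveaxis(ys, 0, -2)

# e.g. a batch of 3 independent 2-dimensional chains of length 10:
# forward(jnp.ones((3, 2, 2)), jnp.ones((3, 10, 2))).shape == (3, 10, 2)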

-            x = y_t - (self.transition_matrix * y_tm1[..., None, :]).sum(axis=-1)
-            return y_tm1, x
+        def f(y, prev):
+            x = y - prev @ self.transition_matrix.T

Member:

similarly, jnp.swapaxes(self.transition_matrix, -1, -2)


         _, x = lax.scan(f, y[-1], jnp.roll(y, 1, axis=0).at[0].set(0), reverse=True)
         return jnp.moveaxis(x, 0, -2)

     def log_abs_det_jacobian(self, x: jnp.ndarray, y: jnp.ndarray, intermediates=None):
-        slogdet = jnp.linalg.slogdet(self.transition_matrix)
-        return jnp.broadcast_to(slogdet.logabsdet, x.shape[:-2]) * x.shape[-2]
+        return jnp.zeros_like(x, shape=x.shape[:-2])

Member:

Sounds reasonable to me; this is sort of a shear transformation, so the Jacobian determinant is 1.

Contributor (author):

Yes, because of the temporal nature of the transform, the Jacobian is triangular. Because x only appears additively, the diagonal entries are all one, which gives the unit Jacobian determinant.
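
A quick numerical check along these lines (a sketch only; the shapes and transition matrix are chosen for illustration): flattening a length-5, 2-dimensional sequence gives a 10x10 Jacobian whose log-determinant should come out as zero.

import jax
from jax import numpy as jnp
from numpyro.distributions.transforms import RecursiveLinearTransform

transform = RecursiveLinearTransform(jnp.array([[1.0, 1.0], [0.0, 1.0]]))
x = jax.random.normal(jax.random.PRNGKey(0), (5, 2))
# Jacobian of the flattened map R^10 -> R^10.
jacobian = jax.jacobian(lambda flat: transform(flat.reshape(5, 2)).ravel())(x.ravel())
# Block lower-triangular with identity blocks on the diagonal, so logabsdet ~= 0.
print(jnp.linalg.slogdet(jacobian).logabsdet)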

@fehiepsi (Member):
Thanks, @tillahoffmann!

@fehiepsi merged commit ad6861a into pyro-ppl:master on Mar 25, 2024
4 checks passed
@tillahoffmann deleted the recursive-linear branch on March 25, 2024 at 16:54
OlaRonning pushed a commit to aleatory-science/numpyro that referenced this pull request on May 6, 2024:
Add RecursiveLinearTransform for linear state space models. (pyro-ppl#1766)

* Format reparam module to comply with style guide.

* Add `RealFastFourierTransform` to documentation.

* Ignore `venv` directory for `update_headers.py` script.

* Ignore autogenerated documentation sources.

* Add numerical Jacobian check for bijective transforms.

* Add `RecursiveLinearTransform`.

* Use matrix multiplication operator and fix Jacobian.

* Use non-trivial transition matrix in test.

* Specify that transition matrices must be (batches of) square matrices.

* Fix `scan` implementation for batched transition matrices and add test.