
Cleanup for Optimal Control Ops #1045

Merged
merged 18 commits into pymc-devs:main from jessegrabowski:lyapunov-jax on Oct 24, 2024

Conversation

jessegrabowski (Member) commented Oct 20, 2024

Description

  • Add blockwise support for optimal control ops (SolveDiscreteLyapunov, SolveContinuousLyapunov, SolveDiscreteARE); see the usage sketch after this list
  • Add rewrite to remove BilinearSolveDiscreteLyapunov in JAX mode. JAX can't call out to the necessary LAPACK functions, but we can still fall back to the direct method
  • Simplify tests, adding batched cases
  • Add/update typehints
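
A rough usage sketch of the new batched behavior (illustrative shapes and values only; the method="direct" keyword refers to the existing solve_discrete_lyapunov helper, and this is not code taken from the PR):

    import numpy as np
    import pytensor
    import pytensor.tensor as pt
    from pytensor.tensor.slinalg import solve_discrete_lyapunov

    # Batched inputs: a leading batch dimension of 5 over 3x3 core matrices
    A = pt.tensor3("A")
    Q = pt.tensor3("Q")

    # With Blockwise support, the solve broadcasts over the leading batch dimension
    X = solve_discrete_lyapunov(A, Q, method="direct")
    fn = pytensor.function([A, Q], X)

    A_val = 0.1 * np.random.normal(size=(5, 3, 3))  # small entries keep the equation well-posed
    Q_val = np.eye(3)[None].repeat(5, axis=0)
    print(fn(A_val, Q_val).shape)  # (5, 3, 3)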

Related Issue

  • Closes #
  • Related to #

Checklist

Type of change

  • New feature / enhancement
  • Bug fix
  • Documentation
  • Maintenance
  • Other (please specify):

📚 Documentation preview 📚: https://pytensor--1045.org.readthedocs.build/en/1045/

jessegrabowski (Member, Author) commented Oct 20, 2024

The JAX backend doesn't like my new batch-compatible _direct_solve_discrete_lyapunov, I guess because of these lines:

    vec_q_shape = pt.concatenate([Q.shape[:-2], [-1]])
    vec_Q = Q.reshape(vec_q_shape)

I initially tried just wrapping the core case in pt.vectorize, but I hit shape errors going that route. Any suggestions?
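
For context, the direct method is essentially the standard vec/Kronecker trick; here is a minimal NumPy sketch of the unbatched core case (an illustrative reimplementation, not the actual PyTensor code):

    import numpy as np

    def direct_solve_discrete_lyapunov(A, Q):
        """Solve A X A^H - X + Q = 0 by flattening with the vec trick."""
        n = A.shape[-1]
        # With row-major (C-order) flattening, vec(A X A^H) = kron(A, conj(A)) @ vec(X),
        # so the equation becomes (I - kron(A, conj(A))) vec(X) = vec(Q).
        lhs = np.eye(n * n) - np.kron(A, A.conj())
        vec_X = np.linalg.solve(lhs, Q.reshape(-1))
        return vec_X.reshape(n, n)

    A = 0.5 * np.random.normal(size=(4, 4))
    Q = np.eye(4)
    X = direct_solve_discrete_lyapunov(A, Q)
    print(np.allclose(A @ X @ A.conj().T - X + Q, 0))  # True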

ricardoV94 (Member) commented Oct 20, 2024

> The JAX backend doesn't like my new batch-compatible _direct_solve_discrete_lyapunov, I guess because of these lines:
>
>     vec_q_shape = pt.concatenate([Q.shape[:-2], [-1]])
>     vec_Q = Q.reshape(vec_q_shape)
>
> I initially tried just wrapping the core case in pt.vectorize, but I hit shape errors going that route. Any suggestions?

May need something for Reshape like we do for the size argument of RVs (check size_tuple Op or whatever it is)

jessegrabowski (Member, Author)

I fixed the JAX problem by doing what I should have been doing in the first place: implementing the core case and then using Blockwise on it.

Unfortunately, this had the side effect of breaking gradients for the regular pytensor backend. Something is failing in rewrites; I think a useless blockwise isn't getting rewritten away.
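
Roughly, the core-case-plus-Blockwise approach looks like this (a hypothetical sketch: the explicit signature argument and the zero-argument SolveContinuousLyapunov() constructor are assumptions, and the public helpers are expected to do this wrapping for you):

    import numpy as np
    import pytensor
    import pytensor.tensor as pt
    from pytensor.tensor.blockwise import Blockwise
    from pytensor.tensor.slinalg import SolveContinuousLyapunov

    # The core Op only solves the (m, m) matrix case; Blockwise adds the batch loop
    solve_lyap = Blockwise(SolveContinuousLyapunov(), signature="(m,m),(m,m)->(m,m)")

    A = pt.tensor3("A")  # (batch, m, m)
    Q = pt.tensor3("Q")
    X = solve_lyap(A, Q)

    fn = pytensor.function([A, Q], X)
    A_val = -np.eye(3)[None].repeat(5, axis=0)  # a stable A for every batch member
    Q_val = np.eye(3)[None].repeat(5, axis=0)
    print(fn(A_val, Q_val).shape)  # (5, 3, 3)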

ricardoV94 (Member)

Sounds like progress, can you show the gradient graph that's failing?

jessegrabowski (Member, Author)

Graph:

Sum{axes=None} [id A] <Scalar(float64, shape=())> 10
 └─ Mul [id B] <Tensor3(float64, shape=(5, ?, ?))> 9
    ├─ random_projection [id C] <Tensor3(float64, shape=(?, ?, ?))>
    └─ SpecifyShape [id D] <Tensor3(float64, shape=(5, ?, ?))> 8
       ├─ Reshape{3} [id E] <Tensor3(float64, shape=(?, ?, ?))> 7
       │  ├─ Blockwise{Solve{assume_a='gen', lower=False, check_finite=True, b_ndim=1, overwrite_a=False, overwrite_b=False}, (m,m),(m)->(m)} [id F] <Matrix(float64, shape=(5, ?))> 6
       │  │  ├─ Sub [id G] <Tensor3(float64, shape=(5, ?, ?))> 5
       │  │  │  ├─ ExpandDims{axis=0} [id H] <Tensor3(float64, shape=(1, ?, ?))> 4
       │  │  │  │  └─ Eye{dtype='float64'} [id I] <Matrix(float64, shape=(?, ?))> 3
       │  │  │  │     ├─ Shape_i{2} [id J] <Scalar(int64, shape=())> 2
       │  │  │  │     │  └─ Blockwise{KroneckerProduct{inline=False}, (i00,i01),(i10,i11)->(o00,o01)} [id K] <Tensor3(float64, shape=(5, ?, ?))> 1
       │  │  │  │     │     ├─ input 0 [id L] <Tensor3(float64, shape=(5, 5, 5))>
       │  │  │  │     │     └─ input 0 [id L] <Tensor3(float64, shape=(5, 5, 5))>
       │  │  │  │     ├─ Shape_i{2} [id J] <Scalar(int64, shape=())> 2
       │  │  │  │     │  └─ ···
       │  │  │  │     └─ 0 [id M] <Scalar(int8, shape=())>
       │  │  │  └─ Blockwise{KroneckerProduct{inline=False}, (i00,i01),(i10,i11)->(o00,o01)} [id K] <Tensor3(float64, shape=(5, ?, ?))> 1
       │  │  │     └─ ···
       │  │  └─ Reshape{2} [id N] <Matrix(float64, shape=(?, ?))> 0
       │  │     ├─ input 1 [id O] <Tensor3(float64, shape=(5, 5, 5))>
       │  │     └─ [ 5 -1] [id P] <Vector(int64, shape=(2,))>
       │  └─ [5 5 5] [id Q] <Vector(int64, shape=(3,))>
       ├─ 5 [id R] <Scalar(int8, shape=())>
       ├─ NoneConst{None} [id S] <NoneTypeT>
       └─ NoneConst{None} [id S] <NoneTypeT>

Error:

ERROR    pytensor.graph.rewriting.basic:basic.py:1746 Rewrite failure due to: local_blockwise_alloc
ERROR    pytensor.graph.rewriting.basic:basic.py:1747 node: Blockwise{Reshape{1}, (i00,i01),(i10)->(o00)}(SpecifyShape.0, Alloc.0)
ERROR    pytensor.graph.rewriting.basic:basic.py:1748 TRACEBACK:
ERROR    pytensor.graph.rewriting.basic:basic.py:1749 Traceback (most recent call last):
  File "/Users/jessegrabowski/Documents/Python/pytensor/pytensor/graph/rewriting/basic.py", line 1909, in process_node
    replacements = node_rewriter.transform(fgraph, node)
                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/jessegrabowski/Documents/Python/pytensor/pytensor/graph/rewriting/basic.py", line 1081, in transform
    return self.fn(fgraph, node)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/Users/jessegrabowski/Documents/Python/pytensor/pytensor/tensor/rewriting/blockwise.py", line 176, in local_blockwise_alloc
    new_outs = node.op.make_node(*new_inputs).outputs
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/jessegrabowski/Documents/Python/pytensor/pytensor/tensor/blockwise.py", line 130, in make_node
    core_node = self._create_dummy_core_node(inputs)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/jessegrabowski/Documents/Python/pytensor/pytensor/tensor/blockwise.py", line 103, in _create_dummy_core_node
    raise ValueError(
ValueError: Input 1 DropDims{axis=0}.0 has insufficient core dimensions for signature (i00,i01),(i10)->(o00)

ricardoV94 (Member)

Looking at the rewrite bug


codecov bot commented Oct 21, 2024

Codecov Report

Attention: Patch coverage is 95.91837% with 2 lines in your changes missing coverage. Please review.

Project coverage is 81.93%. Comparing base (dae731d) to head (89d5fd0).
Report is 88 commits behind head on main.

Files with missing lines Patch % Lines
pytensor/tensor/slinalg.py 95.00% 1 Missing and 1 partial ⚠️
Additional details and impacted files


@@            Coverage Diff             @@
##             main    #1045      +/-   ##
==========================================
+ Coverage   81.90%   81.93%   +0.02%     
==========================================
  Files         182      182              
  Lines       47872    47890      +18     
  Branches     8617     8617              
==========================================
+ Hits        39210    39239      +29     
+ Misses       6489     6481       -8     
+ Partials     2173     2170       -3     
Files with missing lines Coverage Δ
pytensor/tensor/rewriting/blockwise.py 96.40% <100.00%> (ø)
pytensor/tensor/rewriting/linalg.py 91.37% <100.00%> (+0.44%) ⬆️
pytensor/tensor/slinalg.py 93.47% <95.00%> (+1.49%) ⬆️

... and 1 file with indirect coverage changes

ricardoV94 (Member) left a comment

Small point about the casting in the perform method. Otherwise looks great

jessegrabowski (Member, Author)

I had to go back to tracking Blockwise in the rewrite. I found that when I instantiated the Op once, as in:

    _solve_lyapunov = Blockwise(SolveLyapunov)

the node.outputs[0].type.dtype was being "frozen" at the first call. If I did float64 then complex128, it was downcasting the complex output to float64. If I remake the node each time the function wrapper is called, I don't have this problem.

ricardoV94 (Member) commented Oct 22, 2024

> I had to go back to tracking Blockwise in the rewrite. I found that when I instantiated the Op once, as in:
>
>     _solve_lyapunov = Blockwise(SolveLyapunov)
>
> the node.outputs[0].type.dtype was being "frozen" at the first call. If I did float64 then complex128, it was downcasting the complex output to float64. If I remake the node each time the function wrapper is called, I don't have this problem.

I'm confused. The make_node of an Op has the flexibility (and responsibility) to create variables of any type it wants; it's not frozen unless you wrote it that way?

So you shouldn't even have to cast the output if you could predict it correctly at the time make_node is called (based on the input types). Several Ops go the lazy way and just call the scipy/numpy function with the smallest input possible and get the output type from there.
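
For example, a minimal sketch of that lazy pattern (illustrative placeholder Op and signature, not the code merged in this PR):

    import numpy as np
    import scipy.linalg
    import pytensor.tensor as pt
    from pytensor.graph.basic import Apply
    from pytensor.graph.op import Op

    class SolveDiscreteLyapunovSketch(Op):
        # gufunc-style signature that Blockwise can use for the batched case
        gufunc_signature = "(m,m),(m,m)->(m,m)"

        def make_node(self, A, Q):
            A = pt.as_tensor_variable(A)
            Q = pt.as_tensor_variable(Q)
            # "Lazy" dtype inference: call the scipy routine on the smallest
            # possible inputs, so complex128 inputs give a complex128 output
            # instead of being downcast to whatever an earlier call produced
            out_dtype = scipy.linalg.solve_discrete_lyapunov(
                np.zeros((1, 1), dtype=A.type.dtype),
                np.ones((1, 1), dtype=Q.type.dtype),
            ).dtype
            X = pt.matrix(dtype=out_dtype)
            return Apply(self, [A, Q], [X])

        def perform(self, node, inputs, output_storage):
            A, Q = inputs
            X = scipy.linalg.solve_discrete_lyapunov(A, Q)
            # Cast to the dtype promised by make_node, in case scipy returns
            # a different precision than the one advertised there
            output_storage[0][0] = X.astype(node.outputs[0].type.dtype)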

jessegrabowski merged commit fffb84c into pymc-devs:main Oct 24, 2024
60 of 61 checks passed
jessegrabowski deleted the lyapunov-jax branch October 24, 2024 11:45
Ch0ronomato pushed a commit to Ch0ronomato/pytensor that referenced this pull request Nov 2, 2024
* Blockwise optimal linear control ops

* Add jax rewrite to eliminate `BilinearSolveDiscreteLyapunov`

* set `solve_discrete_lyapunov` method default to bilinear

* Appease mypy

* restore method dispatching

* Use `pt.vectorize` on base `solve_discrete_lyapunov` case

* Apply JAX rewrite before canonicalization

* Improve tests

* Remove useless warning filters

* Fix local_blockwise_alloc rewrite

The rewrite was squeezing too many dimensions of the alloced value, when this didn't have dummy expand dims to the left.

* Fix float32 tests

* Test against complex inputs

* Appease ViPy (Vieira-py type checking)

* Remove condition from `TensorLike` import

* Infer dtype from `node.outputs.type.dtype`

* Remove unused mypy ignore

* Don't manually set dtype of output

Revert change to `_solve_discrete_lyapunov`

* Set dtype of Op outputs

---------

Co-authored-by: ricardoV94 <[email protected]>
ricardoV94 removed the enhancement (New feature or request) label Nov 8, 2024