
lazily evaluate the integrator coefficients #311

Merged: 7 commits from coeff-cache into master on Mar 23, 2021

Conversation

@basnijholt (Member) commented Mar 19, 2021

Description

Currently, having the following in a script:

import adaptive
import gpytorch

breaks any Python process because of pytorch/pytorch#54063.

This is not a proper fix, but it avoids the problem.

However, rather than doing any math on import, I think it is better style to only evaluate the coefficients when required.

Checklist

  • Fixed style issues using pre-commit run --all (first install using pip install pre-commit)
  • pytest passed

Type of change

Check relevant option(s).

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • (Code) style fix or documentation update
  • This change requires a documentation update

@akhmerov (Contributor)

Introducing the class and requiring that consumers instantiate it makes the code harder to use.

What do you think about using module-level __getattr__ instead, see here?
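For reference, a minimal sketch of the PEP 562 mechanism, with illustrative names rather than adaptive's actual ones:

# lazy_module.py -- module-level __getattr__ (PEP 562, Python >= 3.7).
# __getattr__ is called only when normal attribute lookup on the module
# fails, so the expensive work runs on first access instead of on import.

def _compute_heavy_constant():
    return sum(i * i for i in range(10 ** 6))  # stand-in for the real work

def __getattr__(name):
    if name == "HEAVY_CONSTANT":
        return _compute_heavy_constant()
    raise AttributeError(f"module {__name__!r} has no attribute {name!r}")

Here import lazy_module stays cheap, while the first access to lazy_module.HEAVY_CONSTANT triggers the computation.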

@akhmerov (Contributor)

Additionally, I don't follow the reasoning for why not doing linear algebra on import is better style.

  • The new implementation is certainly harder to read, which is usually a sign of worse style.
  • Also, according to my rough benchmarks, the import time isn't adversely affected.
  • Since we are not mutating anything but the module namespace, there are no side-effects, which is the usual argument against doing stuff on import (pyplot I'm looking at you!)

Why do you claim it is better?

@basnijholt (Member, Author)

What do you think about using module-level __getattr__ instead, see here?

Two reasons why it might not be better.

  1. It's Python ≥3.7 only and we require ≥3.6.
  2. If we decide to cache the values (which we should), I think we would need to evaluate all the coefficients as many times as there are coefficients; see the sketch below.
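To illustrate the second point, if the cache were keyed on the attribute name, the full computation would rerun once per distinct coefficient. A sketch of that pitfall, with _compute_all_coefficients as a hypothetical stand-in for the real computation:

from functools import lru_cache

@lru_cache(maxsize=None)
def __getattr__(attr):
    # The cache key is `attr`, so the expensive computation below runs
    # once for every distinct coefficient name requested, not once in total.
    coefficients = _compute_all_coefficients()  # hypothetical helper
    return coefficients[attr]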

Since we are not mutating anything but the module namespace, there are no side-effects, which is the usual argument against doing stuff on import (pyplot I'm looking at you!)

This is not true, and it is the whole reason I would like to make this change. Calling a scipy function loads the BLAS/MKL implementation. If one then wants to import a package that was compiled against a different BLAS implementation, things can go wrong, as happens in pytorch/pytorch#54063.
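To sketch what deferring buys us (the helper name _invert_all is hypothetical):

def _invert_all(matrices):
    # Importing and calling scipy.linalg here means the BLAS/MKL shared
    # library is loaded when this function first runs, not at
    # `import adaptive`, so another package's BLAS can be loaded first.
    import scipy.linalg
    return [scipy.linalg.inv(m) for m in matrices]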

@akhmerov (Contributor)

It's Python ≥3.7 only and we require ≥3.6.

Fair, although the end of the main support window for 3.6 was 2018, and NEP 29 claims we're solidly in 3.7 territory.

On the one hand, I understand we don't want to make a minor release for this bug, and we wouldn't want to bump dependencies for a patch release. However, we are changing module attributes that are undocumented but also not underscored, so the current implementation could be a breaking change.

If we decide to cache the values (which we should), I think we would need to evaluate all the coefficients as many times as there are coefficients.

Something like this should work (or one out of many alternative implementations):

from functools import lru_cache

import numpy as np
import scipy.linalg

# calc_V and calc_bdef are assumed to be the helpers already defined in
# this module.

@lru_cache()  # note: bare @lru_cache (no parentheses) needs Python >= 3.8
def _coefficients():
    eps = np.spacing(1)

    # the nodes and Newton polynomials
    ns = (5, 9, 17, 33)
    xi = [-np.cos(np.linspace(0, np.pi, n)) for n in ns]

    # Make `xi` perfectly anti-symmetric, important for splitting the intervals
    xi = [(row - row[::-1]) / 2 for row in xi]

    # Compute the Vandermonde-like matrix and its inverse.
    V = [calc_V(x, n) for x, n in zip(xi, ns)]
    V_inv = list(map(scipy.linalg.inv, V))
    Vcond = [
        scipy.linalg.norm(a, 2) * scipy.linalg.norm(b, 2)
        for a, b in zip(V, V_inv)
    ]

    # Compute the shift matrices.
    T_left, T_right = [V_inv[3] @ calc_V((xi[3] + a) / 2, ns[3]) for a in [-1, 1]]

    # If the relative difference between two consecutive approximations is
    # lower than this value, the error estimate is considered reliable.
    # See section 6.2 of Pedro Gonnet's thesis.
    hint = 0.1

    # Smallest acceptable relative difference of points in a rule.  This was
    # chosen such that no artifacts are apparent in plots of (i, log(a_i)),
    # where a_i is the sequence of estimates of the integral value of an
    # interval and all its ancestors.
    min_sep = 16 * eps

    ndiv_max = 20

    # Set up the downdate matrix.
    k = np.arange(ns[3])
    alpha = np.sqrt((k + 1) ** 2 / (2 * k + 1) / (2 * k + 3))
    gamma = np.concatenate([[0, 0], np.sqrt(k[2:] ** 2 / (4 * k[2:] ** 2 - 1))])

    b_def = calc_bdef(ns)
    # Expose every local name defined above as a lazily computed attribute.
    return locals()

def __getattr__(attr):
    try:
        return _coefficients()[attr]
    except KeyError:
        raise AttributeError(f"module {__name__!r} has no attribute {attr!r}")
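Consumers would then see no difference at the call site; assuming this lives in adaptive/learner/integrator_coeffs.py, attribute access looks unchanged:

from adaptive.learner import integrator_coeffs

# The first attribute access runs _coefficients() once; lru_cache then
# serves every later access, of any coefficient, from the same dict.
xi = integrator_coeffs.xi
V_inv = integrator_coeffs.V_inv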

@basnijholt (Member, Author)

@akhmerov, that's a really good suggestion!

I've implemented that and opened #312 to bump our requirement to Python ≥ 3.7.

@basnijholt (Member, Author)

I have rebased this on top of the latest master (which has the Python ≥ 3.7 requirement) so all tests should now pass.

@codecov-io commented

Codecov Report

Merging #311 (839c232) into master (1e3d26a) will increase coverage by 0.06%.
The diff coverage is 97.50%.


@@            Coverage Diff             @@
##           master     #311      +/-   ##
==========================================
+ Coverage   80.50%   80.57%   +0.06%     
==========================================
  Files          35       35              
  Lines        4633     4638       +5     
  Branches      834      834              
==========================================
+ Hits         3730     3737       +7     
+ Misses        778      777       -1     
+ Partials      125      124       -1     
Impacted Files                           Coverage Δ
adaptive/learner/integrator_learner.py   91.42% <94.11%> (ø)
adaptive/learner/integrator_coeffs.py    95.55% <100.00%> (+0.26%) ⬆️
adaptive/tests/test_cquad.py             92.25% <100.00%> (ø)
adaptive/runner.py                       71.53% <0.00%> (+0.71%) ⬆️


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@basnijholt merged commit 39c8bf0 into master on Mar 23, 2021
@basnijholt deleted the coeff-cache branch on Mar 23, 2021 at 11:32