
[WIP] Implement 1D to ND interpolation #3262

Closed · wants to merge 3 commits

Conversation

@nbren12 (Contributor) commented Aug 24, 2019

The tests cover:

1. index is halfway between coordinates
2. index is the same as old coordinates
3. ND index (see the sketch below)
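
For concreteness, a hedged sketch covering cases 1 and 3, written against xarray's public interp (which requires SciPy) rather than the helpers added in this PR; the test name, data, and values are illustrative only:

import numpy as np
import xarray as xr

def test_interp_onto_nd_target():
    # 1-D data; the target index is a 2-D DataArray whose values fall halfway
    # between the original coordinates
    da = xr.DataArray([0.0, 2.0, 4.0], dims="x", coords={"x": [0, 1, 2]})
    target = xr.DataArray([[0.5, 1.5], [1.5, 0.5]], dims=("a", "b"))
    actual = da.interp(x=target)
    np.testing.assert_allclose(actual.values, [[1.0, 3.0], [3.0, 1.0]])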
@pep8speaks commented Aug 24, 2019

Hello @nbren12! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

Line 670:5: F841 local variable 'old_coord' is assigned to but never used
Line 676:36: E226 missing whitespace around arithmetic operator
Line 676:51: E226 missing whitespace around arithmetic operator
Line 679:21: E226 missing whitespace around arithmetic operator
Line 693:31: E231 missing whitespace after ','
Line 711:1: F811 redefinition of unused 'test_interp_1d_nd_targ' from line 704
Line 718:1: E302 expected 2 blank lines, found 1
Line 729:1: W391 blank line at end of file

Comment last updated at 2020-06-10 23:33:02 UTC

@shoyer (Member) left a comment

Very nice!


def interp_nd(data, coords):
    for dim in coords:
        data = interp_1d(data, dim, coords[dim])

The downside of this approach (indexing separately along each dimension) is that it is potentially much more expensive than doing all indexing at once, e.g., if the result of indexing along multiple dimensions is only a single point.

I think it would be interesting to see if this could be done by indexing all dimensions at once 2**len(coords) times, with all combinations of bfill/ffill along all dimensions.

This would require modifying sel to support a dict of options for method indicating different indexing methods along different axes, along the lines of the proposal from #3223 for tolerance.
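
A rough sketch of just the combinatorial part, reusing the coords and selectors names from the snippets above; note that sel taking a per-dimension method dict is hypothetical, it is exactly the modification being proposed:

import itertools

dims = list(coords)
for methods in itertools.product(["ffill", "bfill"], repeat=len(dims)):
    per_dim_method = dict(zip(dims, methods))  # e.g. {"x": "ffill", "y": "bfill"}
    # hypothetically, with the proposed sel extension:
    # corner = data.sel(**selectors, method=per_dim_method)

This loop runs 2**len(dims) times, once per corner of the interpolation cell.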

upper = data.sel(**selectors, method='bfill')
lower_x = data[dim].sel(**selectors, method='ffill')
upper_x = data[dim].sel(**selectors, method='bfill')
weight = np.abs(vals - lower_x)/np.abs(upper_x-lower_x)

I would lean towards using the built-in abs() here, which is slightly more generic than the NumPy function.

@nbren12 (Contributor, Author) commented Aug 26, 2019

@shoyer Thanks for the comments. I was struggling to incorporate it into Dataset.interp since core.missing is pretty complicated. Would it be worth refactoring that module to clarify how interp calls are mapped to a given function? Also, most of the methods in interp work like Dataset -> Variables -> NumPy arrays, but the method you proposed above operates at the Dataset level, so it doesn't quite fit into core.missing.interp.

The interpolation code I was working with doesn't regrid the coordinates appropriately, so we would need to do that too.

@shoyer (Member) commented Aug 27, 2019

Feel free to refactor as you see fit, but it may still make sense to do indexing at the Variable rather than Dataset level. That potentially would let you avoid redundant operations on the entire Dataset object.

Take a look at the _localize() helper function in missing.py for an example of how to work with the underlying index. I think something like the following helper function could do the trick:

def linear_interp(var, indexes_coords):
    lower_indices = {}
    upper_indices = {}
    for dim, [x, new_x] in indexes_coords.items():
        index = x.to_index()
        # ideally should precompute these, rather than calling get_indexer_nd for each
        # variable separately
        lower_indices[dim] = get_indexer_nd(index, new_x.values, method="ffill")
        upper_indices[dim] = get_indexer_nd(index, new_x.values, method="bfill")
    result = 0
    for weight, indexes in ...  # need to compute weights and all lower/upper combinations
        result += weight * var.isel(**indexes)
    return result
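
For the part left as ... above, the per-dimension weights can be formed from the coordinate values at the ffill/bfill positions. A minimal NumPy sketch for a single ascending 1-D coordinate (the function name is illustrative, not part of the PR):

import numpy as np

def lower_upper_weights(x, new_x):
    # indices of the bracketing coordinate points
    lower = np.clip(np.searchsorted(x, new_x, side="right") - 1, 0, len(x) - 2)
    upper = lower + 1
    # linear weights: 1 - frac on the lower neighbour, frac on the upper one
    frac = (new_x - x[lower]) / (x[upper] - x[lower])
    return lower, upper, 1 - frac, frac

The loop over ... would then take the (index, weight) pairs for each dimension and iterate over their Cartesian product with itertools.product, multiplying the per-dimension weights for each of the 2**n corners.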

@nbren12 (Contributor, Author) commented Aug 27, 2019

Thanks so much for the help. This is a good learning experience for me.

That potentially would let you avoid redundant operations on the entire Dataset object.

Yes. This is where I got stuck TBH.

@crusaderky (Contributor) commented

For highly optimized interpolation of an N-dimensional array along any one dimension, see also https://xarray-extras.readthedocs.io/en/latest/api/interpolate.html

@shoyer (Member) commented Nov 2, 2019

One missing part of the algorithm I wrote in #3262 (comment) was looping over all index/weight combinations. I recently wrote a version of this for another project that might be a good starting point here:

import itertools

import numpy as np


def prod(items):
  out = 1
  for item in items:
    out *= item
  return out

def index_by_linear_interpolation(array, float_indices):
  all_indices_and_weights = []
  for origin in float_indices:
    lower = np.floor(origin)
    upper = np.ceil(origin)
    l_index = lower.astype(np.int32)
    u_index = upper.astype(np.int32)
    u_weight = origin - lower  # fractional part: weight on the upper neighbour
    l_weight = 1 - u_weight  # remaining weight on the lower neighbour
    all_indices_and_weights.append(
        ((l_index, l_weight), (u_index, u_weight))
    )

  out = 0
  for items in itertools.product(*all_indices_and_weights):
    indices, weights = zip(*items)
    # wrap out-of-range indices around (modular / periodic boundary handling)
    indices = tuple(index % size for index, size in zip(indices, array.shape))
    out += prod(weights) * array[indices]
  return out
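
A quick illustrative check of the helper above on a small array (the expected values are ordinary linear interpolation):

import numpy as np

array = np.array([10.0, 20.0, 30.0])
result = index_by_linear_interpolation(array, [np.array([0.25, 1.5])])
print(result)  # [12.5 25. ]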

@nbren12 (Contributor, Author) commented Nov 2, 2019

Unfortunately, I don't think I have much time now to contribute to a general-purpose solution leveraging xarray's built-in indexing, so feel free to add to or close this PR. To be successful, I would need to study xarray's indexing internals more, since I don't think this is as easily implemented as a routine calling DataArray methods. Some custom numba code I wrote fits in my brain much better, and is general enough for my purposes when wrapped with xr.apply_ufunc. I encourage someone else to pick up where I left off, or we could close this PR.
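
For reference, a hedged sketch of the apply_ufunc wrapping mentioned above, with np.interp standing in for the custom numba kernel (the wrapper name and signature are illustrative, not from this PR):

import numpy as np
import xarray as xr

def interp_1d_to_nd(da, dim, new_coord):
    # interpolate da along `dim` onto new_coord, which may be an N-D DataArray
    return xr.apply_ufunc(
        lambda y, x, xi: np.interp(xi, x, y),  # 1-D kernel, applied per target point
        da,
        da[dim],
        new_coord,
        input_core_dims=[[dim], [dim], []],
        vectorize=True,
    )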

@shoyer (Member) commented Nov 2, 2019 via email

@nbren12 (Contributor, Author) commented Dec 17, 2020

I'm going to close this since I won't be working on it any longer.

@nbren12 closed this Dec 17, 2020
Successfully merging this pull request may close these issues.

interp and reindex should work for 1d -> nd indexing