Regridding to larger grid not resulting in NaNs where no starting data #31

kjdoore · 2024-02-15T14:09:45Z

When regridding from a smaller spatial extent to a larger spatial extent, I would expect NaNs to be the resulting values in regions of the target grid where no data was present in the original data. This is the result when regridding using the methods that utilized xarray.interp (i.e., linear, cubic, nearest). However, this is not the case for conservative and most_common. I have included an example below.

import numpy as np
import xarray as xr
import xarray_regrid

grid = xarray_regrid.Grid(
    north=48,
    east=48,
    south=0,
    west=0,
    resolution_lat=8,
    resolution_lon=8,
)
target_ds = xarray_regrid.create_regridding_dataset(grid)

data = np.array(
    [
        [2, 2, 2, 2, 0, 0, 0, 0, 0, 0, 0],
        [2, 2, 0, 2, 0, 0, 0, 0, 0, 0, 0],
        [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
        [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
        [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
        [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
        [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
        [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
        [3, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1],
        [3, 3, 3, 3, 0, 0, 0, 0, 1, 1, 1],
        [3, 3, 0, 3, 0, 0, 0, 0, 1, 1, 1],
    ]
)
lat_coords = np.linspace(0, 40, num=11)
lon_coords = np.linspace(0, 40, num=11)

ds = xr.Dataset(data_vars={"lc": (["longitude", "latitude"], data)},
                coords={"longitude": (["longitude"], lon_coords),
                        "latitude": (["latitude"], lat_coords)},
                attrs={"test": "not empty"})

ds.regrid.conservative(target_ds, latitude_coord='latitude')
ds.regrid.most_common(target_ds)

The text was updated successfully, but these errors were encountered:

kjdoore · 2024-02-15T14:11:50Z

Similar to #14

BSchilperoort · 2024-02-15T15:31:37Z

This would be a good issue to fix. I think that the best way to implement this for the conservative regridder would be to compute a mask (only if the data cannot cover the target grid), and replace all values under that mask with np.nan

Doing this inside the actual routines is challenging, as we have to mask NaNs out some way to avoid the entire matrix becoming NaN.

kjdoore · 2024-02-15T16:19:04Z

Yeah, the masking could work. Were you thinking that this would be something that occurs after the regridding? Like a final step?

BSchilperoort · 2024-02-16T07:33:54Z

I think it's the most simple solution. It would be possible to reduce the target grid for regridding (so it fits the data) and then pad NaNs after regridding, but that's a bit more complex and possibly not worth the effort.

BSchilperoort linked a pull request Feb 29, 2024 that will close this issue

Regridding to larger grid results in NaNs outside of data range #33

Merged

BSchilperoort closed this as completed in #33 Feb 29, 2024

BSchilperoort mentioned this issue Jul 22, 2024

User should receive a warning when the given data set can not cover the target grid. #14

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regridding to larger grid not resulting in NaNs where no starting data #31

Regridding to larger grid not resulting in NaNs where no starting data #31

kjdoore commented Feb 15, 2024

kjdoore commented Feb 15, 2024 •

edited

Loading

BSchilperoort commented Feb 15, 2024

kjdoore commented Feb 15, 2024

BSchilperoort commented Feb 16, 2024

Regridding to larger grid not resulting in NaNs where no starting data #31

Regridding to larger grid not resulting in NaNs where no starting data #31

Comments

kjdoore commented Feb 15, 2024

kjdoore commented Feb 15, 2024 • edited Loading

BSchilperoort commented Feb 15, 2024

kjdoore commented Feb 15, 2024

BSchilperoort commented Feb 16, 2024

kjdoore commented Feb 15, 2024 •

edited

Loading