Quantile Delta Mapping #200

castelao · 2024-03-21T02:15:57Z

Implementing Quantile Delta Mapping correction.

Note that this implementation tries to stay as consistent as possible with the rest of the library, such as LinearCorrection.
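For readers unfamiliar with the method, here is a minimal sketch of Quantile Delta Mapping with empirical quantiles, in the spirit of Cannon et al. (2015). It is not the code in this PR (which ends up delegating the core calculation to rex); the function name `empirical_qdm` and its arguments are hypothetical and only illustrate the idea with plain NumPy.

```python
import numpy as np


def empirical_qdm(obs_hist, mod_hist, mod_fut, n_quantiles=51, relative=False):
    """Hypothetical helper: bias correct `mod_fut` with empirical QDM.

    All inputs are 1D arrays for a single site. `relative=True` assumes
    strictly positive data, since it divides by modeled historical values.
    """
    probs = np.linspace(0, 1, n_quantiles)
    # Empirical quantiles of the three distributions QDM needs.
    q_oh = np.quantile(obs_hist, probs)  # observed historical
    q_mh = np.quantile(mod_hist, probs)  # modeled historical
    q_mf = np.quantile(mod_fut, probs)   # modeled future (biased)

    # Non-exceedance probability of each future value within the
    # modeled future distribution.
    tau = np.interp(mod_fut, q_mf, probs)

    # Values at the same probability in the two historical distributions.
    mh_at_tau = np.interp(tau, probs, q_mh)
    oh_at_tau = np.interp(tau, probs, q_oh)

    if relative:
        # Preserve the relative change between modeled future and historical.
        return oh_at_tau * (mod_fut / mh_at_tau)
    # Preserve the absolute change between modeled future and historical.
    return oh_at_tau + (mod_fut - mh_at_tau)


# A constant-offset model with no trend should be mapped back onto the
# observations.
rng = np.random.default_rng(0)
obs_hist = rng.normal(10.0, 2.0, size=1000)
mod_hist = obs_hist + 2.0
corrected = empirical_qdm(obs_hist, mod_hist, mod_hist)
```

The two sanity checks that fall out of this sketch (identical datasets give no correction, and a constant model offset is removed) are the same kind of cases the tests in this PR describe.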

castelao · 2024-04-17T15:50:33Z

@bnb32, I added your rules for ruff (just the ignore list for now) to pyproject.toml. If you have a chance, please double-check it.

pyproject.toml

* Core distribution classes
  Handle analytical distributions using scipy or empirical ones.
* Initiating a new QuantileDeltaMapping
  An empty placeholder for now. Let's try a different approach.
* refactor: Reorganizing module
  Isolating QDM method since bias_calc was already getting too large.
* feat: Implementing from_fit() for EmpiricalDistribution
* Reducing default empirical quantiles to 20 chunks
* feat: QDM.get_base_data()
  Simplifies get_base_data().
* feat: Optional alternative handler for get_bias_data()
  Allows an alternative handler to deal with multiple bias datasets with the very same get_bias_data().
* doc: Example for EmpiricalDistribution.from_quantiles()
* feat: Custom QuantileDeltaMapping.__init__ to deal with biased future
  The QuantileDeltaMapping() requires a third dataset, the biased future, thus requiring a modified instantiation to receive such dataset.
* fix: Missing imports
* feat: QuantileDeltaMapping.run() to estimate distributions
  An MVP of empirical distribution estimates for historical observations, historical modeled, and future modeled data. Running in serial only.
* Renaming NT to NQ (Number of quantiles)
  Let's ignore for now the requirement on seasonal estimates.
* Prototype for saving quantiles
* fix: Missing imports
* Temporary solution for number of quantiles
* feat: _init_out()
  Following the standard in the library, pre-allocate out holder (currently a dictionary).
* Renaming output items
* Saving metadata for sampling method
  For now, hardcoded to linear only.
* fix: Using 'filename' here
* cleaning: output collector now created at __init__out()
* feat: bias_transforms.get_spatial_bc_quantiles()
  Just trying to mimic the linear calibration. It's not clear which indices are used to select and slice the quantiles. There is a weakness here: since the quantiles are estimated in a previous step, it lacks a lock to guarantee that the chosen coefficients file is the correct pair for the data to be corrected.
* feat: bias_transforms.get_spatial_bc_quantiles()
* feat: bias_transforms.local_qdm_bc_as_nparray()
  An interface to local_qdm_bc using np.array.
* feat: [MVP] bias_transforms.local_qdm_bc()
  The key aspects here are to minimize memory footprint and allow transparent concurrency.
* feat: bias_calc.QuantileDeltaMappingCorrection
  A different implementation for QDM that mimics LinearCorrection as much as possible.
* test: serial vs parallel
* test: basic run of QuantileDeltaMappingCorrection
* Requirements for testing
* Renaming test file
* clean: Unnecessary imports
* style: Arguments alignment
* style:
* test: Save distributions in a valid HDF5
* Avoiding xr.Dataset to conform with library
* Using QuantileDeltaMapping from rex
  DRY.
* QDM using rex's implementation
* fix: get_spatial_bc_quantiles() requires base dataset name
* feat: local_qdm_bc based on rex's QDM
* Getting distribution definitions from saved HDF5
* Removing module distribution
  By using rex's QDM we don't need to know about distributions here anymore.
* Removing my QDM
  I'm now using rex's QDM, so we don't need this anymore.
* Making QuantileDeltaMappingCorrection available in the lib
* clean: _quantile_delta_mapping() is not used anymore
  QDM core calculation moved to use rex.
* feat: Implementing DataHandler.qdm_bc()
  Keeping it as close as possible to .lin_bc().
* test: DataHandler.qdm_bc()
* style: Removing unused variables
* feat: Custom distributions
  Quantiles configuration is not hardcoded anymore, but defined when instantiating the class.
* fix: Must load n_quantiles before initializing 'out'
* fix: Left behind an `NQ` variable
* style: Matching the library style
* Setup black to follow the 79 chars
* style: Matching the library style
* doc: QuantileDeltaMappingCorrection()
* test: Refactoring common test dataset
* fix: typo
* doc: Extending documentation for __init_out__
* doc: Adding documentation to standard test dataset
* A testing sample without trend
* test: Saving some standard distribution params
  Helps to speed up tests. Re-use these standard params as much as possible and isolate other tests.
* test: Simplifying tests
  Re-use params if the goal is to test something else.
* test: Using pytest's tmp_path
  Reduces coding.
* test, fix: Handler doesn't accept Path, but string
* test: Simplifying test_handler_qdm_bc()
* doc: More documentation on tests
* test: identity QDM
* fix: Must copy reference or it was a softlink
* test: Constant model, offset with reference
* test: Standard setup should result in some correction
* clean: Unused import
* test: identity relative & absolute
  Both cases should result in no correction.
* Improving documentation
* doc: Expanding documentation for QuantileDeltaMappingCorrection
* doc: Constant model tests
* style: For now, a single line doc
* doc: More on QuantileDeltaMappingCorrection
* test, refactor: Just moving tests around
  The constant model is a more intuitive case and a good next case after identity.
* doc: __init_out__()
* doc: bias_transforms.local_qdm_bc()
* Initiating ruff to keep consistent with what is used here
* typo, doc: local_qdm_bc()
* test, doc: More description on the tests concepts
* test: test_bc_trend_same_hist()
* refactor: Renaming '*_CDF' to '*_params'
* Extending ruff's setup
* doc: QuantileDeltaMappingCorrection.get_qdm_params()
* doc: [WIP] run()
* style:
* fix: Removing empty line
* doc: Correct syntax to function
* fix: Exit context and return
* doc: Improving links to other resources (DataRetrievalBase)
* typo:
* refactor: Clarify transformations required to use rex
  Since rex assumes a different data structure and we use regular numpy arrays, we have to orient our data when sending, and re-orient it on the way back. This commit just makes these transformations a little easier to follow, at the price of a somewhat larger memory footprint.
* test: Adding range check as suggested by @grantbuster
* test: All finite or none, can't be both
* test: Downgrading scope to module level
* doc: Documenting qdm_bc()
* doc: Fixing link/reference to Cannon 2015
* doc: Improving QuantileDeltaMappingCorrection documentation
* doc: Using Reference
* doc: Improving documentation everywhere on bias_calc
* doc: QuantileDeltaMappingCorrection.run()
* doc: Minimalist example for local_qdm_bc()
* fix, doc: Wrong syntax for rst
* feat: _expand_paths()
  Used to expand (from wildcards) single or multiple paths.
* style: super-linter wasn't happy with lambda
* doc: Changing example to better illustrate possibilities
* Adding option no_trend to QDM
  This allows using the same procedure for an ordinary Delta Mapping, reproducing rex's QDM design.
* test: Increasing noise and changing offset
  The offset was too close to the bias offset, so this will help to distinguish between both.
* test: Using a normal random noise instead
* test, doc: Better info on the reference data
* test: test_qdm_transform_notrend()
* doc: A warning on the concept of no trend
* style:
* fix: Remove type hint
  The easiest way to allow running with older Python.
* style: Breaking single line in multiple steps
* meta: Adding info on datasets path
  As suggested by @grantbuster.
* fix: Misleading log statement
  It doesn't correct at this point; it only estimates the statistical distributions.
* fix: Making Python-3.8 happy (removing type)
* style: Combining multiple `isinstance`
* refactor: Isolating common part of get_factors()
* style:
* test: Validating get_spatial_bc_factors() transition
  I'm getting some strange errors testing locally. Let's check this.
* fix, test: Forgot to add equal_nan
* refactor: Diverting to _get_factors()
* doc: Documenting get_spatial_bc_quantiles()
* doc: Improving doc for get_spatial_bc_quantiles()
* clean: get_spatial_bc_factors()
* clean: get_spatial_bc_quantiles()
* Adding ruff rules to ignore
  Just copy-and-pasted @bnb32's definitions.
* refactor: Using rex's property to load distributions metadata

castelao requested review from grantbuster and bnb32 March 21, 2024 02:15

castelao self-assigned this Mar 21, 2024

castelao added 24 commits March 28, 2024 14:08

Core distribution classes

fbae2ac

Handle analytical distributions using scipy or empirical ones.

Initiating a new QuantileDeltaMapping

f7502fb

An empty placeholder for now. Let's try a different approach.

refactor: Reorganizing module

399f086

Isolating QDM method since bias_calc was already getting too large.

feat: Implementing from_fit() for EmpiricalDistribution

0d2661d

Reducing default empirical quantiles to 20 chunks

3a9e984

feat: QDM.get_base_data()

7b2faf1

Simplifies get_base_data().

feat: Optional alternative handler for get_bias_data()

d6d2fd1

Allows an alternative handler to deal with multiple bias datasets with the very same get_bias_data().

doc: Example for EmpiricalDistribution.from_quantiles()

f8550ba

feat: Custom QuantileDeltaMapping.__init__ to deal with biased future

8b97605

The QuantileDeltaMapping() requires a third dataset, the biased future, thus requiring a modified instantiation to receive such dataset.

fix: Missing imports

8a20f23

feat: QuantileDeltaMapping.run() to estimate distributions

73b3ebb

An MVP of empirical distribution estimates for historical observations, historical modeled, and future modeled data. Running in serial only.
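As a rough illustration of what a serial estimate of the three empirical distributions could look like, the sketch below loops over sites and fills a pre-allocated dictionary of quantile arrays. The function name `estimate_site_quantiles`, the dataset keys, and the `(time, sites)` layout are assumptions made for this example and are not taken from the PR's code.

```python
import numpy as np


def estimate_site_quantiles(datasets, n_quantiles=20):
    """Hypothetical helper: empirical quantiles per site, computed serially.

    `datasets` maps a name to a (time, sites) array; all arrays are assumed
    to share the same number of sites.
    """
    first = next(iter(datasets.values()))
    n_sites = first.shape[1]
    probs = np.linspace(0, 1, n_quantiles)

    # Pre-allocate one (sites, n_quantiles) holder per dataset.
    out = {
        name: np.full((n_sites, n_quantiles), np.nan) for name in datasets
    }

    # Serial loop: one site at a time, one dataset at a time.
    for site in range(n_sites):
        for name, values in datasets.items():
            out[name][site] = np.quantile(values[:, site], probs)
    return out


rng = np.random.default_rng(42)
datasets = {
    "obs_hist": rng.normal(10, 2, size=(365, 3)),  # historical observations
    "mod_hist": rng.normal(12, 2, size=(365, 3)),  # historical modeled
    "mod_fut": rng.normal(13, 2, size=(365, 3)),   # future modeled
}
params = estimate_site_quantiles(datasets)
```

Keeping each site independent like this is what would later make it straightforward to hand sites to parallel workers.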

Renaming NT to NQ (Number of quantiles)

c9a9ae5

Let's ignore for now the requirement on seasonal estimates.

Prototype for saving quantiles

11841e2

fix: Missing imports

ffad4d2

Temporary solution for number of quantiles

560ad6a

feat: _init_out()

45e4bb0

Following the standard in the library, pre-allocate out holder (currently a dictionary).

Renaming output items

02b9cad

Saving metadata for sampling method

dc3cd3c

For now, hardcoded to linear only.

fix: Using 'filename' here

41140c2

cleaning: output collector now created at __init__out()

f2254e2

feat: bias_transforms.get_spatial_bc_quantiles()

ca8ccb3

feat: bias_transforms.local_qdm_bc_as_nparray()

19bb727

An interface to local_qdm_bc using np.array .

feat: [MVP] bias_transforms.local_qdm_bc()

1e9c55b

The key aspects here are to minimize memory footprint and allow transparent concurrency.

castelao force-pushed the Gui/QDM branch from 8ac80ef to 4f9ae8b Compare March 28, 2024 20:08

castelao added 2 commits March 29, 2024 10:27

feat: bias_calc.QuantileDeltaMappingCorrection

daa8db5

A different implementation for QDM that mimics LinearCorrection as much as possible.

test: serial vs parallel

6c7e494

castelao added 3 commits April 16, 2024 11:04

doc: A warning on the concept of no trend

7aade75

style:

2dd904f

fix: Remove type hint

2b55f28

The easiest way to allow running with older Python.

castelao force-pushed the Gui/QDM branch from c6d7f7f to 2b55f28 Compare April 16, 2024 18:14

castelao added 15 commits April 16, 2024 12:18

style: Breaking single line in multiple steps

0611ba4

meta: Adding info on datasets path

fd270ad

As suggested by @grantbuster.

fix: Misleading log statement

51a45ec

It doesn't correct at this point; it only estimates the statistical distributions.

fix: Making Python-3.8 happy (removing type)

9d571d0

style: Combining multiple isinstance

7865d97

refactor: Isolating common part of get_factors()

907cec7

style:

6a7d733

test: Validating get_spatial_bc_factors() transition

a31abe8

I'm getting some strange errors testing locally. Let's check this.

fix, test: Forgot to add equal_nan

f9ebbf2
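For context on why `equal_nan` matters when comparing arrays in tests: NumPy treats NaN as unequal to itself by default, so two identical arrays containing NaN fail a plain `np.allclose` check. The arrays below are made up for illustration and are not the test data from this PR.

```python
import numpy as np

# Two identical arrays with a missing value (NaN) at the same position.
a = np.array([1.0, np.nan, 3.0])
b = a.copy()

# By default NaN != NaN, so the comparison fails.
default_result = np.allclose(a, b)

# With equal_nan=True, NaNs at matching positions are considered equal.
nan_aware_result = np.allclose(a, b, equal_nan=True)
```

Here `default_result` is `False` and `nan_aware_result` is `True`.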

refactor: Diverting to _get_factors()

50abc77

doc: Documenting get_spatial_bc_quantiles()

98a25c1

doc: Improving doc for get_spatial_bc_quantiles()

9b08db0

clean: get_spatial_bc_factors()

decc286

clean: get_spatial_bc_quantiles()

51bba0c

Adding ruff rules to ignore

20ab6b2

Just copy-and-pasted @bnb32's definitions.

refactor: Using rex's property to load distributions metadata

098030d

castelao force-pushed the Gui/QDM branch from fda1e5c to 098030d Compare April 17, 2024 15:53

castelao commented Apr 17, 2024

grantbuster approved these changes Apr 17, 2024

castelao merged commit b74d837 into main Apr 17, 2024
8 checks passed

castelao deleted the Gui/QDM branch April 17, 2024 22:12

castelao added a commit that referenced this pull request Apr 18, 2024

style: Fixing some extra blank lines

fd8f2f0

Missed those issues on PR #200. Somehow they didn't show up in the last check before committing it.

castelao added a commit that referenced this pull request Apr 18, 2024

Merge pull request #209 from NREL/fix/QDM_lint

f36cc10

Fixing minor issues left from PR #200

github-actions bot pushed a commit that referenced this pull request Apr 18, 2024

Merge pull request #209 from NREL/fix/QDM_lint

1eedefb

Fixing minor issues left from PR #200
