Validate the model when cannot find valid initial params. #733

fehiepsi · 2020-09-10T23:32:32Z

Resolves #731. As explained there, bugs can be tricky to find when inference cannot find valid initial parameters. With this PR, when that happens, the model is validated so that we can see at which site things go wrong.

@rim30 could you run your code with this branch to see where the problem occurs?

TODO

Incorporate init_strategy, because some distributions, such as Improper, do not have a sample method.

rim30 · 2020-09-18T12:41:40Z

Hi @fehiepsi ,

Sorry for the massive delay, I was a bit busy until today. So I installed numpyro from your branch (https://github.com/fehiepsi/numpyro.git@validate) and got this error:

The time series that I am using is: [1146., 488., 753., 583., 553., 832., 807., 875.,
945., 795., 862., 1322., 890., 911., 990., 791.,
910., 957., 838., 956., 920., 945., 1192., 1921.,
987., 907., 762., 859., 843., 804., 785., 942.,
822., 727.], which, as you can clearly see, contains no negative values.

Quick note: I used this simpler version of SGT, which I modelled after the one from the NumPyro website:

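import jax.numpy as jnp
import numpyro
import numpyro.distributions as dist
from numpyro.contrib.control_flow import scan


def simple_sgt(y, seasonality, future=0):
    # heuristically, standard deviation of Cauchy prior depends on
    # the max value of data
    cauchy_sd = jnp.max(y) / 150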

    # NB: priors' parameters are taken from
    # https://github.com/cbergmeir/Rlgt/blob/master/Rlgt/R/rlgtcontrol.R
    nu = numpyro.sample("nu", dist.Uniform(2, 20))
    powx = numpyro.sample("powx", dist.Uniform(0, 1))
    sigma = numpyro.sample("sigma", dist.HalfCauchy(cauchy_sd))
    offset_sigma = numpyro.sample(
        "offset_sigma", dist.TruncatedCauchy(low=1e-10, loc=1e-10, scale=cauchy_sd)
    )

    coef_trend = numpyro.sample("coef_trend", dist.Cauchy(0, cauchy_sd))
    pow_trend_beta = numpyro.sample("pow_trend_beta", dist.Beta(1, 1))
    # pow_trend takes values from -0.5 to 1
    pow_trend = 1.5 * pow_trend_beta - 0.5
    #pow_season = numpyro.sample("pow_season", dist.Beta(1, 1))

    level_sm = numpyro.sample("level_sm", dist.Beta(1, 2))
    s_sm = numpyro.sample("s_sm", dist.Uniform(0, 1))
    init_s = numpyro.sample("init_s", dist.Cauchy(0, y[:seasonality] * 0.3))

    def transition_fn(carry, t):
        #level, s, moving_sum = carry
        level, s = carry
        #season = s[0] * level ** pow_season
        #exp_val = level + coef_trend * level ** pow_trend + season
        exp_val = (level + coef_trend * level ** pow_trend) * s[0]
        exp_val = jnp.clip(exp_val, a_min=0)
        # use expected value when forecasting
        y_t = jnp.where(t >= N, exp_val, y[t])

        #moving_sum = (
        #    moving_sum + y[t] - jnp.where(t >= seasonality, y[t - seasonality], 0.0)
        #)
        #level_p = jnp.where(t >= seasonality, moving_sum / seasonality, y_t - season)
        #level = level_sm * level_p + (1 - level_sm) * level
        level = level_sm * y_t / s[0] + (1 - level_sm) * level
        level = jnp.clip(level, a_min=0)

        #new_s = (s_sm * (y_t - level) / season + (1 - s_sm)) * s[0]
        new_s = (s_sm * y_t / level + (1 - s_sm)) * s[0]
        # repeat s when forecasting
        new_s = jnp.where(t >= N, s[0], new_s)
        s = jnp.concatenate([s[1:], new_s[None]], axis=0)

        omega = sigma * exp_val ** powx + offset_sigma
        y_ = numpyro.sample("y", dist.StudentT(nu, exp_val, omega))

        #return (level, s, moving_sum), y_
        return (level, s), y_

    N = y.shape[0]
    level_init = y[0]
    s_init = jnp.concatenate([init_s[1:], init_s[:1]], axis=0)
    #moving_sum = level_init
    with numpyro.handlers.condition(data={"y": y[1:]}):
        _, ys = scan(
            transition_fn, (level_init, s_init), jnp.arange(1, N + future)
        )
    if future > 0:
        numpyro.deterministic("y_forecast", ys[-future:])

fehiepsi · 2020-09-18T17:12:15Z

@rim30 How about replacing exp_val = jnp.clip(exp_val, a_min=0) with the following?

exp_val = jnp.clip(exp_val, a_min=1e-30, a_max=1e38)

Our validation code can hardly detect numerical issues...
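The difference between the two clipping calls can be seen on a small array. This is only a sketch: it uses NumPy rather than `jax.numpy`, and the input values are made up for illustration. The thread does not state which value actually caused the failure.

```python
import numpy as np

# Made-up values: negative, zero, a typical observation, and an overflowed one.
exp_val = np.array([-5.0, 0.0, 1146.0, np.inf], dtype=np.float32)

# Lower bound only: zeros and infinities survive the clip.
lower_only = np.clip(exp_val, 0.0, None)

# Both bounds: every entry ends up strictly positive and finite in float32.
bounded = np.clip(exp_val, 1e-30, 1e38)

print(lower_only)
print(bounded)
```

With only a lower bound of 0, later operations on the result can still meet an exact zero or an infinity, whereas the bounded version keeps every entry inside the finite float32 range.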

neerajprad · 2020-09-22T04:57:52Z

numpyro/infer/util.py

@@ -427,6 +427,21 @@ def initialize_model(rng_key, model,

    if not_jax_tracer(is_valid):
        if device_get(~jnp.all(is_valid)):
+            with numpyro.validation_enabled(), trace() as tr:


A more informative warning / error message is definitely needed. I am wondering whether we can simply run initialize_model with validation_enabled (I wouldn't expect that to add any material overhead). Is the resulting warning message not informative enough?

Yes, it is simpler (I also don't worry about the overhead), but there are two issues with that:

Validation is only useful for the first try (under a JAX loop, we can't emit the warning/error for the later tries).

Displaying the warning message for the first try might not be useful to users when a valid set of initial parameters is found in a later try.

What do you think?

Thanks for explaining, @fehiepsi, both of your points make a lot of sense.

neerajprad · 2020-09-22T06:23:46Z

numpyro/infer/util.py

+                            for w in ws:
+                                # add site information to the warning message
+                                w.message.args = ("Site {}: {}".format(site["name"], w.message.args[0]),) \
+                                    + w.message.args[1:]


What does a sample warning message look like?

Thanks for reviewing, @neerajprad! I'm a bit busy today but will run this again tomorrow to show the warning message. Here I just want to add site information to the warning message, because not many users know how to use the warnings module to turn a warning into an error for debugging.
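The idea of attaching the site name to a warning can be sketched with the standard `warnings` module alone. This is a minimal illustration, not the code in this PR: `check_value`, `validate_sites`, and the `sites` dictionary are hypothetical stand-ins for a distribution's validation method and a model trace.

```python
import math
import warnings


def check_value(value):
    # Hypothetical validator: warn when the value is NaN.
    if math.isnan(value):
        warnings.warn("Out-of-support values provided to log prob method.")


def validate_sites(sites):
    """Run the validator per site and prefix each warning with the site name."""
    tagged = []
    for name, value in sites.items():
        with warnings.catch_warnings(record=True) as ws:
            warnings.simplefilter("always")  # record every warning, even repeats
            check_value(value)
        for w in ws:
            # Rewrite the first argument of the warning so that it names the site.
            first = "Site {}: {}".format(name, w.message.args[0])
            w.message.args = (first,) + w.message.args[1:]
            tagged.append(w.message)
    return tagged


tagged = validate_sites({"x": 0.3, "obs": float("nan")})
for message in tagged:
    print(message)
```

Recording the warnings first and editing `args` afterwards keeps the original warning category, so filters that users have already configured still apply to the tagged message.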

@neerajprad the following model

import numpyro
import numpy as np

def model():
    x = numpyro.sample("x", numpyro.distributions.Normal())
    numpyro.sample("obs", numpyro.distributions.Normal(x), obs=float('nan'))

mcmc = numpyro.infer.MCMC(numpyro.infer.NUTS(model), 10, 10)
mcmc.run(np.array([0, 0], dtype='uint32'))

gives the warning

UserWarning: Site obs: Out-of-support values provided to log prob method. The value argument should be within the support.
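For readers unfamiliar with the mechanism mentioned earlier in the thread, the standard `warnings` module can promote a warning to an exception, so that the traceback points at the call that emitted it. A minimal sketch, in which `log_prob_check` is a hypothetical function standing in for whatever code emits the warning:

```python
import warnings


def log_prob_check(value):
    # Hypothetical stand-in for a call that emits a UserWarning.
    if value != value:  # NaN is the only float that is not equal to itself
        warnings.warn(
            "Site obs: Out-of-support values provided to log prob method.",
            UserWarning,
        )
    return value


caught = None
with warnings.catch_warnings():
    # Inside this block, every UserWarning is raised as an exception.
    warnings.simplefilter("error", category=UserWarning)
    try:
        log_prob_check(float("nan"))
    except UserWarning as err:
        caught = err  # the traceback of `err` leads to the warning's origin

print(caught)
```

The same filter can be applied to a whole script from the command line with `python -W error::UserWarning script.py`.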

neerajprad · 2020-09-22T06:26:07Z

LGTM. Do you think this handles most of the forum questions you have been getting, or were they due to other numerical issues outside of distributions?

fehiepsi · 2020-09-23T06:31:46Z

This helps detect some of the issues raised in the forum: one where the data does not belong to the support, and one with a wrong parameter.

fehiepsi added 2 commits September 10, 2020 15:27

validate model if cannot find valid initial parameters

d54bb2b

add better warning message

3fcee5a

fehiepsi added the WIP label Sep 11, 2020

add init strategy

92da057

fehiepsi added awaiting review and removed WIP labels Sep 12, 2020

fehiepsi requested a review from martinjankowiak September 12, 2020 05:17

cleanup the code

71f6b87

neerajprad reviewed Sep 22, 2020

neerajprad approved these changes Sep 23, 2020

neerajprad merged commit 7f61de0 into pyro-ppl:master Sep 23, 2020
