Grads w.r.t. weights of `MixtureGeneral` Distribution are giving `nan`s #1870

Qazalbash · 2024-09-27T23:55:10Z

Hi,

We have created some models where we estimate the weights of the MixtureGeneral distribution. However, when computing the gradient of this argument, we are encountering nan values. We enabled jax.config.update("debug_nan", True) to diagnose the issue, and it pointed to the following line:

numpyro/numpyro/distributions/mixtures.py

Line 152 in 8e9313f

return jax.nn.logsumexp(sum_log_probs, axis=-1)

I suspect that after the implementation of #1791, extra care is needed to handle inf and nan values, possibly by using a double where for a safe logsumexp.

Important

This is an urgent issue, so a prompt response would be greatly appreciated.

The text was updated successfully, but these errors were encountered:

fehiepsi · 2024-09-28T14:34:36Z

You can add jax.debug.print(...) to inspect the component log probs. If all of the component log probs are -inf, nan will happen.

Qazalbash · 2024-10-02T15:16:17Z

I am able to get gradients of weights even with -jnp.inf, by modifying,

numpyro/numpyro/distributions/mixtures.py

Lines 148 to 152 in 8e9313f

    
           @validate_sample 
        
           def log_prob(self, value, intermediates=None): 
        
               del intermediates 
        
               sum_log_probs = self.component_log_probs(value) 
        
               return jax.nn.logsumexp(sum_log_probs, axis=-1)

to

@validate_sample
def log_prob(self, value, intermediates=None):
    del intermediates
    sum_log_probs = self.component_log_probs(value)
    safe_sum_log_probs = jnp.where(
        jnp.isneginf(sum_log_probs), -jnp.inf, sum_log_probs
    )
    return jax.nn.logsumexp(safe_sum_log_probs, axis=-1)

fehiepsi added the enhancement New feature or request label Sep 28, 2024

This comment was marked as outdated.

Sign in to view

Qazalbash mentioned this issue Oct 2, 2024

gh-1870: Refactor log_prob method in _MixtureBase class to handle -jnp.inf #1874

Merged

fehiepsi closed this as completed in #1874 Oct 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Grads w.r.t. weights of `MixtureGeneral` Distribution are giving `nan`s #1870

Grads w.r.t. weights of `MixtureGeneral` Distribution are giving `nan`s #1870

Qazalbash commented Sep 27, 2024

fehiepsi commented Sep 28, 2024

This comment was marked as outdated.

Qazalbash commented Oct 2, 2024

Grads w.r.t. weights of MixtureGeneral Distribution are giving nans #1870

Grads w.r.t. weights of MixtureGeneral Distribution are giving nans #1870

Comments

Qazalbash commented Sep 27, 2024

fehiepsi commented Sep 28, 2024

This comment was marked as outdated.

Qazalbash commented Oct 2, 2024

Grads w.r.t. weights of `MixtureGeneral` Distribution are giving `nan`s #1870

Grads w.r.t. weights of `MixtureGeneral` Distribution are giving `nan`s #1870