You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Using distributed or parallel set-up in script?: No
The Problem
The Pythia models, and by extension the GPTNeoXForCausalLM implementation don't appear to be working correctly in 4.35. I've attached a simple reproduction snippet below. This code works on 4.34, but produces NaNs on 4.35 during the forward pass. The token ids are not particularly anomalous.
The problem is likely related to the report at #28316, but this issue shows that any effects on reward modeling might be 2nd-order effects and that changes between 4.34 and 4.35 are the problem.
System Info
transformers
version: 4.35.0The Problem
The Pythia models, and by extension the
GPTNeoXForCausalLM
implementation don't appear to be working correctly in 4.35. I've attached a simple reproduction snippet below. This code works on 4.34, but produces NaNs on 4.35 during the forward pass. The token ids are not particularly anomalous.The problem is likely related to the report at #28316, but this issue shows that any effects on reward modeling might be 2nd-order effects and that changes between 4.34 and 4.35 are the problem.
@ArthurZucker @younesbelkada
Reproduction
Expected behavior
Normal forward pass, without NaNs.
The text was updated successfully, but these errors were encountered: