Possible bugs? #79

crazylyf · 2016-01-16T04:16:53Z

Hello,
When I run ocropus-ltrain, it occasionally warns "FloatingPointError: overflow encountered in exp", and the program seems to restart from the most recent saved state. The problem occurs mainly in the "ffunc" function in lstm.py, which computes the logistic sigmoid as 1.0/(1.0+exp(-x)). The same problem also occurs in the "sigmoid" function. I think it is caused by large magnitudes of x: for a large negative x, exp(-x) overflows. In the CLSTM source code, x is clipped to 20 for positive values and to -20 for negative values. After clipping like this, the program runs without the warning.
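
The clipping described above can be sketched as follows. This is a minimal illustration, not the actual lstm.py or CLSTM code: the names `naive_sigmoid`, `safe_sigmoid`, and the `limit` parameter are hypothetical, and it assumes NumPy is configured to raise on overflow, which the FloatingPointError in the report suggests.

```python
import numpy as np

def naive_sigmoid(x):
    # Unclipped form from the report: exp(-x) overflows for large negative x.
    return 1.0 / (1.0 + np.exp(-x))

def safe_sigmoid(x, limit=20.0):
    # Hypothetical "safe" variant: clip x to [-limit, limit] before exp.
    x = np.clip(x, -limit, limit)
    return 1.0 / (1.0 + np.exp(-x))

xs = np.array([-1000.0, 0.0, 1000.0])

# Assumption: overflow is raised as an exception rather than silently ignored.
with np.errstate(over="raise"):
    try:
        naive_sigmoid(xs)
        naive_raised = False
    except FloatingPointError:
        naive_raised = True
    clipped = safe_sigmoid(xs)  # does not overflow, since |x| <= 20 after clipping

print(naive_raised, clipped)
```

The limit of 20 follows the value quoted from CLSTM above; any bound well below the float64 overflow threshold of exp would also avoid the exception.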

Another problem is that the "backward" method in class "Parallel" returns None. This is fine for a 1-layer BLSTM system, but in a multi-layer BLSTM configuration, where parallel BLSTM layers are stacked one over another, it leads to an error, because the deltas returned by the subsequent layer are used as the deltas for the current layer. So maybe the method should return deltas.
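
One way the suggested change could look is sketched below. It is not the ocropy implementation: `ToyLinear` and `ParallelSketch` are hypothetical stand-ins, and summing the sub-networks' input deltas is an assumption based on all sub-networks receiving the same input.

```python
import numpy as np

class ToyLinear:
    """Hypothetical stand-in for a sub-network: y = x @ W.T, no bias."""
    def __init__(self, W):
        self.W = np.asarray(W, dtype=float)

    def noutputs(self):
        return self.W.shape[0]

    def forward(self, xs):
        return np.asarray(xs, dtype=float) @ self.W.T

    def backward(self, deltas):
        # Gradient with respect to the input of y = x @ W.T.
        return np.asarray(deltas, dtype=float) @ self.W

class ParallelSketch:
    """Runs sub-networks on the same input and concatenates their outputs."""
    def __init__(self, *nets):
        self.nets = nets

    def forward(self, xs):
        return np.concatenate([net.forward(xs) for net in self.nets], axis=1)

    def backward(self, deltas):
        deltas = np.asarray(deltas, dtype=float)
        start, total = 0, None
        for net in self.nets:
            k = net.noutputs()
            # Each sub-network gets its own slice of the output deltas.
            d = net.backward(deltas[:, start:start + k])
            # Shared input, so the input deltas are summed (assumption).
            total = d if total is None else total + d
            start += k
        return total  # returned so that a stacked layer below can use it

par = ParallelSketch(ToyLinear([[1, 0], [0, 1]]), ToyLinear([[2, 0]]))
outputs = par.forward(np.ones((3, 2)))
input_deltas = par.backward(np.ones((3, 3)))
```

With a non-None return value, a stack that threads deltas from layer to layer in the backward pass has something to pass on to the layer below.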

Best,

tmbdev · 2016-01-28T22:08:36Z

I'll keep the bug open, but for LSTM training, at this point, you're probably better off using the CLSTM implementation. It is almost a drop-in replacement and considerably faster.

If you want to submit patches for a "safe" sigmoid and for the BLSTM, please do.

@tmbdev

This should (hopefully) avoid some possible FloatingPointError overflow errors. For any x < -20 or x > 20, the sigmoid function ffunc already equals 0 or 1, respectively, to within a few times 10^-9, so clipping will not change the function substantially. The idea is from @tmbdev in #5 (comment), was first implemented in #49 (comment), and additional information is in #79 (comment).
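
The size of the error introduced by clipping at ±20 can be checked numerically. This is a small sketch using only the standard library; the variable names are illustrative.

```python
import math

def sigmoid(x):
    # Plain logistic function; safe here because |x| is only 20.
    return 1.0 / (1.0 + math.exp(-x))

# Distance from 0 at x = -20 and from 1 at x = 20.
low_gap = sigmoid(-20.0)
high_gap = 1.0 - sigmoid(20.0)

print(low_gap, high_gap)
```

Both gaps are on the order of 10^-9, which bounds how far the clipped function can deviate from the unclipped one outside [-20, 20].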

zuphilip · 2017-12-25T17:17:44Z

Fixed in #201.

zuphilip · 2017-12-25T17:19:58Z

Second part may still be open:

Another problem is that the "backward" method in class "Parallel" returns None. This is fine for a 1-layer BLSTM system, but in a multi-layer BLSTM configuration, where parallel BLSTM layers are stacked one over another, it leads to an error, because the deltas returned by the subsequent layer are used as the deltas for the current layer. So maybe the method should return deltas.

zuphilip mentioned this issue Apr 18, 2017

Clip exponential in ffunc to avoid overflow #201

Merged

zuphilip closed this as completed Dec 25, 2017

zuphilip reopened this Dec 25, 2017
