Possible RNN design for comment. #185

aterzis-google · 2020-12-17T18:42:00Z

Alternative RNN design due to david-berthelot@

david-berthelot · 2020-12-17T21:18:51Z

Following on the feedback on the original RNN pull request #97 Andreas and I did a pair-programming session to prototype a better design with the following goals:

flexible (accommodate any kind of internal primitive)
does not explicitly refers to a batch size
does not make decisions for inference (e.g. there are too many cases to provide a solution: like feeding previous prediction, beam search, ...)

Not being an expert myself in recurrent network, I would really love feedback @ebrevdo @AlexeyKurakin - feel free to loop in more people whose opinions could help.

ebrevdo · 2020-12-17T21:22:45Z

Some rough thoughts: 1. There will always be some RNNs that will want to do something across batch and time. These will be some combination of scan + input/output layers that can handle the time dimension (e.g., Dense layer). So expect that users will still subclass/override RNN. LSTM is a good example. 2. get_initial_state is still super useful. why? there are RNNs whose state is a nested combination of tensors including tensors of different *types*. for example, some special RNNs have as the state an integer scalar corresponding to the current time step. so each iteration of the RNN increments that state by one. but the other states are typical floating point vectors.

…

On Thu, Dec 17, 2020 at 1:19 PM David Berthelot ***@***.***> wrote: Following on the feedback on the original RNN pull request #97 <#97> Andreas and I did a pair-programming session to prototype a better design with the following goals: - flexible (accommodate any kind of internal primitive) - does not explicitly refers to a batch size - does not make decisions for inference (e.g. there are too many cases to provide a solution: like feeding previous prediction, beam search, ...) Not being an expert myself in recurrent network, I would really love feedback @ebrevdo <https://github.com/ebrevdo> @AlexeyKurakin <https://github.com/AlexeyKurakin> - feel free to loop in more people whose opinions could help. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#185 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AANWFG4XEXERYH3TDFGX7FLSVJYUVANCNFSM4VACK3FQ> .

aterzis-google · 2021-03-22T20:00:28Z

See #211 instead

aterzis-google · 2021-03-22T20:01:06Z

Instead, see: #211

aterzis-google added 19 commits August 22, 2020 09:26

RNN -> Recurrent neural network

25b9700

Merge remote-tracking branch 'upstream/master'

f72fa31

Merge remote-tracking branch 'upstream/master'

3ca78fd

GRU module

9e54c36

Merge remote-tracking branch 'upstream/master'

290f7e3

Merge remote-tracking branch 'upstream/master'

024f3eb

Add Tutorials to the documentation.

efcde5c

Merge remote-tracking branch 'upstream/master'

5cd76de

Merge remote-tracking branch 'upstream/master'

53fd2b8

Merge remote-tracking branch 'upstream/master'

f238a3b

Merge remote-tracking branch 'upstream/master'

282d670

Move RNN cell to layers.py

2e4569b

Merge remote-tracking branch 'upstream/master'

3aec247

Fix empty spaces

fb8b562

Merge remote-tracking branch 'upstream/master'

8bb0fcd

Merge remote-tracking branch 'upstream/master'

daadf8f

Merge remote-tracking branch 'upstream/master'

df705fa

Merge remote-tracking branch 'upstream/master'

88e25c6

Merge remote-tracking branch 'upstream/master'

165971a

aterzis-google requested review from david-berthelot, ebrevdo and AlexeyKurakin December 17, 2020 18:42

aterzis-google linked an issue Dec 17, 2020 that may be closed by this pull request

Could you outline how to write a simplest RNN Module? #31

Open

aterzis-google added 5 commits March 16, 2021 12:19

Merge remote-tracking branch 'upstream/master'

4975f41

Possible RNN design for comment.

2df126e

Documentation.

0f3dc99

Added vectorized implementation

b8f6462

removed unneeded file

e35205c

Updated design

74271d6

aterzis-google force-pushed the rnn_redesign branch from c3ad751 to 74271d6 Compare March 16, 2021 19:29

aterzis-google removed the request for review from david-berthelot March 16, 2021 19:29

aterzis-google closed this Mar 22, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Possible RNN design for comment. #185

Possible RNN design for comment. #185

aterzis-google commented Dec 17, 2020

david-berthelot commented Dec 17, 2020

ebrevdo commented Dec 17, 2020 via email

aterzis-google commented Mar 22, 2021

aterzis-google commented Mar 22, 2021

Possible RNN design for comment. #185

Possible RNN design for comment. #185

Conversation

aterzis-google commented Dec 17, 2020

david-berthelot commented Dec 17, 2020

ebrevdo commented Dec 17, 2020 via email

aterzis-google commented Mar 22, 2021

aterzis-google commented Mar 22, 2021