
Unit 2: Explaining the "residual learning" #342

Open
0xD4rky opened this issue Sep 5, 2024 · 4 comments

Comments

@0xD4rky
Contributor

0xD4rky commented Sep 5, 2024

I would like to explain in depth the residual learning introduced in the original paper.

In particular, I want to explain why learning the residual h(x) - x is easier for the model than learning h(x) directly (where h(x) is the underlying mapping between the input and output of the stacked layers).

Hence, allow me to raise a PR updating the docs, and you can review the changes!
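The idea above can be sketched in a few lines of Python. This is a minimal illustration (not from the course docs, and `residual_block` is a hypothetical helper name): a residual block outputs x + f(x), so when the optimal mapping is close to the identity, the layers only have to push f(x) toward zero instead of learning to reproduce x exactly.

```python
# Sketch of a residual connection: the stacked layers learn only the
# residual f(x) = h(x) - x, and the block outputs x + f(x).

def residual_block(x, f):
    """Apply a residual connection: output = x + f(x), elementwise."""
    return [xi + fi for xi, fi in zip(x, f(x))]

# If the residual function is zero, the block is an exact identity:
# "doing nothing" requires no effort from the weights.
zero_f = lambda x: [0.0 for _ in x]
print(residual_block([1.0, 2.0, 3.0], zero_f))  # [1.0, 2.0, 3.0]

# A plain (non-residual) stack of layers would instead have to learn
# to copy its input exactly to realize the same identity mapping.
```

This is why deep residual networks degrade gracefully as depth grows: extra blocks can default to (near-)identity mappings rather than having to learn them from scratch.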

@johko
Owner

johko commented Sep 19, 2024

Sounds great, feel free to write something up and create a PR 👍

@0xD4rky
Contributor Author

0xD4rky commented Sep 20, 2024

Will do for sure!

@sezan92
Contributor

sezan92 commented Oct 25, 2024

Is this issue done?

@johko
Owner

johko commented Oct 26, 2024

I think it is connected to PR #347, which is still open but almost done.
