Skip to content

Latest commit

 

History

History
2 lines (2 loc) · 101 Bytes

20221130.md

File metadata and controls

2 lines (2 loc) · 101 Bytes

1. Transformer

  • out = self.embedding(x)*math.sqrt(self.d_embed): 그래디언트 안정화 목적