Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
DeepNet: Scaling Transformers to 1k Layers
arxiv.org
194 points
homarp
4 years ago
38 comments
Loading...
DeepNet: Scaling Transformers to 1k Layers | Heykuki News