Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
Mixture-of-Depths: Dynamically allocating compute in transformers | Heykuki News
Mixture-of-Depths: Dynamically allocating compute in transformers
arxiv.org
281 points
milliondreams
2 years ago
83 comments
Loading...