Heykuki News

1. Cramming the training of a (BERT-type) language model into limited compute (github.com/JonasGeiping)
3 points by homarp, 3 years ago