Heykuki News

1. Compiling LLMs into a MegaKernel: A path to low-latency inference (zhihaojia.medium.com)
314 points by matt_d a year ago | 76 comments