Heykuki News
Top
New
Best
Ask
Show
Jobs
Medusa: Framework for Accelerating LLM Generation with Multiple Decoding Heads
github.com/FasterDecoding
5 points
PaulHoule
2 years ago
No comments yet