Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
Sarathi-Serve: A low-latency and high-throughput serving engine for LLMs
github.com/microsoft
2 points
tonyabracadabra
2 years ago
No comment yet
Sarathi-Serve: A low-latency and high-throughput serving engine for LLMs | Heykuki News