Heykuki News
Top
New
Best
Ask
Show
Jobs
1.
Show HN: Made a batching LLM API for a project. Mistral 200 tk/s on RTX 3090
(github.com/epolewski)
3 points
muttled
2 years ago