Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
Show HN: Made a batching LLM API for a project. Mistral 200 tk/s on RTX 3090 (github.com/epolewski)
3 points
muttled
2 years ago
discuss