Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
1.
▲
A Python Library to 6-7x the inference speed of your HF models
(github.com/MDK8888)
75 points
MDK8888
2 years ago
15 comments
2.
▲
A Minimal Implementation of Vllm
(github.com/MDK8888)
3 points
MDK8888
2 years ago
1 comment
3.
▲
Show HN: SageMode - A Python library for deploying, scaling, and monitoring LLMs
(github.com/MDK8888)
3 points
MDK8888
2 years ago
1 comment