Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
1.
▲
Show HN: A Zero-Copy 1.58-bit LLM Engine hitting 117 Tokens/s on single CPU core
(github.com/r3-engine)
4 points
dhilipsiva
4 months ago
discuss