Heykuki News
Top
New
Best
Ask
Show
Jobs
1.
Graphic designer hacking together my first basic game
(github.com/kveca)
1 point
kveca
10 years ago
discuss
2.
Show HN: KTransformers–236B Model and 1M Context LLM Inference on Local Machines
(github.com/kvcache-ai)
20 points
sssummer
2 years ago
3 comments
3.
Show HN: KTransformers: 671B DeepSeek-R1 on a Single Machine, 286 tokens/s Prefill
(github.com/kvcache-ai)
14 points
sssummer
a year ago
discuss
4.
Mooncake: A KVCache-Centric Disaggregated Architecture for LLM Serving
(github.com/kvcache-ai)
13 points
zinccat
2 years ago
discuss
5.
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving
(github.com/kvcache-ai)
8 points
sarkory
a year ago
discuss
6.
Kill all descendants of a process using POSIX shell and /proc
(github.com/kvechera)
2 points
sply
8 years ago
discuss
7.
Kvcached: Virtualized, elastic KV cache for LLM serving on shared GPUs
(notion.so)
69 points
Jrxing
7 months ago
13 comments
8.
Show HN: Revibing nanochat's inference model in C++ with ggml
(github.com/k-ye)
5 points
makechan
5 months ago
discuss
9.
Show HN: I Used Llama-70B Logprobs for Better, Cheaper and Faster Chunking
(github.com/ZeroEntropy-AI)
1 point
ghita_
2 years ago
discuss
10.
Microsoft on GitHub
(github.com/Microsoft)
73 points
kevcampb
12 years ago
57 comments
11.
Etcher 1.4.4 Ignores Privacy Setting
(github.com/balena-io)
2 points
kevcampb
7 years ago
discuss