Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
Graphic designer hacking together my first basic game (github.com/kveca)
1 point
kveca
10 years ago
discuss
2.
Show HN: KTransformers–236B Model and 1M Context LLM Inference on Local Machines (github.com/kvcache-ai)
20 points
sssummer
2 years ago
3 comments
3.
Show HN: KTransformers:671B DeepSeek-R1 on a Single Machine-286 tokens/s Prefill (github.com/kvcache-ai)
14 points
sssummer
a year ago
discuss
4.
Mooncake: A KVCache-Centric Disaggregated Architecture for LLM Serving (github.com/kvcache-ai)
13 points
zinccat
2 years ago
discuss
5.
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving (github.com/kvcache-ai)
8 points
sarkory
a year ago
discuss
6.
Kill all descendants of a process using POSIX shell and /proc (github.com/kvechera)
2 points
sply
8 years ago
discuss
7.
Kvcached: Virtualized, elastic KV cache for LLM serving on shared GPUs (notion.so)
69 points
Jrxing
7 months ago
13 comments
8.
Show HN: Revibing nanochat's inference model in C++ with ggml (github.com/k-ye)
5 points
makechan
5 months ago
discuss
9.
Show HN: I Used Llama-70B Logprobs for Better, Cheaper and Faster Chunking (github.com/ZeroEntropy-AI)
1 point
ghita_
2 years ago
discuss
10.
Microsoft on GitHub (github.com/Microsoft)
73 points
kevcampb
12 years ago
57 comments
11.
Etcher 1.4.4 Ignores Privacy Setting (github.com/balena-io)
2 points
kevcampb
7 years ago
discuss