Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
961.
▲
Deepseek R1 Distill 8B Q40 on 4 x Raspberry Pi 5
(github.com/b4rtaz)
306 points
b4rtazz
a year ago
156 comments
962.
▲
DeepDive in everything of Llama3: revealing detailed insights and implementation
(github.com/therealoliver)
222 points
therealoliver
a year ago
14 comments
963.
▲
Run Llama locally with only PyTorch on CPU
(github.com/anordin95)
168 points
anordin95
2 years ago
34 comments
964.
▲
Mistral Integration Improved in Llama.cpp
(github.com/ggml-org)
95 points
decide1000
10 months ago
15 comments
965.
▲
Karpathy/Nano-Llama31
(github.com/karpathy)
74 points
tim_sw
2 years ago
1 comment
966.
▲
llamafile: Distribute and Run LLMs with a Single File
(github.com/mozilla-ai)
43 points
stefankuehnel
6 months ago
7 comments
967.
▲
Llama.cpp: Add GPT-OSS
(github.com/ggml-org)
35 points
atgctg
10 months ago
discuss
968.
▲
LlamaChunk: Better RAG Chunking Than LlamaIndex
(github.com/ZeroEntropy-AI)
15 points
npip99
2 years ago
5 comments
969.
▲
Show HN: Distributed Llama – Run LLMs on multiple devices in parallel
(github.com/b4rtaz)
12 points
b4rtazz
2 years ago
discuss
970.
▲
Directly run and investigate Llama models locally
(github.com/anordin95)
8 points
anordin95
2 years ago
1 comment
971.
▲
Llamafile 0.4 now with Mixtral support
(github.com/Mozilla-Ocho)
6 points
louismerlin
2 years ago
1 comment
972.
▲
Llama.cpp: Deterministic Inference Mode (CUDA): RMSNorm, MatMul, Attention
(github.com/ggml-org)
6 points
diwank
9 months ago
discuss
973.
▲
Run Llama 3.3 70B Q40 on $1516 GPU 3.3 tok/s
(github.com/b4rtaz)
6 points
b4rtazz
a year ago
discuss
974.
▲
Llama, alpaca and gpt4all API in Golang
(github.com/go-skynet)
6 points
mudler
3 years ago
discuss
975.
▲
Open-Source LLaMA v2 Chatbot
(github.com/a16z-infra)
5 points
gk1
3 years ago
discuss
976.
▲
GitHub deletes popular llama.cpp fork without explanation
(github.com/ikawrakow)
4 points
akawry
a year ago
2 comments
977.
▲
Distributed Llama on Raspberry Pis
(github.com/b4rtaz)
4 points
politelemon
2 years ago
2 comments
978.
▲
Llama.cpp's Agents.md
(github.com/ggml-org)
4 points
Wowfunhappy
2 months ago
1 comment
979.
▲
Distributed-Llama: Connect home devices into a cluster for LLM inference
(github.com/b4rtaz)
4 points
tosh
a year ago
1 comment
980.
▲
Llama.cpp launches official WebUI for local LLMs
(github.com/ggml-org)
4 points
victormustar
7 months ago
discuss
981.
▲
Llama 3.1 is available for home AI clusters! Run 8B model with the full context
(github.com/b4rtaz)
4 points
b4rtazz
2 years ago
discuss
982.
▲
llamafile v0.8
(github.com/Mozilla-Ocho)
4 points
birriel
2 years ago
discuss
983.
▲
Distributed Llama
(github.com/b4rtaz)
4 points
lagniappe
2 years ago
discuss
984.
▲
Code Llama for VS Code
(github.com/xNul)
4 points
hcrisp
3 years ago
discuss
985.
▲
Directly run and investigate Llama models locally with only PyTorch
(github.com/anordin95)
3 points
anordin95
2 years ago
1 comment
986.
▲
ik_llama.cpp – llama.cpp fork with better CPU performance
(github.com/ikawrakow)
3 points
peter_d_sherman
7 days ago
discuss
987.
▲
Grinder12: 0.96-Bit Lossless Streaming KV-Cache (16.55x VRAM Savings
(github.com/ggml-org)
3 points
AMICLLC
a month ago
discuss
988.
▲
Distributed Llama
(github.com/b4rtaz)
3 points
oldfuture
4 months ago
discuss
989.
▲
LlamaBarn – A macOS menu bar app for running local LLMs
(github.com/ggml-org)
3 points
lyapustin
7 months ago
discuss
990.
▲
AMD teams contributing to the llama.cpp codebase
(github.com/ggml-org)
3 points
gzer0
10 months ago
discuss
More