Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
961.
Deepseek R1 Distill 8B Q40 on 4 x Raspberry Pi 5 (github.com/b4rtaz)
306 points
b4rtazz
a year ago
156 comments
962.
DeepDive in everything of Llama3: revealing detailed insights and implementation (github.com/therealoliver)
222 points
therealoliver
a year ago
14 comments
963.
Run Llama locally with only PyTorch on CPU (github.com/anordin95)
168 points
anordin95
2 years ago
34 comments
964.
Mistral Integration Improved in Llama.cpp (github.com/ggml-org)
95 points
decide1000
10 months ago
15 comments
965.
Karpathy/Nano-Llama31 (github.com/karpathy)
74 points
tim_sw
2 years ago
1 comment
966.
llamafile: Distribute and Run LLMs with a Single File (github.com/mozilla-ai)
43 points
stefankuehnel
6 months ago
7 comments
967.
Llama.cpp: Add GPT-OSS (github.com/ggml-org)
35 points
atgctg
10 months ago
discuss
968.
LlamaChunk: Better RAG Chunking Than LlamaIndex (github.com/ZeroEntropy-AI)
15 points
npip99
2 years ago
5 comments
969.
Show HN: Distributed Llama – Run LLMs on multiple devices in parallel (github.com/b4rtaz)
12 points
b4rtazz
2 years ago
discuss
970.
Directly run and investigate Llama models locally (github.com/anordin95)
8 points
anordin95
2 years ago
1 comment
971.
Llamafile 0.4 now with Mixtral support (github.com/Mozilla-Ocho)
6 points
louismerlin
2 years ago
1 comment
972.
Llama.cpp: Deterministic Inference Mode (CUDA): RMSNorm, MatMul, Attention (github.com/ggml-org)
6 points
diwank
9 months ago
discuss
973.
Run Llama 3.3 70B Q40 on $1516 GPU 3.3 tok/s (github.com/b4rtaz)
6 points
b4rtazz
a year ago
discuss
974.
Llama, alpaca and gpt4all API in Golang (github.com/go-skynet)
6 points
mudler
3 years ago
discuss
975.
Open-Source LLaMA v2 Chatbot (github.com/a16z-infra)
5 points
gk1
3 years ago
discuss
976.
GitHub deletes popular llama.cpp fork without explanation (github.com/ikawrakow)
4 points
akawry
a year ago
2 comments
977.
Distributed Llama on Raspberry Pis (github.com/b4rtaz)
4 points
politelemon
2 years ago
2 comments
978.
Llama.cpp's Agents.md (github.com/ggml-org)
4 points
Wowfunhappy
2 months ago
1 comment
979.
Distributed-Llama: Connect home devices into a cluster for LLM inference (github.com/b4rtaz)
4 points
tosh
a year ago
1 comment
980.
Llama.cpp launches official WebUI for local LLMs (github.com/ggml-org)
4 points
victormustar
7 months ago
discuss
981.
Llama 3.1 is available for home AI clusters! Run 8B model with the full context (github.com/b4rtaz)
4 points
b4rtazz
2 years ago
discuss
982.
llamafile v0.8 (github.com/Mozilla-Ocho)
4 points
birriel
2 years ago
discuss
983.
Distributed Llama (github.com/b4rtaz)
4 points
lagniappe
2 years ago
discuss
984.
Code Llama for VS Code (github.com/xNul)
4 points
hcrisp
3 years ago
discuss
985.
Directly run and investigate Llama models locally with only PyTorch (github.com/anordin95)
3 points
anordin95
2 years ago
1 comment
986.
ik_llama.cpp – llama.cpp fork with better CPU performance (github.com/ikawrakow)
3 points
peter_d_sherman
7 days ago
discuss
987.
Grinder12: 0.96-Bit Lossless Streaming KV-Cache (16.55x VRAM Savings (github.com/ggml-org)
3 points
AMICLLC
a month ago
discuss
988.
Distributed Llama (github.com/b4rtaz)
3 points
oldfuture
4 months ago
discuss
989.
LlamaBarn – A macOS menu bar app for running local LLMs (github.com/ggml-org)
3 points
lyapustin
7 months ago
discuss
990.
AMD teams contributing to the llama.cpp codebase (github.com/ggml-org)
3 points
gzer0
10 months ago
discuss
More