Heykuki News

Top | New | Best | Ask | Show | Jobs
751.
30B model now needs only 5.8GB of RAM? How? (github.com/ggerganov)
31 points
olalonde
3 years ago
11 comments
752.
WIP Llama.cpp Vulkan Implementations (github.com/ggerganov)
24 points
brucethemoose2
3 years ago
1 comment
753.
Gemma Is Added to Llama.cpp (github.com/ggerganov)
17 points
behnamoh
2 years ago
discuss
754.
Speculative: PoC for speeding-up inference via speculative sampling by ggerganov (github.com/ggerganov)
16 points
kristianp
3 years ago
1 comment
755.
Show HN: LiteParse v2, now in Rust 100x faster (github.com/run-llama)
15 points
pierre
10 days ago
discuss
756.
An open source Claude Artifacts – generate small apps with one prompt (github.com/Nutlope)
12 points
alexfefun
2 years ago
3 comments
757.
Show HN: LiteParse, a fast open-source document parser for AI agents (github.com/run-llama)
12 points
freezed8
3 months ago
discuss
758.
Llama 3.2 Release (github.com/meta-llama)
12 points
nickthegreek
2 years ago
discuss
759.
Show HN: Llama2.ipynb (github.com/rbitr)
12 points
andy99
3 years ago
discuss
760.
Show HN: Chaos Llama – Chaos Monkey Built on AWS Lambda (github.com/hassy)
12 points
hassy
10 years ago
discuss
761.
Grok-1 Support for Llama.cpp (github.com/ggerganov)
11 points
schappim
2 years ago
2 comments
762.
Liteparse (github.com/run-llama)
9 points
pierre
2 months ago
1 comment
763.
Show HN: LlamaChat - interact with your favourite LLaMA models on macOS (github.com/alexrozanski)
9 points
alexrozanski
3 years ago
discuss
764.
Show HN: LlamaExtract, a tool to automatically extract schema from documents (github.com/run-llama)
8 points
pierre
2 years ago
4 comments
765.
Llama2.c64: a port of llama2.c to the Commodore C64 (github.com/ytmytm)
8 points
adunk
a year ago
discuss
766.
Llama 3.1: 405B, the largest openly available model released (github.com/meta-llama)
8 points
yarapavan
2 years ago
discuss
767.
gg: "M2 Ultra is the absolute best personal LLM inference node you can buy." (github.com/ggerganov)
8 points
behnamoh
3 years ago
discuss
768.
LLaMA-rs: a Rust port of llama.cpp for fast LLaMA inference on CPU (github.com/setzer22)
8 points
darthdeus
3 years ago
discuss
769.
Llama2 + Haystack on Colab (github.com/anakin87)
7 points
anakin87
3 years ago
1 comment
770.
Ggml 2x WASM Speed with SIMD Optimization Using 99% DeepSeek-R1-Generated Code (github.com/ggerganov)
7 points
bratao
a year ago
discuss
771.
Llama as a System (github.com/meta-llama)
7 points
cztomsik
2 years ago
discuss
772.
Distributed Llama 3 Inference (github.com/evilsocket)
7 points
345765476586
2 years ago
discuss
773.
Llama.cpp Working on Support for Llama3 (github.com/ggerganov)
7 points
theolivenbaum
2 years ago
discuss
774.
Show HN: secinsights.ai – An open-source full-stack app using LlamaIndex (github.com/run-llama)
7 points
secsamai
3 years ago
discuss
775.
Karpathy's llama2.c ported to pure Python (github.com/tairov)
6 points
atairov
3 years ago
10 comments
776.
DeepSeek-R1 speeds up llama.cpp code by x2 (github.com/ggerganov)
6 points
roboboffin
a year ago
3 comments
777.
L2E llama2.c running in a PDF in a Schroedinger PNG [pdf] (github.com/trholding)
6 points
AMICABoard
a year ago
2 comments
778.
Show HN: LlaMaKey – One master key for all cloud LLM/GenAI APIs (github.com/TexteaInc)
6 points
forrestbao
2 years ago
2 comments
779.
Llama2.c Running in a PDF (github.com/trholding)
6 points
AMICABoard
a year ago
1 comment
780.
llama.cpp now supports StarCoder model series (github.com/ggerganov)
6 points
wsxiaoys
3 years ago
1 comment