Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
751.
▲
30B model now needs only 5.8GB of RAM? How?
(github.com/ggerganov)
31 points
olalonde
3 years ago
11 comments
752.
▲
WIP Llama.cpp Vulkan Implementations
(github.com/ggerganov)
24 points
brucethemoose2
3 years ago
1 comment
753.
▲
Gemma Is Added to Llama.cpp
(github.com/ggerganov)
17 points
behnamoh
2 years ago
discuss
754.
▲
Speculative: PoC for speeding-up inference via speculative sampling by ggerganov
(github.com/ggerganov)
16 points
kristianp
3 years ago
1 comment
755.
▲
Show HN: LiteParse v2, now in Rust 100x faster
(github.com/run-llama)
15 points
pierre
10 days ago
discuss
756.
▲
An open source Claude Artifacts – generate small apps with one prompt
(github.com/Nutlope)
12 points
alexfefun
2 years ago
3 comments
757.
▲
Show HN: LiteParse, a fast open-source document parser for AI agents
(github.com/run-llama)
12 points
freezed8
3 months ago
discuss
758.
▲
Llama 3.2 Release
(github.com/meta-llama)
12 points
nickthegreek
2 years ago
discuss
759.
▲
Show HN: Llama2.ipynb
(github.com/rbitr)
12 points
andy99
3 years ago
discuss
760.
▲
Show HN: Chaos Llama – Chaos Monkey Build on AWS Lambda
(github.com/hassy)
12 points
hassy
10 years ago
discuss
761.
▲
Grok-1 Support for Llama.cpp
(github.com/ggerganov)
11 points
schappim
2 years ago
2 comments
762.
▲
Liteparse
(github.com/run-llama)
9 points
pierre
2 months ago
1 comment
763.
▲
Show HN: LlamaChat - interact with your favourite LLaMA models on macOS
(github.com/alexrozanski)
9 points
alexrozanski
3 years ago
discuss
764.
▲
Show HN: LlamaExtract, a tool to automatically extract schema from documents
(github.com/run-llama)
8 points
pierre
2 years ago
4 comments
765.
▲
Llama2.c64: a port of llama2.c to the Commodore C64
(github.com/ytmytm)
8 points
adunk
a year ago
discuss
766.
▲
Llama 3.1: 405B, the largest openly available model released
(github.com/meta-llama)
8 points
yarapavan
2 years ago
discuss
767.
▲
gg: "M2 Ultra is the absolute best personal LLM inference node you can buy."
(github.com/ggerganov)
8 points
behnamoh
3 years ago
discuss
768.
▲
LLaMA-rs: a Rust port of llama.cpp for fast LLaMA inference on CPU
(github.com/setzer22)
8 points
darthdeus
3 years ago
discuss
769.
▲
Llama2 + Haystack on Colab
(github.com/anakin87)
7 points
anakin87
3 years ago
1 comment
770.
▲
Ggml 2x WASM Speed with SIMD Optimization Using 99% DeekSeek-R1-Generated Code
(github.com/ggerganov)
7 points
bratao
a year ago
discuss
771.
▲
Llama as a System
(github.com/meta-llama)
7 points
cztomsik
2 years ago
discuss
772.
▲
Distributed LLama3 Inference
(github.com/evilsocket)
7 points
345765476586
2 years ago
discuss
773.
▲
Llama.cpp Working on Support for Llama3
(github.com/ggerganov)
7 points
theolivenbaum
2 years ago
discuss
774.
▲
Show HN: secinsights.ai – An open-source full-stack app using LlamaIndex
(github.com/run-llama)
7 points
secsamai
3 years ago
discuss
775.
▲
Karpathy's llama2.c ported to pure Python
(github.com/tairov)
6 points
atairov
3 years ago
10 comments
776.
▲
DeepSeek-R1 speeds up llama.cpp code by x2
(github.com/ggerganov)
6 points
roboboffin
a year ago
3 comments
777.
▲
L2E llama2.c running in a PDF in a Shroedinger PNG [pdf]
(github.com/trholding)
6 points
AMICABoard
a year ago
2 comments
778.
▲
Show HN: LlaMaKey – One master key for all cloud LLM/GenAI APIs
(github.com/TexteaInc)
6 points
forrestbao
2 years ago
2 comments
779.
▲
Llama2.c Running in a PDF
(github.com/trholding)
6 points
AMICABoard
a year ago
1 comment
780.
▲
llama.cpp now supports StarCoder model series
(github.com/ggerganov)
6 points
wsxiaoys
3 years ago
1 comment
More