Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
631.
▲
Modern C++ Programming: Busato
(github.com/federico-busato)
102 points
KnuthIsGod
a month ago
27 comments
632.
▲
Minigpt4 Inference on CPU
(github.com/Maknee)
102 points
maknee
3 years ago
12 comments
633.
▲
Performance of llama.cpp on Apple Silicon A-series
(github.com/ggerganov)
100 points
mobilio
2 years ago
41 comments
634.
▲
llama.cpp: Roadmap May 2023
(github.com/ggerganov)
97 points
tosh
3 years ago
6 comments
635.
▲
Running Llama.cpp on AWS Instances
(github.com/ggerganov)
96 points
schappim
3 years ago
10 comments
636.
▲
Revert for jart’s llama.cpp MMAP miracles
(github.com/ggerganov)
86 points
mmoustafa
3 years ago
86 comments
637.
▲
Show HN: Match(it): A C++17 pattern-matching library with lots of good stuffs
(github.com/BowenFu)
85 points
amazing42
4 years ago
44 comments
638.
▲
Whisper.cpp now has CoreML suppprt
(github.com/ggerganov)
70 points
schappim
3 years ago
5 comments
639.
▲
A history of CPAN (2015)
(github.com/neilb)
61 points
fanf2
8 years ago
discuss
640.
▲
Show HN: 3DGS.cpp – performant, cross platform Gaussian Splatting with Vulkan
(github.com/shg8)
50 points
0x02A
2 years ago
20 comments
641.
▲
CPU Port of OpenAI's Whisper Speech to Text
(github.com/ggerganov)
41 points
abetusk
4 years ago
3 comments
642.
▲
30B model now needs only 5.8GB of RAM? How?
(github.com/ggerganov)
31 points
olalonde
3 years ago
11 comments
643.
▲
WIP Llama.cpp Vulkan Implementations
(github.com/ggerganov)
24 points
brucethemoose2
3 years ago
1 comment
644.
▲
Show HN: SyNumpy – a Header only C++17 library for working with NumPy Arrays
(github.com/symisc)
22 points
symisc_devel
2 months ago
4 comments
645.
▲
Ollama are 'try[ing to] achieve vendor lock-in'
(github.com/ggerganov)
17 points
alexmorley
a year ago
5 comments
646.
▲
Gemma Is Added to Llama.cpp
(github.com/ggerganov)
17 points
behnamoh
2 years ago
discuss
647.
▲
Speculative: PoC for speeding-up inference via speculative sampling by ggerganov
(github.com/ggerganov)
16 points
kristianp
3 years ago
1 comment
648.
▲
CLIP inference in plain C/C++ with no extra dependencies
(github.com/monatis)
12 points
lawrencechen
3 years ago
2 comments
649.
▲
Grok-1 Support for Llama.cpp
(github.com/ggerganov)
11 points
schappim
2 years ago
2 comments
650.
▲
AI-SDK-cpp: Modern C++ AI SDK
(github.com/ClickHouse)
11 points
samaysharma
a year ago
1 comment
651.
▲
BehaviorTree.CPP: C++ behavior tree library, batteries included
(github.com/BehaviorTree)
9 points
facontidavide
5 years ago
2 comments
652.
▲
Port of OpenAI's Whisper model in C/C++
(github.com/ggerganov)
8 points
pbowyer
4 years ago
1 comment
653.
▲
gg: "M2 Ultra is the absolute best personal LLM inference node you can buy."
(github.com/ggerganov)
8 points
behnamoh
3 years ago
discuss
654.
▲
Linux/Clang/Modern C++ on Travis-CI
(github.com/jbcoe)
7 points
jbcoe
10 years ago
1 comment
655.
▲
Ggml 2x WASM Speed with SIMD Optimization Using 99% DeekSeek-R1-Generated Code
(github.com/ggerganov)
7 points
bratao
a year ago
discuss
656.
▲
Llama.cpp Working on Support for Llama3
(github.com/ggerganov)
7 points
theolivenbaum
2 years ago
discuss
657.
▲
DeepSeek-R1 speeds up llama.cpp code by x2
(github.com/ggerganov)
6 points
roboboffin
a year ago
3 comments
658.
▲
Show HN: SAM3-CPU – Run Segment Anything on CPU with memory-aware video chunking
(github.com/rhubarb-ai)
6 points
judlaw
2 months ago
1 comment
659.
▲
llama.cpp now supports StarCoder model series
(github.com/ggerganov)
6 points
wsxiaoys
3 years ago
1 comment
660.
▲
LLaMA 7B model running on 4GB RAM Raspberry Pi 4
(github.com/ggerganov)
6 points
amrrs
3 years ago
discuss
More