Heykuki News
Top
New
Best
Ask
Show
Jobs
661.
▲
Show HN: Spirit of C++
(github.com/legends2k)
5 points
legends2k
7 years ago
4 comments
662.
▲
Bloomz.cpp: Run multilingual BLOOM model with C++
(github.com/NouamaneTazi)
5 points
osanseviero
3 years ago
2 comments
663.
▲
I implemented CLIP inference in plain C/C++
(github.com/monatis)
5 points
monatis
3 years ago
1 comment
664.
▲
AWS is laying the groundwork for nested virtualization on EC2
(github.com/aws)
5 points
acj
4 months ago
discuss
665.
▲
Show HN: AI-SDK-Cpp – Unified C++ SDK for OpenAI, Anthropic, and More
(github.com/iskakaushik)
5 points
cauchyk
a year ago
discuss
666.
▲
Safetensors.cpp – Zero Dependency Safetensors Loading and Storing in C++
(github.com/carsonpo)
5 points
carsonpoole
2 years ago
discuss
667.
▲
Llama.cpp: SOTA 2-bit quants
(github.com/ggerganov)
5 points
tosh
2 years ago
discuss
668.
▲
Whisper.cpp: Port of OpenAI's Whisper model in C/C++
(github.com/ggerganov)
5 points
lnyan
4 years ago
discuss
669.
▲
Show HN: 2048.cpp – Play 2048 in your terminal
(github.com/plibither8)
5 points
plibither8
8 years ago
discuss
670.
▲
Llama.cpp speculative sampling: 2x faster inference for large models
(github.com/ggerganov)
4 points
bobivl
3 years ago
1 comment
671.
▲
Prima.cpp – run 70B-Scale LLMs on low-powered home clusters
(github.com/Lizonghang)
4 points
oleg_tarasov
a year ago
discuss
672.
▲
Llama.cpp PR with 99% of code written by DeepSeek-R1
(github.com/ggerganov)
4 points
zelag
a year ago
discuss
673.
▲
Bark.cpp: Port of Suno AI's Bark in C/C++ for fast inference
(github.com/PABannier)
4 points
siraben
2 years ago
discuss
674.
▲
Source code of Google Gemma model in C++
(github.com/google)
4 points
yu3zhou4
2 years ago
discuss
675.
▲
Wchess
(github.com/ggerganov)
4 points
tosh
2 years ago
discuss
676.
▲
Whisper.wasm
(github.com/ggerganov)
4 points
tosh
3 years ago
discuss
677.
▲
AMD ROCm Support Added to Llama.cpp
(github.com/ggerganov)
4 points
irusensei
3 years ago
discuss
678.
▲
Full GPU Inference of LLaMA on Apple Silicon Using Metal
(github.com/ggerganov)
4 points
behnamoh
3 years ago
discuss
679.
▲
Inference at the Edge
(github.com/ggerganov)
4 points
Mizza
3 years ago
discuss
680.
▲
Whisper: performant port of OpenAI's Whisper speech recognition model in C/C++
(github.com/ggerganov)
4 points
nateb2022
4 years ago
discuss
681.
▲
Show HN: Bark.cpp, fast TTS model for multilingual realistic audio generation
(github.com/PABannier)
3 points
el_pa_b
2 years ago
3 comments
682.
▲
Alpaca 7B running on Google Pixel 7 Pro
(github.com/rupeshs)
3 points
oneinfiniteloop
3 years ago
2 comments
683.
▲
Llama.cpp now supports tool calling (OpenAI-compatible)
(github.com/ggerganov)
3 points
ochafik
a year ago
1 comment
684.
▲
GGML Flash Attention support merged into llama.cpp
(github.com/ggerganov)
3 points
smcleod
2 years ago
1 comment
685.
▲
Kubernetes In-Place Pod Resource Resize in Action: Kube Startup CPU Boost
(github.com/google)
3 points
mikowhy
2 years ago
1 comment
686.
▲
Acestep.cpp: portable C++17 implementation of ACE-Step 1.5 using GGML
(github.com/ServeurpersoCom)
3 points
qxip
3 months ago
discuss
687.
▲
Ggml C++ port of VITS for text-to-speech
(github.com/maxilevi)
3 points
maxilevi
3 months ago
discuss
688.
▲
To Use Snapdragon NPU, HTP Ops Libraries Must Be Signed with Trusted Certs
(github.com/qualcomm)
3 points
WhereIsTheTruth
4 months ago
discuss
689.
▲
Deepseek CPP for CPU only inference
(github.com/andrewkchan)
3 points
amrrs
a year ago
discuss
690.
▲
Whisper.cpp: Looking for Maintainers
(github.com/ggerganov)
3 points
tech234a
a year ago
discuss