Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
61.
▲
Show HN: GPT-J inference on the CPU using C/C++
(github.com/ggerganov)
5 points
ggerganov
4 years ago
2 comments
62.
▲
Llama.cpp: SOTA 2-bit quants
(github.com/ggerganov)
5 points
tosh
2 years ago
discuss
63.
▲
Falcon 40B Working on Ggml
(github.com/ggerganov)
5 points
__anon-2023__
3 years ago
discuss
64.
▲
Whisper.cpp: Port of OpenAI's Whisper model in C/C++
(github.com/ggerganov)
5 points
lnyan
4 years ago
discuss
65.
▲
HNTERM – Browse Hacker News interactively in your terminal
(github.com/ggerganov)
5 points
graderjs
4 years ago
discuss
66.
▲
ImTui: Immediate Mode Text-Based User Interface C++ Library
(github.com/ggerganov)
5 points
seansh
4 years ago
discuss
67.
▲
Real-Time Capturing Exact Keystrokes Using Sound
(github.com/ggerganov)
5 points
foobaw
8 years ago
discuss
68.
▲
Llama.cpp speculative sampling: 2x faster inference for large models
(github.com/ggerganov)
4 points
bobivl
3 years ago
1 comment
69.
▲
Show HN: GGWave – Data over Sound for Microcontrollers
(github.com/ggerganov)
4 points
ggerganov
4 years ago
1 comment
70.
▲
ImTui: Immediate Mode Text-Based User Interface C++ Library
(github.com/ggerganov)
4 points
signa11
5 years ago
1 comment
71.
▲
Llama.cpp PR with 99% of code written by DeepSeek-R1
(github.com/ggerganov)
4 points
zelag
a year ago
discuss
72.
▲
Wchess
(github.com/ggerganov)
4 points
tosh
2 years ago
discuss
73.
▲
Whisper.wasm
(github.com/ggerganov)
4 points
tosh
3 years ago
discuss
74.
▲
AMD ROCm Support Added to Llama.cpp
(github.com/ggerganov)
4 points
irusensei
3 years ago
discuss
75.
▲
Full GPU Inference of LLaMA on Apple Silicon Using Metal
(github.com/ggerganov)
4 points
behnamoh
3 years ago
discuss
76.
▲
Inference at the Edge
(github.com/ggerganov)
4 points
Mizza
3 years ago
discuss
77.
▲
Whisper: performant port of OpenAI's Whisper spech recognition model in C/C++
(github.com/ggerganov)
4 points
nateb2022
4 years ago
discuss
78.
▲
ImTui: Immediate Mode Text-Based User Interface Library for C++
(github.com/ggerganov)
4 points
pcr910303
6 years ago
discuss
79.
▲
ImTui: Immediate mode text-based user interface library
(github.com/ggerganov)
4 points
ingve
6 years ago
discuss
80.
▲
Llama.cpp now supports tool calling (OpenAI-compatible)
(github.com/ggerganov)
3 points
ochafik
a year ago
1 comment
81.
▲
GGML Flash Attention support merged into llama.cpp
(github.com/ggerganov)
3 points
smcleod
2 years ago
1 comment
82.
▲
Show HN: GPT-2 inference on the CPU using C/C++
(github.com/ggerganov)
3 points
ggerganov
4 years ago
1 comment
83.
▲
Show HN: Tweet2Doom – A Twitter bot that plays Doom
(github.com/ggerganov)
3 points
ggerganov
5 years ago
1 comment
84.
▲
Show HN: ggwave – tiny data-over-sound library
(github.com/ggerganov)
3 points
ggerganov
5 years ago
1 comment
85.
▲
Whisper.cpp: Looking for Maintainers
(github.com/ggerganov)
3 points
tech234a
a year ago
discuss
86.
▲
Distributed LLM Inference with Llama.cpp
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
87.
▲
Control Vectors have been added to llama.cpp
(github.com/ggerganov)
3 points
Der_Einzige
2 years ago
discuss
88.
▲
Llama.cpp supports distributed inference across machines on a local network
(github.com/ggerganov)
3 points
behnamoh
2 years ago
discuss
89.
▲
CUDA: Faster Mixtral Prompt Processing
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
90.
▲
Llama.cpp: Support for Phi-2
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
More