Heykuki News
Top
New
Best
Ask
Show
Jobs
91.
▲
Control Vectors have been added to llama.cpp
(github.com/ggerganov)
3 points
Der_Einzige
2 years ago
discuss
92.
▲
Llama.cpp supports distributed inference across machines on a local network
(github.com/ggerganov)
3 points
behnamoh
2 years ago
discuss
93.
▲
CUDA: Faster Mixtral Prompt Processing
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
94.
▲
Llama.cpp: Support for Phi-2
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
95.
▲
QMoE Support for Mixtral
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
96.
▲
Llama 2: poc for running 70B on CPU
(github.com/ggerganov)
3 points
tosh
3 years ago
discuss
97.
▲
Inference at the edge: Efficient transformer model inference on-device
(github.com/ggerganov)
3 points
lioeters
3 years ago
discuss
98.
▲
K-Quants
(github.com/ggerganov)
3 points
tosh
3 years ago
discuss
99.
▲
StableLM already being ported to ggml
(github.com/ggerganov)
3 points
theolivenbaum
3 years ago
discuss
100.
▲
Llama.cpp: Add GPU support to ggml
(github.com/ggerganov)
3 points
mromanuk
3 years ago
discuss
101.
▲
Tweet2Doom: A Twitter bot that plays Doom
(github.com/ggerganov)
3 points
ggerganov
5 years ago
discuss
102.
▲
Kbd-audio – Tools for capturing and analysing keyboard input paired with
(github.com/ggerganov)
3 points
pplonski86
8 years ago
discuss
103.
▲
LLM quantization severely damages model quality and perplexity
(github.com/ggerganov)
2 points
behnamoh
3 years ago
3 comments
104.
▲
Show HN: r2t2 – Transmit data with the PC speaker
(github.com/ggerganov)
2 points
ggerganov
5 years ago
1 comment
105.
▲
Show HN: Using talking buttons and data-over-sound to control devices
(github.com/ggerganov)
2 points
ggerganov
5 years ago
1 comment
106.
▲
Show HN: Waver – Messaging Through Sound
(github.com/ggerganov)
2 points
ggerganov
5 years ago
1 comment
107.
▲
Rust macro to generate AI code at compile-time
(github.com/germangb)
2 points
michidk
5 months ago
discuss
108.
▲
A Transformer-based model predicting the articles of German nouns
(github.com/dominik3141)
2 points
jimmy76615
a year ago
discuss
109.
▲
Llama.vim: Plugin for Neovim
(github.com/ggerganov)
2 points
mariuz
2 years ago
discuss
110.
▲
Llama.vim: Plugin for Neovim
(github.com/ggerganov)
2 points
ibobev
2 years ago
discuss
111.
▲
Attention and final logit soft-capping, update scaling factor to Gemma2
(github.com/ggerganov)
2 points
tosh
2 years ago
discuss
112.
▲
ggml: Add Flash Attention
(github.com/ggerganov)
2 points
tosh
2 years ago
discuss
113.
▲
llama.cpp bfloat16 support
(github.com/ggerganov)
2 points
indigodaddy
2 years ago
discuss
114.
▲
Llama.cpp: Mac Prebuilds
(github.com/ggerganov)
2 points
tosh
2 years ago
discuss
115.
▲
DigesterBot: A Telegram bot to help you study
(github.com/german94)
2 points
gpinzon94
2 years ago
discuss
116.
▲
Llama.cpp incoming backends: Vulkan, Kompute, SYCL
(github.com/ggerganov)
2 points
irusensei
2 years ago
discuss
117.
▲
Llama.cpp: Self-Extend Support
(github.com/ggerganov)
2 points
tosh
2 years ago
discuss
118.
▲
GGUF File Format
(github.com/ggerganov)
2 points
warkanlock
2 years ago
discuss
119.
▲
K-Quants
(github.com/ggerganov)
2 points
tosh
2 years ago
discuss
120.
▲
Show HN: Modern C++ implementations of a words counter with benchmarks
(github.com/germandiagogomez)
2 points
germandiago
2 years ago
discuss