Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
811.
▲
Llama.cpp now supports tool calling (OpenAI-compatible)
(github.com/ggerganov)
3 points
ochafik
a year ago
1 comment
812.
▲
GGML Flash Attention support merged into llama.cpp
(github.com/ggerganov)
3 points
smcleod
2 years ago
1 comment
813.
▲
LlamaStash – Zero-overhead, terminal-native llama.cpp launcher
(github.com/llamastash)
3 points
deepu105
5 days ago
discuss
814.
▲
ParseBench: Document Parsing Benchmark for AI Agents
(github.com/run-llama)
3 points
firasd
2 months ago
discuss
815.
▲
To Use Snapdragon NPU, HTP Ops Libraries Must Be Signed with Trusted Certs
(github.com/qualcomm)
3 points
WhereIsTheTruth
4 months ago
discuss
816.
▲
Disclaimer: I am not a webdev, this PR was vibe coded
(github.com/olegshulyakov)
3 points
WhereIsTheTruth
9 months ago
discuss
817.
▲
Show HN: Worflows.py, the best way to build agents
(github.com/run-llama)
3 points
pierre
a year ago
discuss
818.
▲
A tool for migrating and optimizing prompts from other LLMs to Llama
(github.com/meta-llama)
3 points
yawnxyz
a year ago
discuss
819.
▲
Open source Claude Artifacts – built with Llama 3.1 405B
(github.com/Nutlope)
3 points
sabrina_ramonov
2 years ago
discuss
820.
▲
Distributed LLM Inference with Llama.cpp
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
821.
▲
Practical Llama 3 inference implemented in a single Java file
(github.com/mukel)
3 points
simonpure
2 years ago
discuss
822.
▲
Meta Llama 3 GitHub
(github.com/meta-llama)
3 points
adif_sgaid
2 years ago
discuss
823.
▲
LlamaIndex is a data framework for your LLM applications
(github.com/run-llama)
3 points
Brajeshwar
2 years ago
discuss
824.
▲
Control Vectors have been added to llama.cpp
(github.com/ggerganov)
3 points
Der_Einzige
2 years ago
discuss
825.
▲
Llama.cpp supports distributed inference across machines on a local network
(github.com/ggerganov)
3 points
behnamoh
2 years ago
discuss
826.
▲
Llama-Terminal-Completion
(github.com/adammpkins)
3 points
tosh
2 years ago
discuss
827.
▲
CUDA: Faster Mixtral Prompt Processing
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
828.
▲
Llama.cpp: Support for Phi-2
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
829.
▲
QMoE Support for Mixtral
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
830.
▲
Karpathy removes llama licence from llama2.c
(github.com/karpathy)
3 points
orwellg1984
3 years ago
discuss
831.
▲
A Clojure Wrapper for Llama.cpp
(github.com/phronmophobic)
3 points
simonpure
3 years ago
discuss
832.
▲
Llama Recipes
(github.com/facebookresearch)
3 points
atg_abhishek
3 years ago
discuss
833.
▲
Llama 2: poc for running 70B on CPU
(github.com/ggerganov)
3 points
tosh
3 years ago
discuss
834.
▲
Inference at the edge: Efficient transformer model inference on-device
(github.com/ggerganov)
3 points
lioeters
3 years ago
discuss
835.
▲
K-Quants
(github.com/ggerganov)
3 points
tosh
3 years ago
discuss
836.
▲
Suddenly 403 Forbidden (LLaMA)
(github.com/facebookresearch)
3 points
grae_QED
3 years ago
discuss
837.
▲
Connect your LLM with external data
(github.com/jerryjliu)
3 points
snork_alt
3 years ago
discuss
838.
▲
Llama.cpp: Add GPU support to ggml
(github.com/ggerganov)
3 points
mromanuk
3 years ago
discuss
839.
▲
LLaMA-Adapter: Efficient Fine-Tuning of LLaMA
(github.com/ZrrSkywalker)
3 points
GaggiX
3 years ago
discuss
840.
▲
Show HN: LlamaBot – Turn any Rails app into an autonomous AI agent in 2 minutes
(github.com/KodyKendall)
2 points
kody_06
a year ago
3 comments
More