Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
811.
Llama.cpp now supports tool calling (OpenAI-compatible) (github.com/ggerganov)
3 points
ochafik
a year ago
1 comment
812.
GGML Flash Attention support merged into llama.cpp (github.com/ggerganov)
3 points
smcleod
2 years ago
1 comment
813.
LlamaStash – Zero-overhead, terminal-native llama.cpp launcher (github.com/llamastash)
3 points
deepu105
5 days ago
discuss
814.
ParseBench: Document Parsing Benchmark for AI Agents (github.com/run-llama)
3 points
firasd
2 months ago
discuss
815.
To Use Snapdragon NPU, HTP Ops Libraries Must Be Signed with Trusted Certs (github.com/qualcomm)
3 points
WhereIsTheTruth
4 months ago
discuss
816.
Disclaimer: I am not a webdev, this PR was vibe coded (github.com/olegshulyakov)
3 points
WhereIsTheTruth
9 months ago
discuss
817.
Show HN: Worflows.py, the best way to build agents (github.com/run-llama)
3 points
pierre
a year ago
discuss
818.
A tool for migrating and optimizing prompts from other LLMs to Llama (github.com/meta-llama)
3 points
yawnxyz
a year ago
discuss
819.
Open source Claude Artifacts – built with Llama 3.1 405B (github.com/Nutlope)
3 points
sabrina_ramonov
2 years ago
discuss
820.
Distributed LLM Inference with Llama.cpp (github.com/ggerganov)
3 points
tosh
2 years ago
discuss
821.
Practical Llama 3 inference implemented in a single Java file (github.com/mukel)
3 points
simonpure
2 years ago
discuss
822.
Meta Llama 3 GitHub (github.com/meta-llama)
3 points
adif_sgaid
2 years ago
discuss
823.
LlamaIndex is a data framework for your LLM applications (github.com/run-llama)
3 points
Brajeshwar
2 years ago
discuss
824.
Control Vectors have been added to llama.cpp (github.com/ggerganov)
3 points
Der_Einzige
2 years ago
discuss
825.
Llama.cpp supports distributed inference across machines on a local network (github.com/ggerganov)
3 points
behnamoh
2 years ago
discuss
826.
Llama-Terminal-Completion (github.com/adammpkins)
3 points
tosh
2 years ago
discuss
827.
CUDA: Faster Mixtral Prompt Processing (github.com/ggerganov)
3 points
tosh
2 years ago
discuss
828.
Llama.cpp: Support for Phi-2 (github.com/ggerganov)
3 points
tosh
2 years ago
discuss
829.
QMoE Support for Mixtral (github.com/ggerganov)
3 points
tosh
2 years ago
discuss
830.
Karpathy removes llama licence from llama2.c (github.com/karpathy)
3 points
orwellg1984
3 years ago
discuss
831.
A Clojure Wrapper for Llama.cpp (github.com/phronmophobic)
3 points
simonpure
3 years ago
discuss
832.
Llama Recipes (github.com/facebookresearch)
3 points
atg_abhishek
3 years ago
discuss
833.
Llama 2: poc for running 70B on CPU (github.com/ggerganov)
3 points
tosh
3 years ago
discuss
834.
Inference at the edge: Efficient transformer model inference on-device (github.com/ggerganov)
3 points
lioeters
3 years ago
discuss
835.
K-Quants (github.com/ggerganov)
3 points
tosh
3 years ago
discuss
836.
Suddenly 403 Forbidden (LLaMA) (github.com/facebookresearch)
3 points
grae_QED
3 years ago
discuss
837.
Connect your LLM with external data (github.com/jerryjliu)
3 points
snork_alt
3 years ago
discuss
838.
Llama.cpp: Add GPU support to ggml (github.com/ggerganov)
3 points
mromanuk
3 years ago
discuss
839.
LLaMA-Adapter: Efficient Fine-Tuning of LLaMA (github.com/ZrrSkywalker)
3 points
GaggiX
3 years ago
discuss
840.
Show HN: LlamaBot – Turn any Rails app into an autonomous AI agent in 2 minutes (github.com/KodyKendall)
2 points
kody_06
a year ago
3 comments
More