Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
781.
Llama 3.1 is available for home AI clusters! Run 8B model with the full context (github.com/b4rtaz)
4 points
b4rtazz
2 years ago
discuss
782.
Distributed Llama (github.com/b4rtaz)
4 points
lagniappe
2 years ago
discuss
783.
Directly run and investigate Llama models locally with only PyTorch (github.com/anordin95)
3 points
anordin95
2 years ago
1 comment
784.
Grinder12: 0.96-Bit Lossless Streaming KV-Cache (16.55x VRAM Savings (github.com/ggml-org)
3 points
AMICLLC
a month ago
discuss
785.
AMD teams contributing to the llama.cpp codebase (github.com/ggml-org)
3 points
gzer0
10 months ago
discuss
786.
Llamafile 0.8.6 CPU Benchmark (github.com/Mozilla-Ocho)
3 points
tosh
2 years ago
discuss
787.
Llamafile 0.8.2 Release with Embedding Subcmd in CLI and Performance Boost (github.com/Mozilla-Ocho)
3 points
lijunhao
2 years ago
discuss
788.
Go Bindings for LLaMa.cpp (github.com/matthewrennie)
3 points
matthewrennie
3 years ago
discuss
789.
A better Llama-CLI help doc (github.com/ggml-org)
2 points
dcreater
9 months ago
1 comment
790.
Inference Llama models in one file of pure C for Win98 (github.com/exo-explore)
2 points
mastar2323
a year ago
1 comment
791.
LlamaSim – Simulate political polling with LLMs (github.com/jw-source)
2 points
jw12
2 years ago
1 comment
792.
Fun project that makes a llama.cpp server LLM chat interface with Htmx and Rust (github.com/richardanaya)
2 points
richardanaya
2 years ago
1 comment
793.
Llama.cpp b9180: MTP support landed (github.com/ggml-org)
2 points
usagisushi
21 days ago
discuss
794.
LlamaBarn: A cosy home for your LLMs (github.com/ggml-org)
2 points
tosh
4 months ago
discuss
795.
LlamaBarn – automatically configure models based on your Mac's hardware (github.com/ggml-org)
2 points
smoser
7 months ago
discuss
796.
Guide: Running GPT-OSS with Llama.cpp (github.com/ggml-org)
2 points
homarp
10 months ago
discuss
797.
Node-Llama-cpp – Run AI models locally on your machine with Node.js (github.com/withcatai)
2 points
javatuts
a year ago
discuss
798.
Released llamafile 0.8.13 with gemma2, new whisper and Stable Diffusion CLI (github.com/Mozilla-Ocho)
2 points
mseri
2 years ago
discuss
799.
What do you check first in PR review? Help shape our AI Code review tool (github.com/JetXu-LLM)
1 point
Jet_Xu
a year ago
1 comment
800.
Llama and Spec: MTP Support (github.com/ggml-org)
1 point
jhoho
a month ago
discuss
801.
QuantumLeap: 2.3× faster MoE inference with intelligent expert caching (github.com/MartinCrespoC)
1 point
ikharoz
2 months ago
discuss
802.
Llamafile (github.com/Mozilla-Ocho)
1 point
brundolf
a year ago
discuss
803.
Show HN: I Used Llama-70B Logprobs for Better, Cheaper and Faster Chunking (github.com/ZeroEntropy-AI)
1 point
ghita_
2 years ago
discuss
804.
CPU beating GPU in token generation speed (github.com/ikawrakow)
1 point
bratao
2 years ago
discuss
805.
Distributed Grok-1 (314B) (github.com/b4rtaz)
1 point
b4rtazz
2 years ago
discuss
806.
LLaMA2-Accessory: An Open-Source Toolkit for LLM Development (github.com/Alpha-VLLM)
1 point
johnsutor
2 years ago
discuss
807.
LLaMA-VID: An Image Is Worth 2 Tokens in LLM (github.com/dvlab-research)
1 point
turrini
3 years ago
discuss
808.
Show HN: WhatsApp-Llama: A clone of yourself from your WhatsApp conversations (github.com/Ads-cmu)
124 points
advaith08
3 years ago
51 comments
809.
OpenLLaMA: An Open Reproduction of LLaMA (github.com/openlm-research)
484 points
sadiq
3 years ago
180 comments
810.
OpenLLaMA to train beyond 1T tokens (github.com/openlm-research)
2 points
tosh
3 years ago
discuss
More