Heykuki News
Top
New
Best
Ask
Show
Jobs
691.
Show HN: I'm tired of my LLM bullshitting. So I fixed it
5 points
BobbyLLM
5 months ago
9 comments
692.
Show HN: Dive into Transformers and LLM World – Llama 3.1 in Go, Step by Step
(github.com/adalkiran)
4 points
adalkiran
2 years ago
2 comments
693.
Show HN: I built BakLLaVA and llama.cpp demo and it went viral on X
4 points
Obertr
3 years ago
1 comment
694.
Ask HN: Why the LLaMA code base is so short
3 points
kureikain
3 years ago
2 comments
695.
Fixed a llama.cpp bug silently disabling Vulkan GPU on all 32-bit ARM devices
3 points
perinban
2 months ago
discuss
696.
Show HN: LLaMA Nuts and Bolts, A holistic way of understanding how LLMs run
(github.com/adalkiran)
3 points
adalkiran
2 years ago
discuss
697.
Apple predicted the rise of local LLMs, hence the M2 Ultra
2 points
behnamoh
3 years ago
3 comments
698.
Show HN: LlamaFarm – Working on binary AI Project deployment – (early preview)
(github.com/llama-farm)
2 points
rgthelen
a year ago
1 comment
699.
Show HN: I Built a GitHub Action to Monitor LlamaIndex Performance
2 points
Ephil012
2 years ago
1 comment
700.
Show HN: LlamaPReview – AI code reviewer trusted by 2000 repos, 40%+ effective
(jetxu-llm.github.io)
2 points
Jet_Xu
2 years ago
discuss
701.
Ask HN: Are We Approaching Code Reviews Wrong?
2 points
Jet_Xu
2 years ago
discuss
702.
Ask HN: Do you know any new llama2.c implementations not mentioned in the repo
1 point
mikepapadim
2 years ago
1 comment
703.
Show HN: Running LLM on smartwatch – found llama.cpp loading model twice in RAM
1 point
perinban
2 months ago
discuss
704.
Llama.cpp 30B runs with only 6GB of RAM now
(github.com/ggerganov)
1311 points
msoad
3 years ago
414 comments
705.
Llama3 implemented from scratch
(github.com/naklecha)
1041 points
Hadi7546
2 years ago
269 comments
706.
Llama.cpp: Port of Facebook's LLaMA model in C/C++, with Apple Silicon support
(github.com/ggerganov)
989 points
mrtksn
3 years ago
284 comments
707.
Facebook LLAMA is being openly distributed via torrents
(github.com/facebookresearch)
909 points
micro_charm
3 years ago
693 comments
708.
Llama.cpp: Full CUDA GPU Acceleration
(github.com/ggerganov)
728 points
gzer0
3 years ago
310 comments
709.
Llama2.c: Inference llama 2 in one file of pure C
(github.com/karpathy)
707 points
anjneymidha
3 years ago
165 comments
710.
Show HN: Llama 3.2 Interpretability with Sparse Autoencoders
(github.com/PaulPauls)
579 points
PaulPauls
2 years ago
99 comments
711.
Llama: Add grammar-based sampling
(github.com/ggerganov)
417 points
davepeck
3 years ago
105 comments
712.
New exponent functions that make SiLU and SoftMax 2x faster, at full accuracy
(github.com/ggerganov)
382 points
weinzierl
2 years ago
72 comments
713.
Show HN: Llama-dl – high-speed download of LLaMA, Facebook's 65B GPT model
(github.com/shawwn)
343 points
sillysaurusx
3 years ago
130 comments
714.
LLama.cpp now has a web interface
(github.com/ggerganov)
328 points
xal
3 years ago
49 comments
715.
NotebookLlama: An open source version of NotebookLM
(github.com/meta-llama)
322 points
bibinmohan
2 years ago
72 comments
716.
Llama 2 Everywhere (L2E): Standalone, Binary Portable, Bootable Llama 2
(github.com/trholding)
320 points
jjwiseman
3 years ago
55 comments
717.
Llama 3.1 Omni Model
(github.com/ictnlp)
304 points
taikon
2 years ago
41 comments
718.
M2 Ultra can run 128 streams of Llama 2 7B in parallel
(github.com/ggerganov)
268 points
behnamoh
3 years ago
173 comments
719.
Fork of Facebook’s LLaMa model to run on CPU
(github.com/markasoftware)
246 points
__anon-2023__
3 years ago
170 comments
720.
Show HN: LlamaGym – fine-tune LLM agents with online reinforcement learning
(github.com/KhoomeiK)
239 points
KhoomeiK
2 years ago
28 comments