Heykuki News

Top New Best Ask Show Jobs
691.
Show HN: I'm tired of my LLM bullshitting. So I fixed it
5 points
BobbyLLM
5 months ago
9 comments
692.
Show HN: Dive into Transformers and LLM World – Llama 3.1 in Go, Step by Step (github.com/adalkiran)
4 points
adalkiran
2 years ago
2 comments
693.
Show HN: I built BakLLaVA and llama.cpp demo and it went viral on X
4 points
Obertr
3 years ago
1 comment
694.
Ask HN: Why the LLaMA code base is so short
3 points
kureikain
3 years ago
2 comments
695.
Fixed a llama.cpp bug silently disabling Vulkan GPU on all 32-bit ARM devices
3 points
perinban
2 months ago
discuss
696.
Show HN: LLaMA Nuts and Bolts, A holistic way of understanding how LLMs run (github.com/adalkiran)
3 points
adalkiran
2 years ago
discuss
697.
Apple predicted the rise of local LLMs, hence the M2 Ultra
2 points
behnamoh
3 years ago
3 comments
698.
Show HN: LlamaFarm – Working on binary AI Project deployment – (early preview) (github.com/llama-farm)
2 points
rgthelen
a year ago
1 comment
699.
Show HN: I Built a GitHub Action to Monitor LlamaIndex Performance
2 points
Ephil012
2 years ago
1 comment
700.
Show HN: LlamaPReview – AI code reviewer trusted by 2000 repos, 40%+ effective (jetxu-llm.github.io)
2 points
Jet_Xu
2 years ago
discuss
701.
Ask HN: Are We Approaching Code Reviews Wrong?
2 points
Jet_Xu
2 years ago
discuss
702.
Ask HN: Do you know any new llama2.c implementations not mentioned in the repo
1 point
mikepapadim
2 years ago
1 comment
703.
Show HN: Running LLM on smartwatch – found llama.cpp loading model twice in RAM
1 point
perinban
2 months ago
discuss
704.
Llama.cpp 30B runs with only 6GB of RAM now (github.com/ggerganov)
1311 points
msoad
3 years ago
414 comments
705.
Llama3 implemented from scratch (github.com/naklecha)
1041 points
Hadi7546
2 years ago
269 comments
706.
Llama.cpp: Port of Facebook's LLaMA model in C/C++, with Apple Silicon support (github.com/ggerganov)
989 points
mrtksn
3 years ago
284 comments
707.
Facebook LLAMA is being openly distributed via torrents (github.com/facebookresearch)
909 points
micro_charm
3 years ago
693 comments
708.
Llama.cpp: Full CUDA GPU Acceleration (github.com/ggerganov)
728 points
gzer0
3 years ago
310 comments
709.
Llama2.c: Inference llama 2 in one file of pure C (github.com/karpathy)
707 points
anjneymidha
3 years ago
165 comments
710.
Show HN: Llama 3.2 Interpretability with Sparse Autoencoders (github.com/PaulPauls)
579 points
PaulPauls
2 years ago
99 comments
711.
Llama: Add grammar-based sampling (github.com/ggerganov)
417 points
davepeck
3 years ago
105 comments
712.
New exponent functions that make SiLU and SoftMax 2x faster, at full accuracy (github.com/ggerganov)
382 points
weinzierl
2 years ago
72 comments
713.
Show HN: Llama-dl – high-speed download of LLaMA, Facebook's 65B GPT model (github.com/shawwn)
343 points
sillysaurusx
3 years ago
130 comments
714.
LLama.cpp now has a web interface (github.com/ggerganov)
328 points
xal
3 years ago
49 comments
715.
NotebookLlama: An open source version of NotebookLM (github.com/meta-llama)
322 points
bibinmohan
2 years ago
72 comments
716.
Llama 2 Everywhere (L2E): Standalone, Binary Portable, Bootable Llama 2 (github.com/trholding)
320 points
jjwiseman
3 years ago
55 comments
717.
Llama 3.1 Omni Model (github.com/ictnlp)
304 points
taikon
2 years ago
41 comments
718.
M2 Ultra can run 128 streams of Llama 2 7B in parallel (github.com/ggerganov)
268 points
behnamoh
3 years ago
173 comments
719.
Fork of Facebook’s LLaMa model to run on CPU (github.com/markasoftware)
246 points
__anon-2023__
3 years ago
170 comments
720.
Show HN: LlamaGym – fine-tune LLM agents with online reinforcement learning (github.com/KhoomeiK)
239 points
KhoomeiK
2 years ago
28 comments