Search: github.com/ollama | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

691.

Show HN: I'm tired of my LLM bullshitting. So I fixed it

5 points

5 months ago

692.

Show HN: Dive into Transformers and LLM World – Llama 3.1 in Go, Step by Step (github.com/adalkiran)

4 points

2 years ago

693.

Show HN: I built BakLLaVA and llama.cpp demo and it went viral on X

4 points

3 years ago

694.

Ask HN: Why the LLaMA code base is so short

3 points

3 years ago

695.

Fixed a llama.cpp bug silently disabling Vulkan GPU on all 32-bit ARM devices

3 points

2 months ago

696.

Show HN: LLaMA Nuts and Bolts, A holistic way of understanding how LLMs run (github.com/adalkiran)

3 points

2 years ago

697.

Apple predicted the rise of local LLMs, hence the M2 Ultra

2 points

3 years ago

698.

Show HN: LlamaFarm – Working on binary AI Project deployment – (early preview) (github.com/llama-farm)

2 points

a year ago

699.

Show HN: I Built a GitHub Action to Monitor LlamaIndex Performance

2 points

2 years ago

700.

Show HN: LlamaPReview – AI code reviewer trusted by 2000 repos, 40%+ effective (jetxu-llm.github.io)

2 points

2 years ago

701.

Ask HN: Are We Approaching Code Reviews Wrong?

2 points

2 years ago

702.

Ask HN: Do you know any new llama2.c implementations not mentioned in the repo

1 point

2 years ago

703.

Show HN: Running LLM on smartwatch – found llama.cpp loading model twice in RAM

1 point

2 months ago

704.

Llama.cpp 30B runs with only 6GB of RAM now (github.com/ggerganov)

1311 points

3 years ago

705.

Llama3 implemented from scratch (github.com/naklecha)

1041 points

2 years ago

706.

Llama.cpp: Port of Facebook's LLaMA model in C/C++, with Apple Silicon support (github.com/ggerganov)

989 points

3 years ago

707.

Facebook LLAMA is being openly distributed via torrents (github.com/facebookresearch)

909 points

3 years ago

708.

Llama.cpp: Full CUDA GPU Acceleration (github.com/ggerganov)

728 points

3 years ago

709.

Llama2.c: Inference llama 2 in one file of pure C (github.com/karpathy)

707 points

3 years ago

710.

Show HN: Llama 3.2 Interpretability with Sparse Autoencoders (github.com/PaulPauls)

579 points

2 years ago

711.

Llama: Add grammar-based sampling (github.com/ggerganov)

417 points

3 years ago

712.

New exponent functions that make SiLU and SoftMax 2x faster, at full accuracy (github.com/ggerganov)

382 points

2 years ago

713.

Show HN: Llama-dl – high-speed download of LLaMA, Facebook's 65B GPT model (github.com/shawwn)

343 points

3 years ago

714.

LLama.cpp now has a web interface (github.com/ggerganov)

328 points

3 years ago

715.

NotebookLlama: An open source version of NotebookLM (github.com/meta-llama)

322 points

2 years ago

716.

Llama 2 Everywhere (L2E): Standalone, Binary Portable, Bootable Llama 2 (github.com/trholding)

320 points

3 years ago

717.

Llama 3.1 Omni Model (github.com/ictnlp)

304 points

2 years ago

718.

M2 Ultra can run 128 streams of Llama 2 7B in parallel (github.com/ggerganov)

268 points

3 years ago

719.

Fork of Facebook’s LLaMa model to run on CPU (github.com/markasoftware)

246 points

3 years ago

720.

Show HN: LlamaGym – fine-tune LLM agents with online reinforcement learning (github.com/KhoomeiK)

239 points

2 years ago