Search: github.com/ollama | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

751.

30B model now needs only 5.8GB of RAM? How? (github.com/ggerganov)

31 points

3 years ago

752.

WIP Llama.cpp Vulkan Implementations (github.com/ggerganov)

24 points

3 years ago

753.

Gemma Is Added to Llama.cpp (github.com/ggerganov)

17 points

2 years ago

754.

Speculative: PoC for speeding-up inference via speculative sampling by ggerganov (github.com/ggerganov)

16 points

3 years ago

755.

Show HN: LiteParse v2, now in Rust 100x faster (github.com/run-llama)

15 points

10 days ago

756.

An open source Claude Artifacts – generate small apps with one prompt (github.com/Nutlope)

12 points

2 years ago

757.

Show HN: LiteParse, a fast open-source document parser for AI agents (github.com/run-llama)

12 points

3 months ago

758.

Llama 3.2 Release (github.com/meta-llama)

12 points

2 years ago

759.

Show HN: Llama2.ipynb (github.com/rbitr)

12 points

3 years ago

760.

Show HN: Chaos Llama – Chaos Monkey Built on AWS Lambda (github.com/hassy)

12 points

10 years ago

761.

Grok-1 Support for Llama.cpp (github.com/ggerganov)

11 points

2 years ago

762.

Liteparse (github.com/run-llama)

9 points

2 months ago

763.

Show HN: LlamaChat - interact with your favourite LLaMA models on macOS (github.com/alexrozanski)

9 points

3 years ago

764.

Show HN: LlamaExtract, a tool to automatically extract schema from documents (github.com/run-llama)

8 points

2 years ago

765.

Llama2.c64: a port of llama2.c to the Commodore C64 (github.com/ytmytm)

8 points

a year ago

766.

Llama 3.1: 405B, the largest openly available model released (github.com/meta-llama)

8 points

2 years ago

767.

gg: "M2 Ultra is the absolute best personal LLM inference node you can buy." (github.com/ggerganov)

8 points

3 years ago

768.

LLaMA-rs: a Rust port of llama.cpp for fast LLaMA inference on CPU (github.com/setzer22)

8 points

3 years ago

769.

Llama2 + Haystack on Colab (github.com/anakin87)

7 points

3 years ago

770.

Ggml 2x WASM Speed with SIMD Optimization Using 99% DeepSeek-R1-Generated Code (github.com/ggerganov)

7 points

a year ago

771.

Llama as a System (github.com/meta-llama)

7 points

2 years ago

772.

Distributed LLama3 Inference (github.com/evilsocket)

7 points

2 years ago

773.

Llama.cpp Working on Support for Llama3 (github.com/ggerganov)

7 points

2 years ago

774.

Show HN: secinsights.ai – An open-source full-stack app using LlamaIndex (github.com/run-llama)

7 points

3 years ago

775.

Karpathy's llama2.c ported to pure Python (github.com/tairov)

6 points

3 years ago

776.

DeepSeek-R1 speeds up llama.cpp code by x2 (github.com/ggerganov)

6 points

a year ago

777.

L2E llama2.c running in a PDF in a Schrödinger PNG [pdf] (github.com/trholding)

6 points

a year ago

778.

Show HN: LlaMaKey – One master key for all cloud LLM/GenAI APIs (github.com/TexteaInc)

6 points

2 years ago

779.

Llama2.c Running in a PDF (github.com/trholding)

6 points

a year ago

780.

llama.cpp now supports StarCoder model series (github.com/ggerganov)

6 points

3 years ago