Search: github.com/ollama | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

781.

Llama 3.1 is available for home AI clusters! Run 8B model with the full context (github.com/b4rtaz)

4 points

2 years ago

782.

Distributed Llama (github.com/b4rtaz)

4 points

2 years ago

783.

Directly run and investigate Llama models locally with only PyTorch (github.com/anordin95)

3 points

2 years ago

784.

Grinder12: 0.96-Bit Lossless Streaming KV-Cache (16.55x VRAM Savings) (github.com/ggml-org)

3 points

a month ago

785.

AMD teams contributing to the llama.cpp codebase (github.com/ggml-org)

3 points

10 months ago

786.

Llamafile 0.8.6 CPU Benchmark (github.com/Mozilla-Ocho)

3 points

2 years ago

787.

Llamafile 0.8.2 Release with Embedding Subcmd in CLI and Performance Boost (github.com/Mozilla-Ocho)

3 points

2 years ago

788.

Go Bindings for LLaMa.cpp (github.com/matthewrennie)

3 points

3 years ago

789.

A better Llama-CLI help doc (github.com/ggml-org)

2 points

9 months ago

790.

Inference Llama models in one file of pure C for Win98 (github.com/exo-explore)

2 points

a year ago

791.

LlamaSim – Simulate political polling with LLMs (github.com/jw-source)

2 points

2 years ago

792.

Fun project that makes a llama.cpp server LLM chat interface with Htmx and Rust (github.com/richardanaya)

2 points

2 years ago

793.

Llama.cpp b9180: MTP support landed (github.com/ggml-org)

2 points

21 days ago

794.

LlamaBarn: A cosy home for your LLMs (github.com/ggml-org)

2 points

4 months ago

795.

LlamaBarn – automatically configure models based on your Mac's hardware (github.com/ggml-org)

2 points

7 months ago

796.

Guide: Running GPT-OSS with Llama.cpp (github.com/ggml-org)

2 points

10 months ago

797.

Node-Llama-cpp – Run AI models locally on your machine with Node.js (github.com/withcatai)

2 points

a year ago

798.

Released llamafile 0.8.13 with gemma2, new whisper and Stable Diffusion CLI (github.com/Mozilla-Ocho)

2 points

2 years ago

799.

What do you check first in PR review? Help shape our AI Code review tool (github.com/JetXu-LLM)

1 point

a year ago

800.

Llama and Spec: MTP Support (github.com/ggml-org)

1 point

a month ago

801.

QuantumLeap: 2.3× faster MoE inference with intelligent expert caching (github.com/MartinCrespoC)

1 point

2 months ago

802.

Llamafile (github.com/Mozilla-Ocho)

1 point

a year ago

803.

Show HN: I Used Llama-70B Logprobs for Better, Cheaper and Faster Chunking (github.com/ZeroEntropy-AI)

1 point

2 years ago

804.

CPU beating GPU in token generation speed (github.com/ikawrakow)

1 point

2 years ago

805.

Distributed Grok-1 (314B) (github.com/b4rtaz)

1 point

2 years ago

806.

LLaMA2-Accessory: An Open-Source Toolkit for LLM Development (github.com/Alpha-VLLM)

1 point

2 years ago

807.

LLaMA-VID: An Image Is Worth 2 Tokens in LLM (github.com/dvlab-research)

1 point

3 years ago

808.

Show HN: WhatsApp-Llama: A clone of yourself from your WhatsApp conversations (github.com/Ads-cmu)

124 points

3 years ago

809.

OpenLLaMA: An Open Reproduction of LLaMA (github.com/openlm-research)

484 points

3 years ago

810.

OpenLLaMA to train beyond 1T tokens (github.com/openlm-research)

2 points

3 years ago