Search: github.com/tnfe | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

331.

MetalChat – Llama Inference for Apple Silicon (github.com/ybubnov)

5 points

4 months ago

332.

Voxtral.c: Voxtral Realtime 4B model inference as a C library (github.com/antirez)

5 points

4 months ago

333.

llama2.zig: Inference Llama 2 in one file of pure Zig (github.com/cgbur)

5 points

6 months ago

334.

T-Mac: Low-bit LLM inference on CPU/NPU with lookup table (github.com/microsoft)

5 points

8 months ago

335.

Show HN: gline-rs – an inference engine for GLiNER models, in Rust (github.com/fbilhaut)

5 points

a year ago

336.

Fast LLM Inference in Rust (github.com/EricLBuehler)

5 points

2 years ago

337.

Fast and hackable PyTorch native transformer inference (github.com/pytorch-labs)

5 points

3 years ago

338.

Lepton: An open-source library (Apache 2.0) for scaling model inference (github.com/leptonai)

5 points

3 years ago

339.

Run LLaMA Inference on CPU, with Rust (github.com/rustformers)

5 points

3 years ago

340.

Three-processor inference on AMD Ryzen AI 300 (github.com/Peterc3-dev)

4 points

2 months ago

341.

LangPatrol: A static analyzer for LLM prompts that catches bugs before inference (github.com/langpatrol)

4 points

6 months ago

342.

Show HN: Inference Mixtral 8x7B in pure Rust (github.com/moritztng)

4 points

2 years ago

343.

Show HN: Ggml.js – Serverless AI Inference in the Browser with WebAssembly (rahuldshetty.github.io)

4 points

3 years ago

344.

TensorSharp: Open-Source Local LLM Inference Engine (github.com/zhongkaifu)

4 points

3 days ago

345.

Train and inference GPT in 243 lines of pure, dependency-free Python by Karpathy (gist.github.com)

4 points

4 months ago

346.

PasLLM: An Object Pascal inference engine for LLM models (github.com/BeRo1985)

4 points

6 months ago

347.

Distributed-Llama: Connect home devices into a cluster for LLM inference (github.com/b4rtaz)

4 points

a year ago

348.

Practical Llama 3 inference in Java (github.com/mukel)

4 points

2 years ago

349.

Llama.cpp speculative sampling: 2x faster inference for large models (github.com/ggerganov)

4 points

3 years ago

350.

Zig GPT-2 inference engine (github.com/EugenHotaj)

4 points

3 years ago

351.

Stable Diffusion inference locally on iOS / macOS using MPSGraph (github.com/mortenjust)

4 points

4 years ago

352.

Pytype checks and infers types for your Python code (github.com/google)

4 points

7 years ago

353.

Inferential database seeding in Clojure (michaeldrogalis.github.com)

4 points

MichaelDrogalis

13 years ago

354.

Show HN: Static-allocation MLP inference in ANSI C using a 2-slot ring buffer (github.com/GiorgosXou)

4 points

9 days ago

355.

Mtplx – 2.24x faster TPS – The native MTP inference engine for Apple Silicon (github.com/youssofal)

4 points

a month ago

356.

Show HN: Open-source GDPR router for LLMs detects PII, forces EU-only inference (github.com/mahadillahm4di-cyber)

4 points

2 months ago

357.

Iris – a C inference pipeline for image synthesis models (github.com/antirez)

4 points

2 months ago

358.

Show HN: Our command line tool to transpile AI Inference from Python to C++ (github.com/muna-ai)

4 points

4 months ago

359.

Show HN: I wrote inference for Qwen3 0.6B in C/CUDA (github.com/asdf93074)

4 points

8 months ago

360.

Show HN: Klartraum, a neural rendering inference engine (github.com/fortmeier)

4 points

a year ago