Search: github.com/inferrd | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

121.

Hypura – A storage-tier-aware LLM inference scheduler for Apple Silicon (github.com/t8)

221 points

2 months ago

122.

Microsoft BitNet: inference framework for 1-bit LLMs (github.com/microsoft)

173 points

2 years ago

123.

Show HN: Apple II clock using interrupts from physical pendulum clock (github.com/wkjagt)

157 points

2 years ago

124.

Launch HN: Cactus (YC S25) – AI inference on smartphones (github.com/cactus-compute)

123 points

9 months ago

125.

Parakeet.cpp – Parakeet ASR inference in pure C++ with Metal GPU acceleration (github.com/Frikallo)

114 points

3 months ago

126.

Open source inference time compute example from HuggingFace (github.com/huggingface)

88 points

a year ago

127.

Fast GPT-2 inference written in Fortran (github.com/certik)

83 points

3 years ago

128.

Show HN: ZSE – Open-source LLM inference engine with 3.9s cold starts (github.com/Zyora-Dev)

58 points

3 months ago

129.

GPU-accelerated Llama3.java inference in pure Java using TornadoVM (github.com/beehive-lab)

48 points

a year ago

130.

Show HN: LLM, a Rust Crate/CLI for CPU Inference of LLMs (LLaMA, GPT-NeoX, etc.) (github.com/rustformers)

45 points

3 years ago

131.

Show HN: React-hint – 150LoC Tooltip Component for React, Preact and Inferno (github.com/slmgc)

37 points

9 years ago

132.

DoWhy is a Python library for causal inference (github.com/py-why)

37 points

4 years ago

133.

ZML - High performance AI inference stack (github.com/zml)

36 points

2 years ago

134.

DeepSeek-V3/R1 Inference System Overview (github.com/deepseek-ai)

27 points

a year ago

135.

Node9 – Inferno-Like Hosted OS Using LuaJIT (github.com/jvburnes)

23 points

11 years ago

136.

RTNeural – real-time neural network inferencing engine (github.com/jatinchowdhury18)

16 points

4 years ago

137.

RDMA-Powered Distributed Cache for Fast AI Training and Inference (github.com/blackbird-io)

16 points

9 months ago

138.

CLIP inference in plain C/C++ with no extra dependencies (github.com/monatis)

12 points

3 years ago

139.

DeepCamera: Local inference engine, Home Assistant intrusion detection AI camera (github.com/SharpAI)

12 points

4 years ago

140.

Show HN: N0x – LLM inference, agents, RAG, Python exec in browser, no back end (n0xth.vercel.app)

9 points

3 months ago

141.

BharatMLStack – Realtime Inference, MLOps (github.com/Meesho)

8 points

a year ago

142.

Show HN: Bhumi–OSS Python Library w Rust Underhead for 2.5x Faster LLM Inference (bhumi.trilok.ai)

8 points

a year ago

143.

gg: "M2 Ultra is the absolute best personal LLM inference node you can buy." (github.com/ggerganov)

8 points

3 years ago

144.

Alpa: Auto-parallelizing large model training and inference (by UC Berkeley) (github.com/alpa-projects)

7 points

4 years ago

145.

Show HN: Secure XGBoost training and inference on encrypted data (github.com/mc2-project)

7 points

6 years ago

146.

Show HN: Composable middleware for LLM inference Optimization Passes (github.com/liquidos-ai)

7 points

3 months ago

147.

Llama.cpp: Deterministic Inference Mode (CUDA): RMSNorm, MatMul, Attention (github.com/ggml-org)

6 points

9 months ago

148.

Rust+OpenCL+AVX2 implementation of LLaMA inference code (github.com/Noeda)

6 points

3 years ago

149.

Ask HN: What is the best tool to infer data type of tabular data?

5 points

5 years ago

150.

I implemented CLIP inference in plain C/C++ (github.com/monatis)

5 points

3 years ago