Search: github.com/tnfe | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

181.

New open-source model with 8k context runs on CPU, outperforms GPT-3 (github.com/abacaj)

5 points

3 years ago

182.

Accelerating LLM Serving with Speculative Inference and Token Tree Verification (github.com/flexflow)

3 points

3 years ago

183.

Hugging Face reverts the license back to Apache 2.0 (github.com/huggingface)

3 points

2 years ago

184.

Fast inference for text models using Rust (github.com/huggingface)

3 points

3 years ago

185.

MPT 30B inference code using CPU (github.com/abacaj)

3 points

3 years ago

186.

Text to Speech CUDA Programming (github.com/Saurabh-29)

3 points

7 years ago

187.

Bayesian inference and forecast of Covid-19 in Germany by a Max-Planck-Institute (github.com/Priesemann-Group)

2 points

6 years ago

188.

Diffbot GraphRAG LLM (github.com/diffbot)

2 points

a year ago

189.

GPT4ALL Python3 Local LLM Conversation Recorder (github.com/13alvone)

2 points

3 years ago

190.

Show HN: Bert NLP inference in browser using WebAssembly-SIMD (github.com/jobergum)

2 points

4 years ago

191.

Private Decentralized Inference on Consumer Hardware [pdf] (github.com/Layr-Labs)

1 point

a month ago

192.

Open Source Stable Diffusion with LCM-LoRA (github.com/joshfischer1108)

1 point

joshfischer1108

3 years ago

193.

Private decentralized inference on consumer hardware [pdf] (github.com/Layr-Labs)

1 point

2 months ago

194.

VGGT PyTorch Inference (github.com/ibaiGorordo)

1 point

a year ago

195.

Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model (github.com/cactus-compute)

776 points

25 days ago

196.

Launch HN: Hyprnote (YC S25) – An open-source AI meeting notetaker

270 points

10 months ago

197.

Show HN: Fastify's slow startup is an AJV problem – here's a drop-in fix

2 points

3 months ago

198.

Finished a project mixing GNNs, RL, and operations research (github.com/MehdiZouitine)

1 point

a year ago

199.

Show HN: I built an Image Embedding API inspired by text-embedding-inference (github.com/bernardo-sb)

1 point

a year ago

200.

Show HN: ImageEmbeddingInference – like text-embeddings-inference but for images (github.com/bernardo-sb)

1 point

a year ago

201.

Show HN: Sightline – Shodan-style search for real-world infra using OSM Data (github.com/ni5arga)

26 points

4 months ago

202.

Ask HN: Are you saving inference costs on GPUs at your company

5 points

a year ago

203.

Show HN: Revibing nanochat's inference model in C++ with ggml (github.com/k-ye)

5 points

5 months ago

204.

Show HN: Letting an LLM write robot programs (boesch.dev)

3 points

2 months ago

205.

Show HN: MLX-Ruby – Ruby Bindings for Apple's MLX ML Framework (github.com/skryl)

1 point

4 months ago

206.

Show HN: ReFlow Studio – An offline tool to dub, translate, and censor videos (github.com/ananta-sj)

1 point

5 months ago

207.

Auto-unloading models using __init_subclass__ (Python) (github.com/Vrroom)

1 point

3 years ago

208.

Bookish: math-infested markdown to HTML and latex (github.com/parrt)

1 point

8 years ago

209.

Show HN: Mamba-Chat – A Chat LLM Based on State Space Models (github.com/havenhq)

9 points

2 years ago

210.

Ask HN: Which cloud provider offers AMD MI250/MI300?

2 points

2 years ago