Search: github.com/vraa | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

91.

Show HN: Salad, a distributed cloud for AI (like Airbnb for GPUs)

15 points

2 years ago

92.

Show HN: KTransformers: 671B DeepSeek-R1 on a Single Machine, 286 tokens/s Prefill (github.com/kvcache-ai)

14 points

a year ago

93.

Show HN: Willow Inference Server: Optimized ASR/TTS/LLM for Willow/WebRTC/REST (github.com/toverainc)

13 points

3 years ago

94.

Ask HN: Help me improve my C-like language, C3

12 points

6 years ago

95.

Show HN: Lightweight Llama3 Inference Engine – CUDA C (github.com/abhisheknair10)

12 points

a year ago

96.

Show HN: Automatic 1111, but as a Python Package (github.com/saketh12)

11 points

2 years ago

97.

Show HN: Coderive – Iterating through 1 Quintillion Inside a Loop in just 50ms (github.com/DanexCodr)

8 points

5 months ago

98.

Show HN: onprem unstructured data extraction with 4 lines of code (github.com/NanoNets)

8 points

a year ago

99.

Show HN: Local GLaDOS (old.reddit.com)

8 points

2 years ago

100.

Show HN: WaveletLM – wavelet-based, attention-free model with O(n log n) scaling (github.com/ramongougis)

7 points

a month ago

101.

Show HN: Serve 100 Large AI models on a single GPU with low impact to TTFT (github.com/leoheuler)

7 points

7 months ago

102.

Show HN: Federation of robots collaboratively trains an object manipulation model (github.com/adap)

7 points

a year ago

103.

Show HN: OS Megakernel that matches M5 Max Tok/W at 2x the Throughput on RTX 3090 (github.com/Luce-Org)

6 points

2 months ago

104.

Show HN: Blink-Edit – Cursor-style next-edit predictions for Neovim (local LLMs) (github.com/BlinkResearchLabs)

6 points

4 months ago

105.

Show HN: I'm tired of my LLM bullshitting. So I fixed it

5 points

4 months ago

106.

Show HN: AI Council – multi-model deliberation that runs in the browser (github.com/prijak)

5 points

3 months ago

107.

Show HN: I built an AI movie making and design engine in Rust (github.com/storytold)

5 points

4 months ago

108.

Show HN: ClawMem – Open-source agent memory with SOTA local GPU retrieval (github.com/yoloshii)

5 points

2 months ago

109.

TinyTTS: Ultra-light English TTS (9M params, 20MB), 8x CPU, 67x GPU

5 points

3 months ago

110.

Show HN: Clawbernetes – Replace kubectl with conversation (Rust) (github.com/clawbernetes)

5 points

4 months ago

111.

Show HN: Open-source fine-tuning in a Colab notebook (colab.research.google.com)

5 points

2 years ago

112.

Show HN: Self-hosted RAG with MCP support for OpenClaw (github.com/2dogsandanerd)

4 points

4 months ago

113.

Show HN: Pile Programming Language (github.com/sixfootbeard)

4 points

3 years ago

114.

Show HN: NSED is public – Mixture-of-Models to Hit SOTA using self-hosted AI (github.com/peeramid-labs)

4 points

4 months ago

115.

Show HN: ArtCraft AI crafting engine, written in Rust (github.com/storytold)

4 points

4 months ago

116.

Show HN: HORenderer3: A C++ software renderer implementing OpenGL 3.3 pipeline (github.com/Hobanghann)

4 points

5 months ago

117.

Run 35B LLMs on Dual Pascal GPUs with QLoRA

4 points

8 months ago

118.

Show HN: Generic and variadic printing library in C (github.com/agvxov)

4 points

a year ago

119.

Show HN: A reasoning model that infers over whole tasks in 1ms in latent space (github.com/OrderOneAI)

3 points

a year ago

120.

Show HN: Turn any ComfyUI workflow into a web app or API

3 points

a year ago