Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
91.
▲
Show HN: Salad, a distributed cloud for AI (like Airbnb for GPUs)
15 points
bobjmiles
2 years ago
4 comments
92.
▲
Show HN: KTransformers:671B DeepSeek-R1 on a Single Machine-286 tokens/s Prefill
(github.com/kvcache-ai)
14 points
sssummer
a year ago
discuss
93.
▲
Show HN: Willow Inference Server: Optimized ASR/TTS/LLM for Willow/WebRTC/REST
(github.com/toverainc)
13 points
kkielhofner
3 years ago
13 comments
94.
▲
Ask HN: Help me improve my C-like language, C3
12 points
Nuoji
6 years ago
7 comments
95.
▲
Show HN: Lightweight Llama3 Inference Engine – CUDA C
(github.com/abhisheknair10)
12 points
abhisheknair10
a year ago
discuss
96.
▲
Show HN: Automatic 1111, but as a Python Package
(github.com/saketh12)
11 points
saketh105
2 years ago
discuss
97.
▲
Show HN: Coderive – Iterating through 1 Quintillion Inside a Loop in just 50ms
(github.com/DanexCodr)
8 points
DanexCodr
5 months ago
13 comments
98.
▲
Show HN: onprem unstructured data extraction with 4 lines of code
(github.com/NanoNets)
8 points
souvik3333
a year ago
discuss
99.
▲
Show HN: Local GLaDOS
(old.reddit.com)
8 points
dnhkng
2 years ago
discuss
100.
▲
Show HN: WaveletLM – wavelet-based, attention-free model with O(n log n) scaling
(github.com/ramongougis)
7 points
anarmorarm
a month ago
1 comment
101.
▲
Show HN: Serve 100 Large AI models on a single GPU with low impact to TTFT
(github.com/leoheuler)
7 points
leonheuler
7 months ago
1 comment
102.
▲
Show HN: Federation of robots collaboratively train an object manipulation model
(github.com/adap)
7 points
jafermarq
a year ago
discuss
103.
▲
Show HN: OS Megakernel that match M5 Max Tok/w at 2x the Throughput on RTX 3090
(github.com/Luce-Org)
6 points
GreenGames
2 months ago
1 comment
104.
▲
Show HN: Blink-Edit – Cursor-style next-edit predictions for Neovim (local LLMs)
(github.com/BlinkResearchLabs)
6 points
atemyipod
4 months ago
discuss
105.
▲
Show HN: I'm tired of my LLM bullshitting. So I fixed it
5 points
BobbyLLM
4 months ago
9 comments
106.
▲
Show HN: AI Council – multi-model deliberation that runs in the browser
(github.com/prijak)
5 points
prijak
3 months ago
1 comment
107.
▲
Show HN: I built an AI movie making and design engine in Rust
(github.com/storytold)
5 points
echelon
4 months ago
1 comment
108.
▲
Show HN: ClawMem – Open-source agent memory with SOTA local GPU retrieval
(github.com/yoloshii)
5 points
yoloshii
2 months ago
discuss
109.
▲
TinyTTS: Ultra-light English TTS (9M params, 20MB), 8x CPU, 67x GPU
5 points
letrghieu
3 months ago
discuss
110.
▲
Show HN: Clawbernetes – Replace kubectl with conversation (Rust)
(github.com/clawbernetes)
5 points
redclaw
4 months ago
discuss
111.
▲
Show HN: Open-source fine-tuning in a Colab notebook
(colab.research.google.com)
5 points
danielhanchen
2 years ago
discuss
112.
▲
Show HN: Self-hosted RAG with MCP support for OpenClaw
(github.com/2dogsandanerd)
4 points
2dogsanerd
4 months ago
2 comments
113.
▲
Show HN: Pile Programming Language
(github.com/sixfootbeard)
4 points
jhhh
3 years ago
2 comments
114.
▲
Show HN: NSED is public – Mixture-of-Models to Hit SOTA using self-hosted AI
(github.com/peeramid-labs)
4 points
t_peersky
4 months ago
discuss
115.
▲
Show HN: ArtCraft AI crafting engine, written in Rust
(github.com/storytold)
4 points
echelon
4 months ago
discuss
116.
▲
Show HN: HORenderer3: A C++ software renderer implementing OpenGL 3.3 pipeline
(github.com/Hobanghann)
4 points
zghdls
5 months ago
discuss
117.
▲
Run 35B LLMs on Dual Pascal GPUs with QLoRA
4 points
rickesh_tn
8 months ago
discuss
118.
▲
Show HN: Generic and variadic printing library in C
(github.com/agvxov)
4 points
agvxov
a year ago
discuss
119.
▲
Show HN: A reasoning model that infers over whole tasks in 1ms in latent space
(github.com/OrderOneAI)
3 points
orderone_ai
a year ago
6 comments
120.
▲
Show HN: Turn any ComfyUI workflow into a web app or API
3 points
jjdelannoy
a year ago
6 comments
More