Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
91.
Show HN: Salad, a distributed cloud for AI (like Airbnb for GPUs)
15 points
bobjmiles
2 years ago
4 comments
92.
Show HN: KTransformers:671B DeepSeek-R1 on a Single Machine-286 tokens/s Prefill (github.com/kvcache-ai)
14 points
sssummer
a year ago
discuss
93.
Show HN: Willow Inference Server: Optimized ASR/TTS/LLM for Willow/WebRTC/REST (github.com/toverainc)
13 points
kkielhofner
3 years ago
13 comments
94.
Ask HN: Help me improve my C-like language, C3
12 points
Nuoji
6 years ago
7 comments
95.
Show HN: Lightweight Llama3 Inference Engine – CUDA C (github.com/abhisheknair10)
12 points
abhisheknair10
a year ago
discuss
96.
Show HN: Automatic 1111, but as a Python Package (github.com/saketh12)
11 points
saketh105
2 years ago
discuss
97.
Show HN: Coderive – Iterating through 1 Quintillion Inside a Loop in just 50ms (github.com/DanexCodr)
8 points
DanexCodr
5 months ago
13 comments
98.
Show HN: onprem unstructured data extraction with 4 lines of code (github.com/NanoNets)
8 points
souvik3333
a year ago
discuss
99.
Show HN: Local GLaDOS (old.reddit.com)
8 points
dnhkng
2 years ago
discuss
100.
Show HN: WaveletLM – wavelet-based, attention-free model with O(n log n) scaling (github.com/ramongougis)
7 points
anarmorarm
a month ago
1 comment
101.
Show HN: Serve 100 Large AI models on a single GPU with low impact to TTFT (github.com/leoheuler)
7 points
leonheuler
7 months ago
1 comment
102.
Show HN: Federation of robots collaboratively train an object manipulation model (github.com/adap)
7 points
jafermarq
a year ago
discuss
103.
Show HN: OS Megakernel that match M5 Max Tok/w at 2x the Throughput on RTX 3090 (github.com/Luce-Org)
6 points
GreenGames
2 months ago
1 comment
104.
Show HN: Blink-Edit – Cursor-style next-edit predictions for Neovim (local LLMs) (github.com/BlinkResearchLabs)
6 points
atemyipod
4 months ago
discuss
105.
Show HN: I'm tired of my LLM bullshitting. So I fixed it
5 points
BobbyLLM
4 months ago
9 comments
106.
Show HN: AI Council – multi-model deliberation that runs in the browser (github.com/prijak)
5 points
prijak
3 months ago
1 comment
107.
Show HN: I built an AI movie making and design engine in Rust (github.com/storytold)
5 points
echelon
4 months ago
1 comment
108.
Show HN: ClawMem – Open-source agent memory with SOTA local GPU retrieval (github.com/yoloshii)
5 points
yoloshii
2 months ago
discuss
109.
TinyTTS: Ultra-light English TTS (9M params, 20MB), 8x CPU, 67x GPU
5 points
letrghieu
3 months ago
discuss
110.
Show HN: Clawbernetes – Replace kubectl with conversation (Rust) (github.com/clawbernetes)
5 points
redclaw
4 months ago
discuss
111.
Show HN: Open-source fine-tuning in a Colab notebook (colab.research.google.com)
5 points
danielhanchen
2 years ago
discuss
112.
Show HN: Self-hosted RAG with MCP support for OpenClaw (github.com/2dogsandanerd)
4 points
2dogsanerd
4 months ago
2 comments
113.
Show HN: Pile Programming Language (github.com/sixfootbeard)
4 points
jhhh
3 years ago
2 comments
114.
Show HN: NSED is public – Mixture-of-Models to Hit SOTA using self-hosted AI (github.com/peeramid-labs)
4 points
t_peersky
4 months ago
discuss
115.
Show HN: ArtCraft AI crafting engine, written in Rust (github.com/storytold)
4 points
echelon
4 months ago
discuss
116.
Show HN: HORenderer3: A C++ software renderer implementing OpenGL 3.3 pipeline (github.com/Hobanghann)
4 points
zghdls
5 months ago
discuss
117.
Run 35B LLMs on Dual Pascal GPUs with QLoRA
4 points
rickesh_tn
8 months ago
discuss
118.
Show HN: Generic and variadic printing library in C (github.com/agvxov)
4 points
agvxov
a year ago
discuss
119.
Show HN: A reasoning model that infers over whole tasks in 1ms in latent space (github.com/OrderOneAI)
3 points
orderone_ai
a year ago
6 comments
120.
Show HN: Turn any ComfyUI workflow into a web app or API
3 points
jjdelannoy
a year ago
6 comments
More