Heykuki News

TopNewBestAskShowJobs
391.
Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model (github.com/cactus-compute)
776 points
HenryNdubuaku
25 days ago
211 comments
392.
Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks (github.com/antoinezambelli)
687 points
zambelli
18 days ago
252 comments
393.
Show HN: InvokeAI, an open source Stable Diffusion toolkit and WebUI (github.com/invoke-ai)
414 points
sophrocyne
4 years ago
102 comments
394.
Show HN: Doom (1993) in a PDF (doompdf.pages.dev)
369 points
vk6
a year ago
74 comments
395.
Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training (github.com/alainnothere)
265 points
xlayn
3 months ago
80 comments
396.
Launch HN: Deepsilicon (YC S24) – Software and hardware for ternary transformers
189 points
areddyyt
2 years ago
79 comments
397.
Show HN: WebGPU enables local LLM in the browser – demo site with AI chat (andreinwald.github.io)
145 points
andreinwald
10 months ago
54 comments
398.
Show HN: Ephe – A minimalist open-source Markdown paper for today (github.com/unvalley)
143 points
unvalley
a year ago
54 comments
399.
Show HN: I made a volumetric audio visualizer (a-sumo.github.io)
90 points
rslice
4 years ago
42 comments
400.
Show HN: AudioNimbus – Steam Audio's immersive spatial audio, now in Rust (github.com/MaxenceMaire)
77 points
mxncmr
a year ago
7 comments
401.
Show HN: Crazierl – An Erlang Operating System (crazierl.org)
72 points
toast0
2 months ago
14 comments
402.
Show HN: Open Sourcing Our No-Code WebXR Editor After 5 Years of Development (github.com/transferthought)
67 points
keenanTT
2 years ago
15 comments
403.
Show HN: ZSE – Open-source LLM inference engine with 3.9s cold starts (github.com/Zyora-Dev)
58 points
zyoralabs
3 months ago
9 comments
404.
Launch HN: General Instinct (YC P26) – Frontier models on edge devices
51 points
guanming0717
18 hours ago
15 comments
405.
Show HN: I built a RISC-V emulator that runs DOOM (github.com/lalitshankarch)
50 points
Flex247A
a month ago
4 comments
406.
Asking for help: Please encourage Microsoft to add AAAA records for artefacts
36 points
BartjeD
4 years ago
15 comments
407.
Show HN: Local task classifier and dispatcher on RTX 3080 (github.com/resilientworkflowsentinel)
26 points
Shubham_Amb
4 months ago
2 comments
408.
Show HN: Bashtorio – Factorio-Like in the Browser Backed by a Linux VM (bashtorio.xyz)
23 points
elijahcham
4 months ago
discuss
409.
Ask: How to announce my 14 years old web development framework?
22 points
xeora
8 years ago
25 comments
410.
Show HN: KTransformers–236B Model and 1M Context LLM Inference on Local Machines (github.com/kvcache-ai)
20 points
sssummer
2 years ago
3 comments
411.
Show HN: Demon – open-source real-time music diffusion engine, 25Hz local GPU (daydreamlive.github.io)
17 points
ryanontheinside
10 days ago
13 comments
412.
Show HN: EVRTHN.com, a Search Engine for the “Metaverse” (evrthn.com)
17 points
herval
4 years ago
discuss
413.
Show HN: Finetune Llama-3.1 2x faster in a Colab (colab.research.google.com)
16 points
danielhanchen
2 years ago
2 comments
414.
Show HN: Salad, a distributed cloud for AI (like Airbnb for GPUs)
15 points
bobjmiles
2 years ago
4 comments
415.
Achieving a zero-downtime Postgres major version upgrade (medplum.com)
15 points
mattlong
a year ago
3 comments
416.
Ping.gg monitoring engine now open source (Go)
15 points
vruiz
11 years ago
1 comment
417.
Show HN: KTransformers:671B DeepSeek-R1 on a Single Machine-286 tokens/s Prefill (github.com/kvcache-ai)
14 points
sssummer
a year ago
discuss
418.
Show HN: Willow Inference Server: Optimized ASR/TTS/LLM for Willow/WebRTC/REST (github.com/toverainc)
13 points
kkielhofner
3 years ago
13 comments
419.
Show HN: Lightweight Llama3 Inference Engine – CUDA C (github.com/abhisheknair10)
12 points
abhisheknair10
a year ago
discuss
420.
Show HN: Automatic 1111, but as a Python Package (github.com/saketh12)
11 points
saketh105
2 years ago
discuss