Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
61.
▲
Show HN: Collider – the platform for local LLM debug and inference at warp speed
(github.com/gotzmann)
3 points
Ambix
3 years ago
1 comment
62.
▲
Fixed a llama.cpp bug silently disabling Vulkan GPU on all 32-bit ARM devices
3 points
perinban
2 months ago
discuss
63.
▲
Tq-KV – Rust implementation of TurboQuant that works on GGUF models
3 points
onurgokyildiz
2 months ago
discuss
64.
▲
Show HN: I built a 2nd-order PyTorch optimizer for LLMs that runs on 16GB GPUs
2 points
dnosoz
a month ago
4 comments
65.
▲
Show HN: Loft CLI – Fine-tune and run LLMs (1–3B) on 8 GB MacBook Air, no GPUs
2 points
dips2umar
a year ago
1 comment
66.
▲
Show HN: LlamaFarm – Working on binary AI Project deployment – (early preview)
(github.com/llama-farm)
2 points
rgthelen
a year ago
1 comment
67.
▲
Sumi – Open-source voice-to-text with local AI polishing
2 points
alkd
3 months ago
discuss
68.
▲
Show HN: Reduction Blockprint Planner/Simulator
(reduction-planner.hirson.xyz)
2 points
gh5000
3 months ago
discuss
69.
▲
Show HN: OctoFlow v1.0.0 – GPU VM where the GPU runs autonomously, CPU is BIOS
2 points
mr_octopus
3 months ago
discuss
70.
▲
Show HN: Local Voice Assistant
2 points
armcat
4 months ago
discuss
71.
▲
Show HN: Promptscout a local prompt enricher for Claude Code
(github.com/obsfx)
2 points
obsfx
4 months ago
discuss
72.
▲
Show HN: TrendScope – Real-time financial sentiment analysis on a cheap VPS
(trendscope.akamaar.dev)
2 points
mohammede
4 months ago
discuss
73.
▲
Show HN: SpeedyEDA – One-line exploratory data analysis
2 points
dawitworku
5 months ago
discuss
74.
▲
Running a 270M LLM on Android (architecture and benchmarks)
2 points
ayushranjan99
6 months ago
discuss
75.
▲
Show HN: ONNX optimized SigLIP and related foundation models
(github.com/rhysdg)
2 points
rhysdg
2 years ago
discuss
76.
▲
DSPTools: Open Source DSP simulator for iOS devices
2 points
medius
14 years ago
discuss
77.
▲
Show HN: Python Bindings for llama.cpp with some CLIs
(github.com/thomasantony)
2 points
tantony
3 years ago
discuss
78.
▲
Show HN: Localvoxtral – Local real-time dictation on macOS with streaming STT
(github.com/T0mSIlver)
1 point
T0mSIlver
3 months ago
2 comments
79.
▲
Day 1 of trying to fit a Chatbot into a QR Code
1 point
kuberwastaken
a year ago
2 comments
80.
▲
Off Grid: On-device AI-web browsing, tools, vision, image gen, voice – 3x faster
1 point
ali_chherawalla
3 months ago
1 comment
81.
▲
Show HN: Mixture of Voices–Open source goal-based AI router-uses BGE transformer
1 point
KylieM
9 months ago
1 comment
82.
▲
Show HN: WayInfer – Native GGUF engine that runs models larger than your RAM
1 point
ahmedm24
2 months ago
discuss
83.
▲
I built two Loihi-parity neuromorphic processors from scratch
1 point
catalyst-neuro
4 months ago
discuss
84.
▲
Show HN: Running an LLM Inside Scratch
(github.com/Broyojo)
1 point
broyojo
4 months ago
discuss
85.
▲
Show HN: ARIA – P2P distributed inference protocol for 1-bit LLMs on CPU
(github.com/spmfrance-cloud)
1 point
anthonymu
4 months ago
discuss
86.
▲
Show HN: Loclean – Local semantic data cleaning with LLMs and Pydantic
(github.com/nxank4)
1 point
nxank4
4 months ago
discuss
87.
▲
Show HN: Satya – Offline-first AI tutor for rural schools (Phi-1.5 and RAG)
(github.com/aa-sikkkk)
1 point
aashikbaruwal
5 months ago
discuss
88.
▲
Show HN: LLviM – A conversational coding plugin for Vim and Local LLMs
(github.com/gkchestertron)
1 point
trs83
a year ago
discuss
89.
▲
Show HN: Traitorous Models- Reality Show with Open Source LLMs
(github.com/michaelgiba)
1 point
michaelgiba
a year ago
discuss
90.
▲
Is My Approach to Vectorizing and Storing 1.5 Trillion Tokens Reasonable?
1 point
reutinger
2 years ago
discuss
More