Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
61.
▲
Show HN: Recurser lib reduces GPT2-XL VRAM usage by 25% and runs it on Colab
(github.com/max-ng)
5 points
homo_sapiens
3 years ago
1 comment
62.
▲
Show HN: A Vaadin Algebra and Calculus Solver Built with AI Assistance
4 points
bellaOxmyx
3 months ago
1 comment
63.
▲
Show HN: AudioGhost AI – Run Meta's Sam-Audio on Consumer GPUs (4GB-6GB VRAM)
(github.com/0x0funky)
3 points
0x0funky
5 months ago
1 comment
64.
▲
Shimmy v1.7.0: Running 42B Moe Models on Consumer GPUs with 99.9% VRAM Reduction
(github.com/Michael-A-Kuykendall)
3 points
MKuykendall
8 months ago
1 comment
65.
▲
Grinder12: 0.96-Bit Lossless Streaming KV-Cache (16.55x VRAM Savings
(github.com/ggml-org)
3 points
AMICLLC
a month ago
discuss
66.
▲
Show HN: L88 – A Local RAG System on 8GB VRAM (Need Architecture Feedback)
(github.com/Hundred-Trillion)
3 points
adithyadrdo
3 months ago
discuss
67.
▲
Unsloth – Train LLMs 2x faster with 70% less VRAM
(github.com/unslothai)
3 points
jhack
6 months ago
discuss
68.
▲
Quansloth Using Google's Turboquant Breaks the "VRAM Wall" for Local LLMs
(github.com/PacifAIst)
2 points
gunzfanatic
2 months ago
1 comment
69.
▲
Show HN: I built a <400ms latency voice agent that runs on a 4gb vram GTX 1650"
(github.com/pheonix-delta)
2 points
shubham-coder
4 months ago
1 comment
70.
▲
Show HN: A Vaadin 24, Spring algebra calculator with dynamic variable buttons
2 points
bellaOxmyx
6 months ago
1 comment
71.
▲
Dead Simple Web UI for Training Flux LoRA with Low VRAM (12GB/16GB/20GB) Support
(github.com/cocktailpeanut)
2 points
cocktailpeanut
2 years ago
discuss
72.
▲
Show HN: Parakeet LLM Demo (378M param. 8GB VRAM)
2 points
razodactyl
2 years ago
discuss
73.
▲
Adjust VRAM/RAM Split on Apple Silicon
(github.com/ggerganov)
1 point
tosh
3 years ago
1 comment
74.
▲
VDPAU-to-VAAPI accelerates Flash video on Intel GFX
(github.com/i-rinat)
1 point
ddalex
13 years ago
discuss
75.
▲
2.3x KV Cache Compression at 32k Context – Cut VRAM Costs by 50%
(github.com/Jamie2111)
1 point
JamieObala
21 days ago
discuss
76.
▲
Show HN: VAAK (Voice-Activated Autonomous-Knowledge-System)
(github.com/ayushmaanbhav)
1 point
ayushmaanbhav
5 months ago
discuss
77.
▲
Show HN: QKV Core – Run 7B LLMs on 4GB VRAM via surgical memory alignment
(github.com/QKV-Core)
1 point
broxytr
6 months ago
discuss
78.
▲
Super Merryo Trolls: An Adventure from the Days Before VRAM
(github.com/GBirkel)
1 point
vatys
2 years ago
discuss
79.
▲
Rust Wishlist: functions with keyword args, default args, varargs
(github.com/rust-lang)
1 point
nurettin
6 years ago
discuss
80.
▲
Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks
(github.com/antoinezambelli)
687 points
zambelli
17 days ago
252 comments
81.
▲
Show HN: InvokeAI, an open source Stable Diffusion toolkit and WebUI
(github.com/invoke-ai)
414 points
sophrocyne
4 years ago
102 comments
82.
▲
Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training
(github.com/alainnothere)
265 points
xlayn
3 months ago
80 comments
83.
▲
Launch HN: Deepsilicon (YC S24) – Software and hardware for ternary transformers
189 points
areddyyt
2 years ago
79 comments
84.
▲
Tell HN: Please Stop Using Imgur
69 points
MzHN
4 years ago
34 comments
85.
▲
Show HN: ZSE – Open-source LLM inference engine with 3.9s cold starts
(github.com/Zyora-Dev)
58 points
zyoralabs
3 months ago
9 comments
86.
▲
Show HN: I built a RISC-V emulator that runs DOOM
(github.com/lalitshankarch)
50 points
Flex247A
a month ago
4 comments
87.
▲
Show HN: Local task classifier and dispatcher on RTX 3080
(github.com/resilientworkflowsentinel)
26 points
Shubham_Amb
4 months ago
2 comments
88.
▲
Show HN: KTransformers–236B Model and 1M Context LLM Inference on Local Machines
(github.com/kvcache-ai)
20 points
sssummer
2 years ago
3 comments
89.
▲
Show HN: Demon – open-source real-time music diffusion engine, 25Hz local GPU
(daydreamlive.github.io)
17 points
ryanontheinside
8 days ago
13 comments
90.
▲
Show HN: Finetune Llama-3.1 2x faster in a Colab
(colab.research.google.com)
16 points
danielhanchen
2 years ago
2 comments
More