Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
151.
▲
Implementing ray serve multi GPUs with autoscaling for stable-diffusion-webui
(github.com/AUTOMATIC1111)
1 point
kelsey9876543
3 years ago
discuss
152.
▲
Nvflashk – Flash any BIOS to Nvidia GPUs – Safe board ID bypass up to 4xx series
(github.com/notfromstatefarm)
1 point
hardenedvault
3 years ago
discuss
153.
▲
Fast and Memory efficient lattice Boltzmann CFD software, running on all GPUs
(github.com/ProjectPhysX)
1 point
cbracketdash
3 years ago
discuss
154.
▲
MiniLLM is a minimal system for running modern LLMs on consumer-grade GPUs
(github.com/kuleshov)
1 point
tosh
3 years ago
discuss
155.
▲
TRTorch: PyTorch/TorchScript Compiler for Nvidia GPUs Using TensorRT
(github.com/NVIDIA)
1 point
lnyan
5 years ago
discuss
156.
▲
CUDA on Intel GPUs in Rust
(github.com/vosen)
1 point
gyre007
6 years ago
discuss
157.
▲
Purge-nvda: Optimize external graphics for Macs with discrete Nvidia GPUs
(github.com/mayankk2308)
1 point
lnyan
6 years ago
discuss
158.
▲
Build and Run Docker Containers Leveraging Nvidia GPUs
(github.com/NVIDIA)
1 point
thushanfernando
7 years ago
discuss
159.
▲
Gpu_monitor: Monitor your GPUs whether they are on a computer or in a cluster
(github.com/msalvaris)
1 point
sytelus
8 years ago
discuss
160.
▲
Set-eGPU: allow use of external GPUs in macOS, even on internal displays
(github.com/mayankk2308)
1 point
ingve
8 years ago
discuss
161.
▲
Monitor your GPUs whether they are on a single computer or in a cluster
(github.com/msalvaris)
1 point
jonbaer
8 years ago
discuss
162.
▲
Build and run Docker containers leveraging Nvidia GPUs (experimental)
(github.com/NVIDIA)
1 point
885895
11 years ago
discuss
163.
▲
Show HN: Kitten TTS – 25MB CPU-Only, Open-Source TTS Model
(github.com/KittenML)
1003 points
divamgupta
10 months ago
361 comments
164.
▲
Show HN: Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU
(github.com/xaskasdf)
395 points
xaskasdf
4 months ago
101 comments
165.
▲
Show HN: 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning
(github.com/unslothai)
385 points
danielhanchen
3 years ago
119 comments
166.
▲
Show HN: Generative Fill with AI and 3D
(github.com/fill3d)
360 points
olokobayusuf
3 years ago
102 comments
167.
▲
Launch HN: Silurian (YC S24) – Simulate the Earth
338 points
rejuvyesh
2 years ago
141 comments
168.
▲
Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training
(github.com/alainnothere)
265 points
xlayn
3 months ago
80 comments
169.
▲
Launch HN: Hatchet (YC W24) – Open-source task queue, now with a cloud version
245 points
abelanger
2 years ago
95 comments
170.
▲
Less Slow C++
(github.com/ashvardanian)
198 points
ashvardanian
a year ago
97 comments
171.
▲
Launch HN: Deepsilicon (YC S24) – Software and hardware for ternary transformers
189 points
areddyyt
2 years ago
79 comments
172.
▲
Show HN: Tune LLaMa3.1 on Google Cloud TPUs
(github.com/felafax)
189 points
felarof
2 years ago
52 comments
173.
▲
Ollama for Linux – Run LLMs on Linux with GPU Acceleration
(github.com/jmorganca)
173 points
jmorgan
3 years ago
54 comments
174.
▲
Show HN: TensorDock Core GPU Cloud – GPU servers from $0.29/hr
(tensordock.com)
147 points
jonathanlei
4 years ago
52 comments
175.
▲
Launch HN: ParaQuery (YC X25) – GPU Accelerated Spark/SQL
135 points
winwang
a year ago
81 comments
176.
▲
Show HN: ART – a new open-source RL framework for training agents
(github.com/OpenPipe)
116 points
kcorbitt
a year ago
12 comments
177.
▲
Show HN: Open-source real time data framework for LLM applications
(getindexify.ai)
92 points
diptanu
2 years ago
6 comments
178.
▲
Tell HN: GpuOwl/PRPLL, GPU software used to find the largest prime number
72 points
mpreda
2 years ago
43 comments
179.
▲
Show HN: A personalised AI tutor with < 1s voice responses
(educationbot.cerebrium.ai)
72 points
za_mike157
2 years ago
24 comments
180.
▲
Show HN: A GPU-accelerated binary vector index
(rlafuente.com)
65 points
andes314
a year ago
7 comments
More