Heykuki News
Top
New
Best
Ask
Show
Jobs
841.
▲
Show HN: Nabla – Pure Rust GPU math engine, 7.5× faster matmul than PyTorch
(github.com/fumishiki)
1 point
fumishiki
3 months ago
1 comment
842.
▲
Mimir: GPU Computational Kernels in Rust
(github.com/ccleavinger)
1 point
cjcleav
4 months ago
1 comment
843.
▲
Show HN: Finding stragglers in multi-GPU PyTorch (DDP) training
(github.com/traceopt-ai)
1 point
traceopt-ai
4 months ago
1 comment
844.
▲
Show HN: Turkish Sieve Engine – Reaching 1.1T-Items/S on a Single GPU
(github.com/bilgisofttr)
1 point
bilgisoft
4 months ago
1 comment
845.
▲
I trained a 90-day weather AI on a single GPU using 150 years of data
(github.com/consigcody94)
1 point
sentinelowl
5 months ago
1 comment
846.
▲
SparseFlow – trying 2:4 sparsity on RTX GPUs
(github.com/MapleSilicon)
1 point
maplesilicon
5 months ago
1 comment
847.
▲
NeuroxAI – GPU-Accelerated Neuromorphic Computing Platform
(github.com/TheRemyyy)
1 point
TheRemyyy
5 months ago
1 comment
848.
▲
Whisper-Turbo – Cross-Platform, GPU Accelerated Whisper
(github.com/FL33TW00D)
1 point
montyanderson
6 months ago
1 comment
849.
▲
Axiom-X: A GPU-Accelerated Evolutionary Engine
(github.com/BMV-AI)
1 point
BMV-AI
6 months ago
1 comment
850.
▲
[OPEN-SOURCE] Whisper finetuning, inference, auto GPU upscale, proxy and co
(github.com)
1 point
amarcel
6 months ago
1 comment
851.
▲
Show HN: GPU-Based Kubernetes HPA for Triton Inference Server
(github.com/uzunenes)
1 point
uzunenes
6 months ago
1 comment
852.
▲
Chronos – Fair GPU Time-Sharing (Side Project)
(github.com/Oabraham1)
1 point
oabraham1
8 months ago
1 comment
853.
▲
Show HN: Iris – Distributed GPU Programming with RMA in Pure Python/Triton
(github.com/ROCm)
1 point
mawad
9 months ago
1 comment
854.
▲
ParaAttention: Speed Up Flux and Mochi Inference with Multiple GPUs
(github.com/chengzeyi)
1 point
chengzeyi
2 years ago
1 comment
855.
▲
E2E LLM finetuning on a single 24GB GPU
(github.com/jdecourval)
1 point
jdecourval
2 years ago
1 comment
856.
▲
Python toolkit for image clustering using PCA and K-means with support for GPU
(github.com/cobanov)
1 point
cobanov
2 years ago
1 comment
857.
▲
Develop+Deploy RAG Bots with LlamaEdge: Across OSes, NPUs, GPUs Using Vector DB
(github.com/WasmEdge)
1 point
3Sophons
2 years ago
1 comment
858.
▲
PyTorch to support Apple M1 GPU
(github.com/pytorch)
1 point
jonathanbgn
5 years ago
1 comment
859.
▲
Bc7enc – Fast BC1-7 GPU Texture Encoders with Rate Distortion Optimization (RDO)
(github.com/richgel999)
1 point
tosh
5 years ago
1 comment
860.
▲
Show HN: NLP PyTorch Tutorial (fire up the GPUs)
(github.com/will-thompson-k)
1 point
wilhelm___
5 years ago
1 comment
861.
▲
A basic example of using ManagedCUDA via C# to execute logic on the GPU
(github.com/mgravell)
1 point
jsingleton
10 years ago
1 comment
862.
▲
4k monitor at 50hz. Guide for overclocking displays on GNU/Linux with Intel GPU
(github.com/kevinlekiller)
1 point
empiricus
11 years ago
1 comment
863.
▲
Show HN: Boost.Compute – A C++ GPU Computing Library for OpenCL
(github.com/kylelutz)
1 point
kylelutz
12 years ago
discuss
864.
▲
GPU incrementing an array example (10x faster)
(github.com/marcortiztorres)
1 point
marcortiztorres
12 years ago
discuss
865.
▲
GPU-accelerated natural language parser
(github.com/jcanny)
1 point
mlla
13 years ago
discuss
866.
▲
Harlan: a Language That Simplifies GPU Programming
(github.com/eholk)
1 point
twthewizard
13 years ago
discuss
867.
▲
Harlan for GPU Computing
(github.com/eholk)
1 point
brini
13 years ago
discuss
868.
▲
Gspace.sh - Compress git repos (git gc) recursively
(gist.github.com)
1 point
nvk
13 years ago
discuss
869.
▲
Micro-Expert-Router: Running Mixtral-Class MoE Models on NVMe SSDs Without a GPU
(github.com/randyap8-wq)
1 point
randyap8
13 days ago
discuss
870.
▲
MegaTrain: Full-Precision Training of 100B+ Parameter LLMs on a Single GPU
(github.com/DLYuanGod)
1 point
adulau
22 days ago
discuss