Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
31.
I fixed a segfault in Triton that broke every RTX 5070/5080/5090 (github.com/triton-lang)
1 point
pat90000
3 months ago
discuss
32.
Triton CUDA Tile IR Back End (github.com/triton-lang)
1 point
my123
4 months ago
discuss
33.
Show HN: Autogenerate efficient backward kernels for Triton (github.com/IaroslavElistratov)
1 point
iaroo
7 months ago
discuss
34.
Show HN: Efficient `Torch.cdist` Using Triton (github.com/jinensetpal)
1 point
codeinassembly
a year ago
discuss
35.
Automatic Warp Specialization in Triton (github.com/triton-lang)
1 point
subharmonicon
a year ago
discuss
36.
OpenAI Triton: language and compiler for highly efficient Deep-Learning (github.com/openai)
1 point
tosh
2 years ago
discuss
37.
Triton: Runtime for highly efficient custom Deep-Learning primitives (github.com/openai)
1 point
nateb2022
3 years ago
discuss
38.
Triton Kubernetes, a multi-cloud Kubernetes solution (github.com/joyent)
1 point
merqurio
8 years ago
discuss
39.
Triton-Augment: GPU Kernel Fusion for 5-73x Faster Image/Video Augmentation (github.com/yuhezhang-ai)
3 points
seedlingfl
7 months ago
2 comments
40.
Real-Time Streaming Apps with Nvidia Open Source Triton Inference (github.com/nickaggarwal)
3 points
agcat
2 years ago
discuss
41.
Ask HN: What Inference Server do you use to host TTS Models?
1 point
samagra14
a year ago
discuss
42.
Show HN: Friction – A trilogy of archival fiction told via GitHub Markdown (github.com/andreas-breidenthal)
3 points
a-breidenthal
5 months ago
1 comment
43.
RCE in Nvidia Triton Inference Server (github.com/protectai)
3 points
byt3bl33d3r
2 years ago
discuss
44.
Liger-Kernel: Efficient Triton kernels for LLM training (github.com/linkedin)
15 points
letmehandle
2 years ago
2 comments
45.
Show HN: Attorch – PyTorch's nn module written in Python using OpenAI's Triton (github.com/BobMcDear)
4 points
bornaahz
2 years ago
discuss
46.
Show HN: PreQL/Trilogy – A Higher-Level, Composable SQL (github.com/preqldata)
3 points
efromvt
2 years ago
5 comments
47.
Bounty for Optimized Triton Kernels for full fine tunes (github.com/OpenAccess-AI-Collective)
3 points
bratao
2 years ago
discuss
48.
The pi type trilogy (Rust RFC) (github.com/rust-lang)
3 points
miqkt
9 years ago
discuss
49.
Show HN: We built an LLM inference engine in pure Python – no PyTorch, no Triton (github.com/Zyora-Dev)
2 points
zyoraclub
2 days ago
discuss
50.
Solving an Obfuscated Crackme with BinaryNinja and Triton (github.com/jeffli678)
2 points
ingve
6 years ago
discuss
51.
List of Parallels Between the Original Trilogy and Ep. VII TFA (gist.github.com)
2 points
galori
10 years ago
discuss
52.
Show HN: Iris – Distributed GPU Programming with RMA in Pure Python/Triton (github.com/ROCm)
1 point
mawad
9 months ago
1 comment
53.
PyTorch 2.3: User-Defined Triton Kernels, Tensor Parallelism in Distributed (github.com/pytorch)
1 point
lnyan
2 years ago
discuss
54.
Show HN: Tabby – A self-hosted GitHub Copilot (github.com/TabbyML)
627 points
wsxiaoys
3 years ago
126 comments
55.
Show HN: 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning (github.com/unslothai)
385 points
danielhanchen
3 years ago
119 comments
56.
Show HN: Metashade – a Pythonic GPU shading/compute EDSL (github.com/ppenenko)
47 points
ppenenko
2 years ago
8 comments
57.
Show HN: Sleuth, open source workspace search in natural language (getsleuth.xyz)
31 points
ayanb9440
3 years ago
8 comments
58.
Show HN: Finetune Llama-3.1 2x faster in a Colab (colab.research.google.com)
16 points
danielhanchen
2 years ago
2 comments
59.
Show HN: Bhumi–OSS Python Library w Rust Underhead for 2.5x Faster LLM Inference (bhumi.trilok.ai)
8 points
rachpradhan
a year ago
discuss
60.
Show HN: Dbg – One CLI debugger for every language (AI-agent ready) (redknightlois.github.io)
7 points
redknight666
2 months ago
discuss
More