Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
31.
▲
I fixed a segfault in Triton that broke every RTX 5070/5080/5090
(github.com/triton-lang)
1 point
pat90000
3 months ago
discuss
32.
▲
Triton CUDA Tile IR Back End
(github.com/triton-lang)
1 point
my123
4 months ago
discuss
33.
▲
Show HN: Autogenerate efficient backward kernels for Triton
(github.com/IaroslavElistratov)
1 point
iaroo
7 months ago
discuss
34.
▲
Show HN: Efficient `Torch.cdist` Using Triton
(github.com/jinensetpal)
1 point
codeinassembly
a year ago
discuss
35.
▲
Automatic Warp Specialization in Triton
(github.com/triton-lang)
1 point
subharmonicon
a year ago
discuss
36.
▲
OpenAI Triton: language and compiler for highly efficient Deep-Learning
(github.com/openai)
1 point
tosh
2 years ago
discuss
37.
▲
Triton: Runtime for highly efficient custom Deep-Learning primitives
(github.com/openai)
1 point
nateb2022
3 years ago
discuss
38.
▲
Triton Kubernetes, a multi-cloud Kubernetes solution
(github.com/joyent)
1 point
merqurio
8 years ago
discuss
39.
▲
Triton-Augment: GPU Kernel Fusion for 5-73x Faster Image/Video Augmentation
(github.com/yuhezhang-ai)
3 points
seedlingfl
7 months ago
2 comments
40.
▲
Real-Time Streaming Apps with Nvidia Open Source Triton Inference
(github.com/nickaggarwal)
3 points
agcat
2 years ago
discuss
41.
▲
Ask HN: What Inference Server do you use to host TTS Models?
1 point
samagra14
a year ago
discuss
42.
▲
Show HN: Friction – A trilogy of archival fiction told via GitHub Markdown
(github.com/andreas-breidenthal)
3 points
a-breidenthal
5 months ago
1 comment
43.
▲
RCE in Nvidia Triton Inference Server
(github.com/protectai)
3 points
byt3bl33d3r
2 years ago
discuss
44.
▲
Liger-Kernel: Efficient Triton kernels for LLM training
(github.com/linkedin)
15 points
letmehandle
2 years ago
2 comments
45.
▲
Show HN: Attorch – PyTorch's nn module written in Python using OpenAI's Triton
(github.com/BobMcDear)
4 points
bornaahz
2 years ago
discuss
46.
▲
Show HN: PreQL/Trilogy – A Higher-Level, Composable SQL
(github.com/preqldata)
3 points
efromvt
2 years ago
5 comments
47.
▲
Bounty for Optimized Triton Kernels for full fine tunes
(github.com/OpenAccess-AI-Collective)
3 points
bratao
2 years ago
discuss
48.
▲
The pi type trilogy (Rust RFC)
(github.com/rust-lang)
3 points
miqkt
9 years ago
discuss
49.
▲
Show HN: We built an LLM inference engine in pure Python – no PyTorch, no Triton
(github.com/Zyora-Dev)
2 points
zyoraclub
2 days ago
discuss
50.
▲
Solving an Obfuscated Crackme with BinaryNinja and Triton
(github.com/jeffli678)
2 points
ingve
6 years ago
discuss
51.
▲
List of Parallels Between the Original Trilogy and Ep. VII TFA
(gist.github.com)
2 points
galori
10 years ago
discuss
52.
▲
Show HN: Iris – Distributed GPU Programming with RMA in Pure Python/Triton
(github.com/ROCm)
1 point
mawad
9 months ago
1 comment
53.
▲
PyTorch 2.3: User-Defined Triton Kernels, Tensor Parallelism in Distributed
(github.com/pytorch)
1 point
lnyan
2 years ago
discuss
54.
▲
Show HN: Tabby – A self-hosted GitHub Copilot
(github.com/TabbyML)
627 points
wsxiaoys
3 years ago
126 comments
55.
▲
Show HN: 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning
(github.com/unslothai)
385 points
danielhanchen
3 years ago
119 comments
56.
▲
Show HN: Metashade – a Pythonic GPU shading/compute EDSL
(github.com/ppenenko)
47 points
ppenenko
2 years ago
8 comments
57.
▲
Show HN: Sleuth, open source workspace search in natural language
(getsleuth.xyz)
31 points
ayanb9440
3 years ago
8 comments
58.
▲
Show HN: Finetune Llama-3.1 2x faster in a Colab
(colab.research.google.com)
16 points
danielhanchen
2 years ago
2 comments
59.
▲
Show HN: Bhumi–OSS Python Library w Rust Underhead for 2.5x Faster LLM Inference
(bhumi.trilok.ai)
8 points
rachpradhan
a year ago
discuss
60.
▲
Show HN: Dbg – One CLI debugger for every language (AI-agent ready)
(redknightlois.github.io)
7 points
redknight666
2 months ago
discuss
More