Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
571.
Understanding R1-Zero-Like Training: A Critical Perspective (github.com/sail-sg)
160 points
pama
a year ago
21 comments
572.
Forth implemented in Rust trait system (github.com/Ashymad)
157 points
Ashymad
6 years ago
84 comments
573.
Train CIFAR10 to 94% in under 10 seconds on a single A100 (github.com/tysam-code)
151 points
tysam_and
3 years ago
50 comments
574.
Show HN: Streamdal – an open-source tail -f for your data (github.com/streamdal)
148 points
dsies
3 years ago
37 comments
575.
Paper Tape Is All You Need – Training a Transformer on a 1976 Minicomputer (github.com/dbrll)
145 points
rahen
2 months ago
26 comments
576.
Extreme video compression with prediction using pre-trainded diffusion models (github.com/ElesionKyrie)
144 points
john_g
2 years ago
88 comments
577.
01-AI/Yi: A series of large language models trained from scratch (github.com/01-ai)
143 points
simonpure
3 years ago
52 comments
578.
Open-Llama: Complete training pipeline for building large language models (github.com/s-JoL)
141 points
bayes-song
3 years ago
12 comments
579.
Ask HN: Has Anyone Trained a personal LLM using their personal notes?
138 points
Erazal
2 years ago
69 comments
580.
Using OpenAI Gym to train an open-source 3D printed robot (github.com/nicrusso7)
135 points
nicrusso7
6 years ago
26 comments
581.
Diffusion training from scratch on a micro-budget (github.com/SonyResearch)
135 points
lnyan
a year ago
23 comments
582.
YaFSDP: a sharded data parallelism framework, faster for pre-training LLMs (github.com/yandex)
135 points
wiradikusuma
2 years ago
16 comments
583.
Schedule-Free Learning – A New Way to Train (github.com/facebookresearch)
131 points
ironbound
2 years ago
43 comments
584.
TScale – Distributed training on consumer GPUs (github.com/Foreseerr)
130 points
zX41ZdbW
a year ago
27 comments
585.
TensorFlow Code for Google Research's BERT: Pre-Training Method for NLP Tasks (github.com/google-research)
129 points
ArtWomb
8 years ago
13 comments
586.
Show HN: Set of trained deep learning models for computer vision (github.com/fchollet)
127 points
fchollet
10 years ago
15 comments
587.
Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL (github.com/Danau5tin)
125 points
Danau5tin
10 months ago
12 comments
588.
Show HN: Python decorator that enables arbitrarily-deep tail/non-tail recursion (github.com/tylerhou)
119 points
tylerhou
4 years ago
21 comments
589.
Tail-call optimization added to 6to5 compiler (github.com/6to5)
117 points
insertion
11 years ago
31 comments
590.
Training open-source LLMs on ChatGPT output is a really bad idea. (gist.github.com)
114 points
laprise
3 years ago
76 comments
591.
NanoGPT: The simplest, fastest repository for training medium-sized GPTs (github.com/karpathy)
114 points
ulrischa
2 years ago
21 comments
592.
How to Train and Build a Conversational News Chatbot (github.com/tzano)
108 points
tzano
8 years ago
8 comments
593.
List of European train stations and associated metadata (github.com/capitainetrain)
107 points
thibaut_barrere
11 years ago
55 comments
594.
Show HN: Ts-SSH – SSH over Tailscale without running the daemon (github.com/derekg)
103 points
i8code
a year ago
37 comments
595.
Driving dataset for car autopilot AI training (github.com/commaai)
100 points
EvgeniyZh
10 years ago
44 comments
596.
Horovod: Distributed Training Framework for TensorFlow, Keras, and PyTorch (github.com/uber)
100 points
axiomdata316
8 years ago
9 comments
597.
Agent Lightning: Train agents with RL (no code changes needed) (github.com/microsoft)
98 points
bakigul
7 months ago
14 comments
598.
Show HN: Tacopy – Tail Call Optimization for Python (github.com/raaidrt)
95 points
raaid-rt
6 months ago
54 comments
599.
Show HN: less than 650 LOC trainable GPT only using NumPy (github.com/joennlae)
90 points
joennlae
3 years ago
18 comments
600.
Show HN: Wanderer – an open-source trail database (github.com/Flomp)
89 points
get_flomped
2 years ago
13 comments
More