Heykuki News
61.
Show HN: Mistral-7B training using PySpark, DeepSpeed
(github.com/genji970)
2 points
gituser123
10 months ago
discuss
62.
Sacrificial Training
(github.com/jmward01)
2 points
thunderbong
a year ago
discuss
63.
Training LLMs on 1080 Tis without shadow weights
(github.com/batteryphil)
1 point
batteryphil
4 months ago
1 comment
64.
Free Ruby AI Training Materials
(github.com/thedayisntgray)
1 point
thedayisntgray
a year ago
discuss
65.
A Tutorial on Training Self-Play Agents
(github.com/hardmaru)
1 point
hardmaru
6 years ago
discuss
66.
Open-Source LaMDA Model
27 points
EnricoShippole
4 years ago
discuss
67.
Show HN: Fine-tuning an LLM on your code for better code completions
(prvn.sh)
4 points
prvnsmpth
a year ago
discuss
68.
Nebulgym, a new open-source that accelerates AI training (~1.5-2x)
3 points
emilec___
4 years ago
1 comment
69.
Show HN: TorchSubmit – Painless multi-node training with PyTorch (no SLURM/K8s)
(github.com/dream3d-ai)
3 points
tony_francis
2 years ago
discuss
70.
Security of BIOS/UEFI System Firmware from Attacker and Defender Perspectives
(github.com/advanced-threat-research)
57 points
adulau
9 years ago
3 comments
71.
I have trained StyleGAN2 from scratch with a dataset of female portraits
(github.com/l4rz)
20 points
EvgeniyZh
5 years ago
20 comments
72.
Building a Simple (Android) User Interface (using JRuby / Ruboto)
(github.com/KCErb)
2 points
MrBra
12 years ago
discuss
73.
ml-engineering/training/performance
(github.com/stas00)
2 points
lordswork
10 months ago
discuss
74.
Radare2 from a to Z (extended edition) [reverse engineering]
(github.com/radareorg)
2 points
j_s
10 years ago
discuss
75.
MNIST Training in C# – Deep Learning
(github.com/deepakkumar1984)
1 point
siadroid
9 years ago
discuss
76.
Show HN: Syna – Minimal ML and RL Framework Built from Scratch with NumPy
(github.com/sql-hkr)
8 points
sql-hkr
8 months ago
discuss
77.
How Are My Hyperparameters Affecting My Training Time?
(github.com/sigopt)
2 points
alexcmu
10 years ago
discuss
78.
Hands-on workshops and training sessions at Universe
(github.com/blog)
1 point
dwaxe
10 years ago
discuss
79.
Llm.c – LLM training in simple, pure C/CUDA
(github.com/karpathy)
1050 points
tosh
2 years ago
168 comments
80.
DeepSeek open source DeepEP – library for MoE training and Inference
(github.com/deepseek-ai)
536 points
helloericsf
a year ago
71 comments
81.
CoreNet: A library for training deep neural networks
(github.com/apple)
494 points
rocauc
2 years ago
131 comments
82.
SmolGPT: A minimal PyTorch implementation for training a small LLM from scratch
(github.com/Om-Alve)
434 points
amrrs
a year ago
55 comments
83.
Show HN: Every Breath You Take – Heart Rate Variability Training
(github.com/kbre93)
348 points
kbre93
3 years ago
118 comments
84.
Databricks Releases 15K Record Training Corpus for Instruction Tuning LLMs
(github.com/databrickslabs)
347 points
xatalytic
3 years ago
89 comments
85.
Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training
(github.com/alainnothere)
265 points
xlayn
3 months ago
80 comments
86.
Full LLM training and evaluation toolkit
(github.com/huggingface)
249 points
testerui
2 years ago
6 comments
87.
DeepSpeed Chat: Easy, fast and affordable RLHF training of ChatGPT-like models
(github.com/microsoft)
240 points
quantisan
3 years ago
55 comments
88.
LLMs can see and hear without any training
(github.com/facebookresearch)
210 points
T-A
a year ago
66 comments
89.
Autoresearch: Agents researching on single-GPU nanochat training automatically
(github.com/karpathy)
208 points
simonpure
3 months ago
58 comments
90.
Show HN: A Python tool for text-based AI training and generation using GPT-2
(github.com/minimaxir)
174 points
minimaxir
6 years ago
41 comments