Heykuki News

TopNewBestAskShowJobs
61.
Show HN: Mistral-7B training using PySpark, DeepSpeed (github.com/genji970)
2 points
gituser123
10 months ago
discuss
62.
Sacrificial Training (github.com/jmward01)
2 points
thunderbong
a year ago
discuss
63.
Training LLMs on 1080 Tis without shadow weights (github.com/batteryphil)
1 point
batteryphil
4 months ago
1 comment
64.
Free Ruby AI Training Materials (github.com/thedayisntgray)
1 point
thedayisntgray
a year ago
discuss
65.
A Tutorial on Training Self-Play Agents (github.com/hardmaru)
1 point
hardmaru
6 years ago
discuss
66.
Open-Source LaMDA Model
27 points
EnricoShippole
4 years ago
discuss
67.
Show HN: Fine-tuning an LLM on your code for better code completions (prvn.sh)
4 points
prvnsmpth
a year ago
discuss
68.
Nebulgym, a new open-source library that accelerates AI training (~1.5-2x)
3 points
emilec___
4 years ago
1 comment
69.
Show HN: TorchSubmit – Painless multi-node training with PyTorch (no SLURM/K8s) (github.com/dream3d-ai)
3 points
tony_francis
2 years ago
discuss
70.
Security of BIOS/UEFI System Firmware from Attacker and Defender Perspectives (github.com/advanced-threat-research)
57 points
adulau
9 years ago
3 comments
71.
I have trained StyleGAN2 from scratch with a dataset of female portraits (github.com/l4rz)
20 points
EvgeniyZh
5 years ago
20 comments
72.
Building a Simple (Android) User Interface (using JRuby / Ruboto) (github.com/KCErb)
2 points
MrBra
12 years ago
discuss
73.
ml-engineering/training/performance (github.com/stas00)
2 points
lordswork
10 months ago
discuss
74.
Radare2 from A to Z (extended edition) [reverse engineering] (github.com/radareorg)
2 points
j_s
10 years ago
discuss
75.
MNIST Training in C# – Deep Learning (github.com/deepakkumar1984)
1 point
siadroid
9 years ago
discuss
76.
Show HN: Syna – Minimal ML and RL Framework Built from Scratch with NumPy (github.com/sql-hkr)
8 points
sql-hkr
8 months ago
discuss
77.
How Are My Hyperparameters Affecting My Training Time? (github.com/sigopt)
2 points
alexcmu
10 years ago
discuss
78.
Hands-on workshops and training sessions at Universe (github.com/blog)
1 point
dwaxe
10 years ago
discuss
79.
Llm.c – LLM training in simple, pure C/CUDA (github.com/karpathy)
1050 points
tosh
2 years ago
168 comments
80.
DeepSeek open source DeepEP – library for MoE training and Inference (github.com/deepseek-ai)
536 points
helloericsf
a year ago
71 comments
81.
CoreNet: A library for training deep neural networks (github.com/apple)
494 points
rocauc
2 years ago
131 comments
82.
SmolGPT: A minimal PyTorch implementation for training a small LLM from scratch (github.com/Om-Alve)
434 points
amrrs
a year ago
55 comments
83.
Show HN: Every Breath You Take – Heart Rate Variability Training (github.com/kbre93)
348 points
kbre93
3 years ago
118 comments
84.
Databricks Releases 15K Record Training Corpus for Instruction Tuning LLMs (github.com/databrickslabs)
347 points
xatalytic
3 years ago
89 comments
85.
Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training (github.com/alainnothere)
265 points
xlayn
3 months ago
80 comments
86.
Full LLM training and evaluation toolkit (github.com/huggingface)
249 points
testerui
2 years ago
6 comments
87.
DeepSpeed Chat: Easy, fast and affordable RLHF training of ChatGPT-like models (github.com/microsoft)
240 points
quantisan
3 years ago
55 comments
88.
LLMs can see and hear without any training (github.com/facebookresearch)
210 points
T-A
a year ago
66 comments
89.
Autoresearch: Agents researching on single-GPU nanochat training automatically (github.com/karpathy)
208 points
simonpure
3 months ago
58 comments
90.
Show HN: A Python tool for text-based AI training and generation using GPT-2 (github.com/minimaxir)
174 points
minimaxir
6 years ago
41 comments