Search: github.com/ftrain | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

151.

ml-engineering/training/performance (github.com/stas00)

2 points

10 months ago

152.

Lessons learned from training 104B model (github.com/bigscience-workshop)

2 points

4 years ago

153.

Radare2 from a to Z (extended edition) [reverse engineering] (github.com/radareorg)

2 points

10 years ago

154.

MNIST Training in C# – Deep Learning (github.com/deepakkumar1984)

1 point

9 years ago

155.

Launch HN: Deepsilicon (YC S24) – Software and hardware for ternary transformers

189 points

2 years ago

156.

Show HN: TabPFN v2 – A SOTA foundation model for small tabular data (nature.com)

153 points

a year ago

157.

Show HN: ART – a new open-source RL framework for training agents (github.com/OpenPipe)

116 points

a year ago

158.

IG65M-PyTorch: video models pre-trained on over 65M Instagram videos

10 points

7 years ago

159.

Show HN: Syna – Minimal ML and RL Framework Built from Scratch with NumPy (github.com/sql-hkr)

8 points

8 months ago

160.

Ask HN: AI Voice Reverse

3 points

2 years ago

161.

Show HN: RETVec: Resilient and Efficient Text Vectorizer

3 points

3 years ago

162.

Transformer from scratch HTTPS://github.com/Eamon2009/Transformer-language-model

2 points

2 months ago

163.

Full forward pass of GPT-2 in one file of pure CUDA (github.com/karpathy)

63 points

2 years ago

164.

Show HN: 6DoF Object detection and tracking in web browser – WebAR.rocks.train (github.com/WebAR-rocks)

12 points

a year ago

165.

Show HN: MXNet Implementation of Quantization Aware Training (github.com/Ldpe2G)

2 points

7 years ago

166.

How Are My Hyperparameters Affecting My Training Time? (github.com/sigopt)

2 points

10 years ago

167.

Hands-on workshops and training sessions at Universe (github.com/blog)

1 point

10 years ago

168.

Llm.c – LLM training in simple, pure C/CUDA (github.com/karpathy)

1050 points

2 years ago

169.

History LLMs: Models trained exclusively on pre-1913 texts (github.com/DGoettlich)

897 points

6 months ago

170.

If you don't opt out by Apr 24 GitHub will train on your private repos

745 points

2 months ago

171.

TimeCapsuleLLM: LLM trained only on data from 1800-1875 (github.com/haykgrigo3)

737 points

5 months ago

172.

Gpt4all: A chatbot trained on ~800k GPT-3.5-Turbo Generations based on LLaMa (github.com/nomic-ai)

593 points

3 years ago

173.

DeepSeek open source DeepEP – library for MoE training and Inference (github.com/deepseek-ai)

536 points

a year ago

174.

CoreNet: A library for training deep neural networks (github.com/apple)

494 points

2 years ago

175.

Train Your Own LLM from Scratch (github.com/angelos-p)

478 points

a month ago

176.

SmolGPT: A minimal PyTorch implementation for training a small LLM from scratch (github.com/Om-Alve)

434 points

a year ago

177.

Show HN: Every Breath You Take – Heart Rate Variability Training (github.com/kbre93)

348 points

3 years ago

178.

Databricks Releases 15K Record Training Corpus for Instruction Tuning LLMs (github.com/databrickslabs)

347 points

3 years ago

179.

Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training (github.com/alainnothere)

265 points

3 months ago

180.

Full LLM training and evaluation toolkit (github.com/huggingface)

249 points

2 years ago