Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
331.
▲
Microsoft releases TRELLIS dataset: 500k 3D assets for model generation training
(github.com/microsoft)
2 points
summarity
a year ago
discuss
332.
▲
Autotrain: Sota model training, open source, no code
(github.com/huggingface)
2 points
abhi1thakur
2 years ago
discuss
333.
▲
Apple ML – Reduce LLM training memory without speed loss
(github.com/apple)
2 points
taikon
2 years ago
discuss
334.
▲
Show HN: Polaris – Training On-Device AI Agents with Real User Data
(github.com/cyrilzakka)
2 points
archiv
2 years ago
discuss
335.
▲
Pax: A Jax-based machine learning framework for training large scale models
(github.com/google)
2 points
lnyan
2 years ago
discuss
336.
▲
Dead Simple Web UI for Training Flux LoRA with Low VRAM (12GB/16GB/20GB) Support
(github.com/cocktailpeanut)
2 points
cocktailpeanut
2 years ago
discuss
337.
▲
Distributed Training over the Internet [pdf]
(github.com/NousResearch)
2 points
FergusArgyll
2 years ago
discuss
338.
▲
Preliminary Report on DisTrO (Distributed Training Over-the-Internet) [pdf]
(github.com/NousResearch)
2 points
jasondavies
2 years ago
discuss
339.
▲
Liger Kernel: One line to make LLM Training and20% faster and -60% memory
(github.com/linkedin)
2 points
byhsu
2 years ago
discuss
340.
▲
TraDiffusion:Trajectory-Based Training-Free Image Generation
(github.com/och-mac)
2 points
lnyan
2 years ago
discuss
341.
▲
Show HN: Supamodel -- monitor ML training sessions on your iPhone
(supamodel.ai)
2 points
sarangzambare
2 years ago
discuss
342.
▲
Show HN: I reproduced Code Llama fill-in-the-middle code completion training
(github.com/BohdanPetryshyn)
2 points
BohdanPetryshyn
2 years ago
discuss
343.
▲
The Era of 1-Bit LLMs: Training Tips, Code And FAQ [pdf]
(github.com/microsoft)
2 points
histories
2 years ago
discuss
344.
▲
The Era of 1 Bit LLMs – Training, Tips, Code [pdf]
(github.com/microsoft)
2 points
netsec_burn
2 years ago
discuss
345.
▲
Unified Training of Universal Time Series Forecasting Transformers
(github.com/SalesforceAIResearch)
2 points
gorold
2 years ago
discuss
346.
▲
Reference Architecture for ML Training and Batch on GKE with Kueue
(github.com/GoogleCloudPlatform)
2 points
smarterclayton
2 years ago
discuss
347.
▲
Show HN: Tensorli – NumPy Only Transformer Training (<650 lines)
(github.com/joennlae)
2 points
joennlae
3 years ago
discuss
348.
▲
Pax: A Jax-based machine learning framework for training large scale models
(github.com/google)
2 points
spallas
3 years ago
discuss
349.
▲
OSS for training, serving, and evaluating LLM based ChatBots
(github.com/lm-sys)
2 points
yujian
3 years ago
discuss
350.
▲
Curated list for LLMs: papers, training frameworks, tools to deploy, public APIs
(github.com/Hannibal046)
2 points
alister
3 years ago
discuss
351.
▲
Take a video and replace the face in it with a face of your choice. No training
(github.com/s0md3v)
2 points
draugadrotten
3 years ago
discuss
352.
▲
A PreTrainer's Guide to Training Data [pdf]
(github.com/shayne-longpre)
2 points
tim_sw
3 years ago
discuss
353.
▲
Why we need Reinforcement Learning for Language Model training
(gist.github.com)
2 points
yamrzou
3 years ago
discuss
354.
▲
Blip-2: harvesting development of pretrained vision models for LLM training
(github.com/salesforce)
2 points
anigbrowl
3 years ago
discuss
355.
▲
The simplest, fastest repository for training and fine-tuning medium-sized GPTs
(github.com/karpathy)
2 points
Terretta
3 years ago
discuss
356.
▲
KataGo changes training framework from TensorFlow to PyTorch
(github.com/lightvector)
2 points
gslin
3 years ago
discuss
357.
▲
Show HN: Slack bot to monitor/stop/restart ML model training remotely
(github.com/rahuldan)
2 points
rahuldan
4 years ago
discuss
358.
▲
Colossal-AI: A Unified Deep Learning System for Large-Scale Training
(github.com/hpcaitech)
2 points
forrest_blue
4 years ago
discuss
359.
▲
Lessons learned from training 104B model
(github.com/bigscience-workshop)
2 points
EvgeniyZh
4 years ago
discuss
360.
▲
Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training
(github.com/hpcaitech)
2 points
forrest_blue
4 years ago
discuss
More