Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
601.
▲
Pax: A Jax-based machine learning framework for training large scale models
(github.com/google)
2 points
lnyan
2 years ago
discuss
602.
▲
Dead Simple Web UI for Training Flux LoRA with Low VRAM (12GB/16GB/20GB) Support
(github.com/cocktailpeanut)
2 points
cocktailpeanut
2 years ago
discuss
603.
▲
Distributed Training over the Internet [pdf]
(github.com/NousResearch)
2 points
FergusArgyll
2 years ago
discuss
604.
▲
Preliminary Report on DisTrO (Distributed Training Over-the-Internet) [pdf]
(github.com/NousResearch)
2 points
jasondavies
2 years ago
discuss
605.
▲
Liger Kernel: One line to make LLM Training and20% faster and -60% memory
(github.com/linkedin)
2 points
byhsu
2 years ago
discuss
606.
▲
TraDiffusion:Trajectory-Based Training-Free Image Generation
(github.com/och-mac)
2 points
lnyan
2 years ago
discuss
607.
▲
Show HN: Deep learning framework from scratch, trains GPT-2 in 3 days
(github.com/bclarkson-code)
2 points
netwrt
2 years ago
discuss
608.
▲
Show HN: Supamodel -- monitor ML training sessions on your iPhone
(supamodel.ai)
2 points
sarangzambare
2 years ago
discuss
609.
▲
Show HN: I reproduced Code Llama fill-in-the-middle code completion training
(github.com/BohdanPetryshyn)
2 points
BohdanPetryshyn
2 years ago
discuss
610.
▲
Train an AI voice model in 10 minutes
(github.com/RVC-Project)
2 points
ambigious7777
2 years ago
discuss
611.
▲
Fish-Speech 1.1: Llama-based TTS trained on 150K hours of trilingual speach data
(github.com/fishaudio)
2 points
nicognaw
2 years ago
discuss
612.
▲
Train a Low Cost Robot Arm with <30 Demonstrations
(github.com/Shaka-Labs)
2 points
vateseif
2 years ago
discuss
613.
▲
StarCoder: A language model trained on source code and natural language text
(github.com/bigcode-project)
2 points
tosh
2 years ago
discuss
614.
▲
The Era of 1-Bit LLMs: Training Tips, Code And FAQ [pdf]
(github.com/microsoft)
2 points
histories
2 years ago
discuss
615.
▲
The Era of 1 Bit LLMs – Training, Tips, Code [pdf]
(github.com/microsoft)
2 points
netsec_burn
2 years ago
discuss
616.
▲
Unified Training of Universal Time Series Forecasting Transformers
(github.com/SalesforceAIResearch)
2 points
gorold
2 years ago
discuss
617.
▲
Text Generation with Ted – Trainable Exponential Decay(s)
(github.com/blpj)
2 points
anotherpaulg
2 years ago
discuss
618.
▲
Show HN: Train a GPT to write like Shakespeare-from scratch, in one Python file
(gist.github.com)
2 points
s-casci
2 years ago
discuss
619.
▲
Reference Architecture for ML Training and Batch on GKE with Kueue
(github.com/GoogleCloudPlatform)
2 points
smarterclayton
2 years ago
discuss
620.
▲
Train GPT-V to Mimic your Browser Actions
(github.com/vignshwarar)
2 points
vignesh_warar
2 years ago
discuss
621.
▲
Show HN: Tensorli – NumPy Only Transformer Training (<650 lines)
(github.com/joennlae)
2 points
joennlae
3 years ago
discuss
622.
▲
Pax: A Jax-based machine learning framework for training large scale models
(github.com/google)
2 points
spallas
3 years ago
discuss
623.
▲
Train neural networks up to 7x faster
(github.com/mosaicml)
2 points
udev4096
3 years ago
discuss
624.
▲
Hivemind: Train deep learning models on thousands of volunteers across the world
(github.com/learning-at-home)
2 points
agnosticmantis
3 years ago
discuss
625.
▲
OSS for training, serving, and evaluating LLM based ChatBots
(github.com/lm-sys)
2 points
yujian
3 years ago
discuss
626.
▲
OpenLLaMA to train beyond 1T tokens
(github.com/openlm-research)
2 points
tosh
3 years ago
discuss
627.
▲
Curated list for LLMs: papers, training frameworks, tools to deploy, public APIs
(github.com/Hannibal046)
2 points
alister
3 years ago
discuss
628.
▲
Take a video and replace the face in it with a face of your choice. No training
(github.com/s0md3v)
2 points
draugadrotten
3 years ago
discuss
629.
▲
A PreTrainer's Guide to Training Data [pdf]
(github.com/shayne-longpre)
2 points
tim_sw
3 years ago
discuss
630.
▲
Show HN: ChatGPT based chatbot trained on your website content
(github.com/webwhiz-ai)
2 points
sachinneravath
3 years ago
discuss
More