Heykuki News

Top | New | Best | Ask | Show | Jobs
601.
Pax: A Jax-based machine learning framework for training large scale models (github.com/google)
2 points
lnyan
2 years ago
discuss
602.
Dead Simple Web UI for Training Flux LoRA with Low VRAM (12GB/16GB/20GB) Support (github.com/cocktailpeanut)
2 points
cocktailpeanut
2 years ago
discuss
603.
Distributed Training over the Internet [pdf] (github.com/NousResearch)
2 points
FergusArgyll
2 years ago
discuss
604.
Preliminary Report on DisTrO (Distributed Training Over-the-Internet) [pdf] (github.com/NousResearch)
2 points
jasondavies
2 years ago
discuss
605.
Liger Kernel: One line to make LLM Training +20% faster and -60% memory (github.com/linkedin)
2 points
byhsu
2 years ago
discuss
606.
TraDiffusion: Trajectory-Based Training-Free Image Generation (github.com/och-mac)
2 points
lnyan
2 years ago
discuss
607.
Show HN: Deep learning framework from scratch, trains GPT-2 in 3 days (github.com/bclarkson-code)
2 points
netwrt
2 years ago
discuss
608.
Show HN: Supamodel -- monitor ML training sessions on your iPhone (supamodel.ai)
2 points
sarangzambare
2 years ago
discuss
609.
Show HN: I reproduced Code Llama fill-in-the-middle code completion training (github.com/BohdanPetryshyn)
2 points
BohdanPetryshyn
2 years ago
discuss
610.
Train an AI voice model in 10 minutes (github.com/RVC-Project)
2 points
ambigious7777
2 years ago
discuss
611.
Fish-Speech 1.1: Llama-based TTS trained on 150K hours of trilingual speech data (github.com/fishaudio)
2 points
nicognaw
2 years ago
discuss
612.
Train a Low Cost Robot Arm with <30 Demonstrations (github.com/Shaka-Labs)
2 points
vateseif
2 years ago
discuss
613.
StarCoder: A language model trained on source code and natural language text (github.com/bigcode-project)
2 points
tosh
2 years ago
discuss
614.
The Era of 1-Bit LLMs: Training Tips, Code And FAQ [pdf] (github.com/microsoft)
2 points
histories
2 years ago
discuss
615.
The Era of 1 Bit LLMs – Training, Tips, Code [pdf] (github.com/microsoft)
2 points
netsec_burn
2 years ago
discuss
616.
Unified Training of Universal Time Series Forecasting Transformers (github.com/SalesforceAIResearch)
2 points
gorold
2 years ago
discuss
617.
Text Generation with Ted – Trainable Exponential Decay(s) (github.com/blpj)
2 points
anotherpaulg
2 years ago
discuss
618.
Show HN: Train a GPT to write like Shakespeare, from scratch, in one Python file (gist.github.com)
2 points
s-casci
2 years ago
discuss
619.
Reference Architecture for ML Training and Batch on GKE with Kueue (github.com/GoogleCloudPlatform)
2 points
smarterclayton
2 years ago
discuss
620.
Train GPT-V to Mimic your Browser Actions (github.com/vignshwarar)
2 points
vignesh_warar
2 years ago
discuss
621.
Show HN: Tensorli – NumPy Only Transformer Training (<650 lines) (github.com/joennlae)
2 points
joennlae
3 years ago
discuss
622.
Pax: A Jax-based machine learning framework for training large scale models (github.com/google)
2 points
spallas
3 years ago
discuss
623.
Train neural networks up to 7x faster (github.com/mosaicml)
2 points
udev4096
3 years ago
discuss
624.
Hivemind: Train deep learning models on thousands of volunteers across the world (github.com/learning-at-home)
2 points
agnosticmantis
3 years ago
discuss
625.
OSS for training, serving, and evaluating LLM based ChatBots (github.com/lm-sys)
2 points
yujian
3 years ago
discuss
626.
OpenLLaMA to train beyond 1T tokens (github.com/openlm-research)
2 points
tosh
3 years ago
discuss
627.
Curated list for LLMs: papers, training frameworks, tools to deploy, public APIs (github.com/Hannibal046)
2 points
alister
3 years ago
discuss
628.
Take a video and replace the face in it with a face of your choice. No training (github.com/s0md3v)
2 points
draugadrotten
3 years ago
discuss
629.
A PreTrainer's Guide to Training Data [pdf] (github.com/shayne-longpre)
2 points
tim_sw
3 years ago
discuss
630.
Show HN: ChatGPT based chatbot trained on your website content (github.com/webwhiz-ai)
2 points
sachinneravath
3 years ago
discuss