Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
751.
▲
ARCHE3-7B – Sparse Moe with SmartRouter and Foundation Curriculum Training
1 point
OpenSynapseLabs
2 months ago
discuss
752.
▲
Grove: Distributed ML Training Across MacBooks
(github.com/swarnim-j)
1 point
hasheddan
2 months ago
discuss
753.
▲
Show HN: Sigil – A zero-knowledge steganography vault for AI training data
1 point
nishalk
2 months ago
discuss
754.
▲
Rust Training
(github.com/microsoft)
1 point
dcuthbertson
2 months ago
discuss
755.
▲
Train the smallest LM you can that fits in 16MB. Best model wins
(github.com/openai)
1 point
bilsbie
2 months ago
discuss
756.
▲
Show HN: Mamba SSM in Rust – training and inference with custom CUDA kernels
(github.com/silvermpx)
1 point
silvermpx
3 months ago
discuss
757.
▲
Native Pytorch distributed training backend for Apple Silicon
(github.com/mps-ddp)
1 point
sassoshots44
3 months ago
discuss
758.
▲
Physics-based validation for sensor data before ML training
(github.com/timbo4u1)
1 point
s2sphysical
3 months ago
discuss
759.
▲
Show HN: ResonanceNet – Proof-of-Training Blockchain
(github.com/Kristian5013)
1 point
kristianXXI
3 months ago
discuss
760.
▲
Ask HN: Should training bottleneck detection be a product or just a feature?
1 point
traceopt-ai
3 months ago
discuss
761.
▲
Show HN: Quantum-PULSE – compress-then-encrypt vault for LLM training data
(github.com/Naveenub)
1 point
naveenub
3 months ago
discuss
762.
▲
I trained an LLM from loss 11.47 to loss 2.35 on one TPU v5e for $1.16
(github.com/2001sameersharma)
1 point
twodollarllm
3 months ago
discuss
763.
▲
Show HN: I trained a small local model to translate natural language to CLI
(github.com/spicy-lemonade)
1 point
kiki_kuuki
3 months ago
discuss
764.
▲
Show HN: easy-torch-tpu – A Flexible Training Pipeline for PyTorch Models on TPU
(github.com/aklein4)
1 point
in-silico
3 months ago
discuss
765.
▲
SteptronOss: Lightweight, AI-native training framework for large language models
(github.com/stepfun-ai)
1 point
limoce
3 months ago
discuss
766.
▲
Show HN: Synthesize complex agent training data with just a few lines of code
(github.com/OpenDCAI)
1 point
Junnn
3 months ago
discuss
767.
▲
Show HN: Train a 230KB text classifier from 50 examples – no API keys, no GPU
(github.com/expressibleai)
1 point
veniyer
3 months ago
discuss
768.
▲
Show HN: MicroGPT-C – C99 GPT for Edge Training and Tiny Model Pipelines
(github.com/enjector)
1 point
Ajay__soni
4 months ago
discuss
769.
▲
Peon Training feature piggybacks on AI coding session
(github.com/PeonPing)
1 point
mthwsjc_
4 months ago
discuss
770.
▲
MicroGPT: train & inference in 243 lines of code
(gist.github.com)
1 point
RyanShook
4 months ago
discuss
771.
▲
MicroGPT - Train and inference a GPT in pure, dependency-free Python (200 lines)
(gist.github.com)
1 point
susam
4 months ago
discuss
772.
▲
The Yellow Wallpaper Problem: RLHF Safety Training as Ontology Enforcement
(github.com/Palmerschallon)
1 point
palmerschallon
4 months ago
discuss
773.
▲
Theseus - Train like a foundation lab
(github.com/Jemoka)
1 point
shetaye
4 months ago
discuss
774.
▲
Strangerbench: A benchmark for AI forecasting after training cut-off dates
(github.com/firasd)
1 point
firasd
4 months ago
discuss
775.
▲
Train AI models in 3 clicks
(github.com/belocci)
1 point
belocci
4 months ago
discuss
776.
▲
Show HN: WLM – A 70B model trained to decode "I'm fine" with 94.7% accuracy
(github.com/gabewillen)
1 point
gwillen85
4 months ago
discuss
777.
▲
Kauldron: Modular, scalable library to train ML models
(github.com/google-research)
1 point
lairv
4 months ago
discuss
778.
▲
Zero Training One-Shot Neural Networks
(github.com/117l11)
1 point
117l11
5 months ago
discuss
779.
▲
Show HN: Train Core ML models from the command line
(github.com/schappim)
1 point
schappim
5 months ago
discuss
780.
▲
Contrakit: Predicting Model Hallucination Before Training
(github.com/off-by-some)
1 point
off-by-some
5 months ago
discuss
More