Heykuki News

181.
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models (github.com/PKU-YuanGroup)
2 points
GaggiX
2 years ago
discuss
182.
Hydra – Model of Experts (github.com/SkunkworksAI)
2 points
tosh
3 years ago
discuss
183.
Pruning GPT-OSS 4.8B to 20B (232 models) (github.com/AmanPriyanshu)
3 points
privacyhateai
10 months ago
1 comment
184.
Rails MVC vs. SproutCore MVC (gmoeck.github.com)
44 points
gmoeck
15 years ago
8 comments
185.
Show HN: A different interface for reading Hacker News (moeffju.github.com)
36 points
moeffju
15 years ago
19 comments
186.
Don't Make Your Code "More Testable" (gmoeck.github.com)
4 points
michaelfairley
14 years ago
discuss
187.
Why you should care about encapsulation (gmoeck.github.com)
1 point
mapleoin
15 years ago
discuss
188.
Löb and möb: strange loops in Haskell (2015) (github.com/quchen)
153 points
hjnkk
3 years ago
60 comments
189.
Löb and Möb: Loops in Haskell (2013) (github.com/quchen)
91 points
fanf2
7 months ago
16 comments
190.
Löb and möb: strange loops in Haskell (2013) (github.com/quchen)
86 points
improv32
8 years ago
10 comments
191.
Löb and möb: strange loops in Haskell (github.com/quchen)
4 points
lelf
13 years ago
discuss
192.
Löb and möb: strange loops in Haskell (github.com/quchen)
4 points
isaac21259
4 years ago
discuss
193.
Löb and möb: strange loops in Haskell (github.com/quchen)
2 points
wz1000
11 years ago
discuss
194.
DeepSeek open source DeepEP – library for MoE training and Inference (github.com/deepseek-ai)
536 points
helloericsf
a year ago
71 comments
195.
Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model (github.com/MoonshotAI)
352 points
ConteMascetti71
a year ago
2 comments
196.
Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model (twitter.com)
348 points
c4pt0r
a year ago
179 comments
197.
LPLB: An early research stage MoE load balancer based on linear programming (github.com/deepseek-ai)
43 points
simonpure
7 months ago
discuss
198.
DeepSeek-VL2: MoE Vision-Language Models for Advanced Multimodal Understanding (github.com/deepseek-ai)
36 points
selvan
a year ago
7 comments
199.
DeepSeek-V2: A Strong, Economical, and Efficient MoE Language Model (github.com/deepseek-ai)
14 points
jasondavies
2 years ago
3 comments
200.
A Library to build MoE from HF models
9 points
zmy999
2 years ago
6 comments
201.
Show HN: Phase Router – capacity-aware routing for MoE (github.com/TSltd)
5 points
TSltd
a month ago
1 comment
202.
LongCat-Flash, a language model with 560B total parameters, MoE architecture (github.com/meituan-longcat)
4 points
jinqueeny
9 months ago
discuss
203.
HuggingFace: Support for the Mixtral MoE (github.com/huggingface)
4 points
tosh
2 years ago
discuss
204.
Slicing an 80B MoE LLM into 40B domain specialists (github.com/JThomas-CoE)
3 points
JThomas-CoE
3 months ago
1 comment
205.
Show HN: A 6.9B MoE LLM in Rust, Go, and Python (github.com/fumi-engineer)
3 points
NightBlossom
5 months ago
1 comment
206.
Shimmy v1.7.0: Running 42B MoE Models on Consumer GPUs with 99.9% VRAM Reduction (github.com/Michael-A-Kuykendall)
3 points
MKuykendall
8 months ago
1 comment
207.
Huawei's Pangu Pro MoE model is likely derived from Qwen model (github.com/HonestAGI)
3 points
delifue
a year ago
1 comment
208.
Show HN: Modernizing my old PhD work in an evening with little Qwen3.6 MoE (github.com/verdverm)
3 points
verdverm
16 days ago
discuss
209.
QMoE Support for Mixtral (github.com/ggerganov)
3 points
tosh
2 years ago
discuss
210.
OpenMoE – A family of open-sourced Mixture-of-Experts (MoE) LLMs (github.com/XueFuzhao)
3 points
tim_sw
3 years ago
discuss