Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
181.
▲
Moe-LLaVA: Mixture of Experts for Large Vision-Language Models
(github.com/PKU-YuanGroup)
2 points
GaggiX
2 years ago
discuss
182.
▲
Hydra – Model of Experts
(github.com/SkunkworksAI)
2 points
tosh
3 years ago
discuss
183.
▲
Pruning GPT-OSS 4.8B to 20B (232 models)
(github.com/AmanPriyanshu)
3 points
privacyhateai
10 months ago
1 comment
184.
▲
Rails MVC VS Sproutcore MVC
(gmoeck.github.com)
44 points
gmoeck
15 years ago
8 comments
185.
▲
Show HN: A different interface for reading Hacker News
(moeffju.github.com)
36 points
moeffju
15 years ago
19 comments
186.
▲
Don't Make Your Code "More Testable"
(gmoeck.github.com)
4 points
michaelfairley
14 years ago
discuss
187.
▲
Why you should care about encapsulation
(gmoeck.github.com)
1 point
mapleoin
15 years ago
discuss
188.
▲
Löb and möb: strange loops in Haskell (2015)
(github.com/quchen)
153 points
hjnkk
3 years ago
60 comments
189.
▲
Löb and Möb: Loops in Haskell (2013)
(github.com/quchen)
91 points
fanf2
7 months ago
16 comments
190.
▲
Löb and möb: strange loops in Haskell (2013)
(github.com/quchen)
86 points
improv32
8 years ago
10 comments
191.
▲
Löb and möb: strange loops in Haskell
(github.com/quchen)
4 points
lelf
13 years ago
discuss
192.
▲
Löb and möb: strange loops in Haskell
(github.com/quchen)
4 points
isaac21259
4 years ago
discuss
193.
▲
Löb and möb: strange loops in Haskell
(github.com/quchen)
2 points
wz1000
11 years ago
discuss
194.
▲
DeepSeek open source DeepEP – library for MoE training and Inference
(github.com/deepseek-ai)
536 points
helloericsf
a year ago
71 comments
195.
▲
Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model
(github.com/MoonshotAI)
352 points
ConteMascetti71
a year ago
2 comments
196.
▲
Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model
(twitter.com)
348 points
c4pt0r
a year ago
179 comments
197.
▲
LPLB: An early research stage MoE load balancer based on linear programming
(github.com/deepseek-ai)
43 points
simonpure
7 months ago
discuss
198.
▲
DeepSeek-VL2: MoE Vision-Language Models for Advanced Multimodal Understanding
(github.com/deepseek-ai)
36 points
selvan
a year ago
7 comments
199.
▲
DeepSeek-V2: A Strong, Economical, and Efficient Moe Language Model
(github.com/deepseek-ai)
14 points
jasondavies
2 years ago
3 comments
200.
▲
A Library to build MoE from HF models
9 points
zmy999
2 years ago
6 comments
201.
▲
Show HN: Phase Router – capacity-aware routing for MoE
(github.com/TSltd)
5 points
TSltd
a month ago
1 comment
202.
▲
LongCat-Flash, a language model with 560B total parameters, MoE architecture
(github.com/meituan-longcat)
4 points
jinqueeny
9 months ago
discuss
203.
▲
HuggingFace: Support for the Mixtral Moe
(github.com/huggingface)
4 points
tosh
2 years ago
discuss
204.
▲
Slicing an 80B MoE LLM into 40B domain specialists
(github.com/JThomas-CoE)
3 points
JThomas-CoE
3 months ago
1 comment
205.
▲
Show HN: A 6.9B Moe LLM in Rust, Go, and Python
(github.com/fumi-engineer)
3 points
NightBlossom
5 months ago
1 comment
206.
▲
Shimmy v1.7.0: Running 42B Moe Models on Consumer GPUs with 99.9% VRAM Reduction
(github.com/Michael-A-Kuykendall)
3 points
MKuykendall
8 months ago
1 comment
207.
▲
Huawei's Pangu Pro MoE model is likely derived from Qwen model
(github.com/HonestAGI)
3 points
delifue
a year ago
1 comment
208.
▲
Show HN: Modernizing my old PhD work in an evening with little Qwen3.6 MoE
(github.com/verdverm)
3 points
verdverm
16 days ago
discuss
209.
▲
QMoE Support for Mixtral
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
210.
▲
OpenMoE – A family of open-sourced Mixture-of-Experts (MoE) LLMs
(github.com/XueFuzhao)
3 points
tim_sw
3 years ago
discuss
More