Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
Show HN: Adaptive-K – Cut MoE inference costs 30-50% with entropy-guided routing (github.com/Gabrobals)
1 point
Gabrielebalsamo
5 months ago
discuss