1.5x Faster Moe Training with Custom MXFP8 Kernels | Heykuki News