Training 3x larger model on the same GPU cards | Heykuki News