A Recipe for Training Large Models using 2nd Order Methods (Distributed Shampoo) | Heykuki News