I Reproduced the Inverse Reinforcement Alignment Training Method from DeepMind | Heykuki News