Co-Training Transformer with Videos and Images Improves Action Recognition | Heykuki News