Scaling pretraining affects RL sample efficiency | Heykuki News