Deep RL Through Policy Optimization – P. Abbeel, J. Schulman (NIPS 2016 Slides) [pdf] | Heykuki News