Deep Reinforcement Learning
Module 2

Policy Gradient Algorithms

Vanilla Policy Gradient (VPG), Trust Region Policy Optimization (TRPO), and Proximal Policy Optimization (PPO): the on-policy family of algorithms that directly optimize the policy objective.

3 readings, 1 quiz, 1 lab
Readings
Quizzes
Labs