DescriptionApply gradient-based supervised machine learning methods to reinforcement learning. Understand the relationship between reinforcement learning and psychology. Implement 17 different reinforcement learning algorithms. Understand reinforcement learning on a technical level.
What’s covered in this course?
The multi-armed bandit problem and the explore-exploit dilemma
Ways to calculate means and moving averages and their relationship to stochastic gradient descent
Markov Decision Processes (MDPs)
Temporal Difference (TD) Learning
Approximation Methods (i.e. how to plug in a deep neural network or other differentiable model into your RL algorithm)