Description
In this course, you will :
- Formalize problems as Markov Decision Processes.
 - Understand basic exploration methods and the exploration / exploitation tradeoff.
 - Understand value functions, as a general-purpose tool for optimal decision-making.
 - Know how to implement dynamic programming as an efficient solution approach to an industrial control problem.
 








