Description
In this course, you will :
- Formalize problems as Markov Decision Processes.
- Understand basic exploration methods and the exploration / exploitation tradeoff.
- Understand value functions, as a general-purpose tool for optimal decision-making.
- Know how to implement dynamic programming as an efficient solution approach to an industrial control problem.