PhD Student @ RPI studying Automated Reasoning in AI and Linux Enthusiast.
Chapter 1: An Introduction
Chapter 2: Multi-armed Bandits
Chapter 3: Markov Decision Processes
Chapter 4: Dynamic Programming
Chapter 5: Monte Carlo Methods