PhD Student @ RPI, Writer of Tidbits, and Linux Enthusiast
Chapter 1: An Introduction
Chapter 2: Multi-armed Bandits
Chapter 3: Markov Decision Processes
Chapter 4: Dynamic Programming
Chapter 5: Monte Carlo Methods