Skip to main content

SIAG/OPT Prize Lecture: Efficiency of the Simplex and Policy Iteration Methods for Markov Decision Processes