Total Expected Discounted Reward MDPs: Existence of Optimal Policies
Published Online: 15 FEB 2011
Copyright © 2010 John Wiley & Sons, Inc. All rights reserved.
Wiley Encyclopedia of Operations Research and Management Science
How to Cite
Feinberg, E. A. 2011. Total Expected Discounted Reward MDPs: Existence of Optimal Policies. Wiley Encyclopedia of Operations Research and Management Science.
This article surveys results on the existence of optimal and nearly optimal policies for Markov decision processes (MDPs) with total expected discounted rewards. The problem of optimizing total expected discounted rewards for MDPs is also known as discounted dynamic programming.
- Markov decision process;
- dynamic programming;
- reward function;
- optimal policy;
- discounted rewards