Econometrica: Jan, 1979, Volume 47, Issue 1
Dynamic Choice Theory and Dynamic Programming
https://doi.org/0012-9682(197901)47:1<91:DCTADP>2.0.CO;2-J
p. 91-100
David M. Kreps, Evan L. Porteus
Finite horizon sequential decision problems with a "temporal von Neumann-Morgenstern utility" criterion are analyzed. This criterion, as developed in [7], is a generalization of von Neumann-Morgenstern (expected) utility of the vector of rewards, wherein an individual's preferences concerning the timing of the resolution of uncertainty are taken into account. The preference theory underlying this criterion is reviewed and then extended in natural fashion to yield preferences for strategies in sequential decision problems. The main result is that value functions for sequential decision problems can be defined by a dynamic programming recursion using the functions which represent the original preferences, and these value functions represent the preferences defined on strategies. This permits citation of standard results from the dynamic programming literature, concerning the existence of (memory less) strategies which are optimal with respect to the given preference relation.