Markov decision processes : discrete stochastic dynamic programming

書誌事項

Markov decision processes : discrete stochastic dynamic programming

Martin L. Puterman

(Wiley series in probability and mathematical statistics, . Applied probability and statistics)

Wiley, c1994

大学図書館所蔵 件 / 65

この図書・雑誌をさがす

注記

"A Wiley-Interscience publication"

Bibliography: p. 613-642

Includes index

内容説明・目次

内容説明

A Markov chain is a sequence of events where the probability of each event is dependent on the event immediately preceding it, but independent of earlier events. Models and numerical equations are used to describe the patterns. This process is particularly useful in operations research and decision science for plotting the sequence of actions which will cause a system to perform optimally. This study provides a unified treatment of the theory, applications and computational methods for Markov decision processes. Important topics featured include action elimination methods, value iteration in the average reward case and sensitive discount optimality.

目次

Model Formulation. Examples. Finite--Horizon Markov Decision Processes. Infinite--Horizon Models: Foundations. Discounted Markov Decision Problems. The Expected Total--Reward Criterion. Average Reward and Related Criteria. The Average Reward Criterion--Multichain and Communicating Models. Sensitive Discount Optimality. Continuous--Time Models. Afterword. Notation. Appendices. Bibliography. Index.

「Nielsen BookData」 より

関連文献: 1件中  1-1を表示

詳細情報

ページトップへ