Sampling Policy that Improves Performance of Policy in Reinforcement Learning

SENDA Kei, OTSUBO RitsuSamuel

doi:10.9746/sicetr.54.365

Bibliographic Information

Other Title

強化学習における方策の性能を向上するサンプリング方策
キョウカガクシュウニオケルホウサクノセイノウオコウジョウスルサンプリングホウサク

Abstract

When a reinforcement learning method is applied, the estimation accuracy of the state transition probabilities affects the performance of the policy obtained from the estimated plant. We therefore derive a sampling condition guaranteeing, with a desired degree of reliability, that the optimal policy for the estimated plant is also optimal for the real plant, and we propose a sampling method based on this condition. The number of samples can be reduced further by sampling not until the policy is reliably optimal for the real plant, but only until the policy is effective irrespective of estimation errors. We formulate the problem of finding a policy that is guaranteed, with the desired degree of reliability, to be effective despite the errors between the estimated and real transition probabilities, and we propose a sampling method that solves this problem. The effectiveness of the proposed method is verified by numerical simulations.
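The link between sample count and estimation accuracy mentioned in the abstract can be illustrated with a minimal sketch. It is not the sampling condition proposed in the paper. It uses the generic Hoeffding inequality for a single transition probability, and the function names, the three-state example distribution, and the tolerance values are all assumptions chosen for illustration.

```python
import math
import random
from collections import Counter


def hoeffding_sample_size(epsilon, delta):
    """Samples needed so that one estimated probability is within
    epsilon of the true value with probability at least 1 - delta.

    Follows from Hoeffding's inequality:
        P(|p_hat - p| >= epsilon) <= 2 * exp(-2 * n * epsilon**2)
    """
    return math.ceil(math.log(2.0 / delta) / (2.0 * epsilon ** 2))


def estimate_transition_probs(true_probs, n_samples, rng):
    """Empirical next-state distribution for one (state, action) pair.

    true_probs is a hypothetical ground-truth distribution that stands
    in for the real plant; in practice it is unknown and only the
    sampled next states are observed.
    """
    states = list(range(len(true_probs)))
    draws = rng.choices(states, weights=true_probs, k=n_samples)
    counts = Counter(draws)
    return [counts[s] / n_samples for s in states]


# Assumed tolerance and reliability, for illustration only.
n = hoeffding_sample_size(epsilon=0.05, delta=0.05)

rng = random.Random(0)
p_hat = estimate_transition_probs([0.7, 0.2, 0.1], n, rng)
print(n, p_hat)
```

A bound of this kind targets the accuracy of the estimate itself. The abstract argues that fewer samples suffice when the requirement is only that the resulting policy remains effective under the estimation errors.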

Journal

Transactions of the Society of Instrument and Control Engineers

Transactions of the Society of Instrument and Control Engineers 54 (3), 365-372, 2018

The Society of Instrument and Control Engineers

Details

CRID: 1390282679486391808

NII Article ID: 130006512912

NII Book ID: AN00072392

DOI: 10.9746/sicetr.54.365

ISSN: 18838189; 04534654

NDL BIB ID: 028916196

Web Site: https://ndlsearch.ndl.go.jp/books/R000000004-I028916196; https://www.jstage.jst.go.jp/article/sicetr/54/3/54_365/_pdf

Text Lang: ja

Data Source

JaLC
NDL
Crossref
CiNii Articles
KAKEN

Abstract License Flag: Disallowed
