Approximation Bayesian Reinforcement Learning based on Estimation of Plant Variation and its Application to Peg-in-Hole Task

Bibliographic Information

Other Title
  • プラント変動の推定に基づく近似ベイジアン強化学習とペグ・イン・ホール・タスクへの適用
  • プラント ヘンドウ ノ スイテイ ニ モトズク キンジ ベイジアン キョウカ ガクシュウ ト ペグ ・ イン ・ ホール ・ タスク エ ノ テキヨウ

Abstract

In a general reinforcement learning problem, the plant, i.e. its state transition probabilities, is estimated, and a policy learned for the estimated plant is applied to the real plant. If the estimated plant differs from the real plant, the learned policy may not work well on the real plant. In this study, the variation of the real plant is parameterized as an interpolation of several estimated plants. This study proposes a reinforcement learning method based on estimating this variation parameter and applies it to a two-dimensional peg-in-hole task. The effectiveness of the proposed method is demonstrated by numerical and experimental results.
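
The idea of parameterizing the real plant as an interpolation of estimated plants can be illustrated with a minimal sketch. It is not the method of the paper: it assumes only two estimated plants, a single action, a scalar interpolation parameter theta in [0, 1], a uniform prior, and a grid approximation of the posterior. All names and numbers (P0, P1, GRID, observed, interpolate_plant, posterior_over_theta) are hypothetical.

```python
import math


def interpolate_plant(p0, p1, theta):
    """Return the convex combination (1 - theta) * p0 + theta * p1 of two transition matrices."""
    return [
        [(1.0 - theta) * a + theta * b for a, b in zip(row0, row1)]
        for row0, row1 in zip(p0, p1)
    ]


def posterior_over_theta(p0, p1, transitions, grid):
    """Grid posterior over theta (uniform prior) given observed (state, next_state) pairs."""
    log_post = []
    for theta in grid:
        plant = interpolate_plant(p0, p1, theta)
        # Log-likelihood of the observed transitions under the interpolated plant.
        # The small floor avoids log(0) for transitions a plant rules out.
        ll = sum(math.log(max(plant[s][s_next], 1e-12)) for s, s_next in transitions)
        log_post.append(ll)
    # Normalize in a numerically stable way.
    peak = max(log_post)
    weights = [math.exp(lp - peak) for lp in log_post]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical two-state estimated plants (each row sums to 1).
P0 = [[0.9, 0.1], [0.2, 0.8]]
P1 = [[0.5, 0.5], [0.6, 0.4]]
GRID = [i / 10 for i in range(11)]

# Hypothetical transitions observed on the real plant.
observed = [(0, 0), (0, 1), (0, 0), (1, 0), (1, 1), (0, 1)]

posterior = posterior_over_theta(P0, P1, observed, GRID)
theta_map = GRID[posterior.index(max(posterior))]
```

In this sketch a policy would then be computed for the plant interpolated at theta_map, or averaged over the posterior. How the paper actually estimates the parameter and selects a policy is not stated in the abstract.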
