適応的探索法を用いた強化学習 (Reinforcement Learning Using Adaptive Search Method)

Bibliographic Information

Alternative Titles
  • Reinforcement Learning Using Adaptive Search Method
  • テキオウテキ タンサクホウ オ モチイタ キョウカ ガクシュウ

Abstract

We propose an adaptive probability density function (PDF) for selecting effective actions in reinforcement learning (RL). Actions are often selected by sampling from a uniform or normal distribution. These distributions, however, do not take information about the search direction into account. By using this information, the proposed method reduces the number of trials that RL requires, which is necessary for learning in real environments. Furthermore, the proposed method can be applied easily to various RL methods, such as actor-critic and the stochastic gradient ascent method. The performance of the proposed method is demonstrated by computer simulations.
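
The abstract does not give the form of the adaptive PDF, so the following Python sketch is only one possible reading of the idea, not the authors' method. It assumes a one-dimensional action, a normal PDF whose centre is shifted along a remembered search direction, and a toy one-step task. The names `sample_action`, `update_direction`, `run`, `bias`, and `rate` are all hypothetical.

```python
import random


def sample_action(mean, sigma, direction, bias, rng):
    """Sample a 1-D action from a normal PDF whose centre is shifted
    along the remembered search direction (assumed scheme).
    With bias = 0 this is the ordinary normal exploration PDF."""
    return rng.gauss(mean + bias * sigma * direction, sigma)


def update_direction(direction, action, mean, advantage, rate=0.5):
    """Blend the remembered direction (kept in [-1, 1]) with the sign of
    the latest perturbation: toward it if the advantage was positive,
    away from it if the advantage was negative."""
    perturbation = action - mean
    if perturbation == 0.0 or advantage == 0.0:
        return direction
    sign_p = 1.0 if perturbation > 0 else -1.0
    sign_a = 1.0 if advantage > 0 else -1.0
    return (1.0 - rate) * direction + rate * sign_p * sign_a


def run(bias, trials=200, seed=0):
    """Toy one-step task (an assumption): reward = -(a - 3)^2,
    policy mean starts at 0. Returns the final policy mean."""
    rng = random.Random(seed)
    mean, sigma, direction = 0.0, 0.5, 0.0
    baseline = None          # running average of rewards
    learning_rate = 0.1
    for _ in range(trials):
        action = sample_action(mean, sigma, direction, bias, rng)
        reward = -(action - 3.0) ** 2
        if baseline is None:
            baseline = reward
        advantage = reward - baseline
        baseline += 0.1 * advantage
        direction = update_direction(direction, action, mean, advantage)
        if advantage > 0:
            # Move the mean toward actions that beat the baseline.
            mean += learning_rate * (action - mean)
    return mean


if __name__ == "__main__":
    print("plain normal PDF :", run(bias=0.0))
    print("direction-biased :", run(bias=1.0))
```

Setting `bias=0.0` recovers direction-free normal exploration, so the two calls at the bottom give a rough side-by-side comparison on this toy task. Any difference seen here says nothing about the results reported in the paper.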
