確率的傾斜法とメモリベース的な手法を組み合わせた強化学習法

山田 孝文, 山口 智

doi:10.1541/ieejeiss.128.1123

書誌事項

タイトル別名

A Reinforcement Learning Using a Stochastic Gradient Method with Memory-Based Learning
カクリツテキケイシャホウトメモリベーステキナシュホウオクミアワセタキョウカガクシュウホウ

この論文をさがす

抄録

In this paper, for agents working on POMDP, a learning algorithm combining the memory-less learning and the memory-based learning is proposed. At first stage of the propposed algorithm, memory-less learning is applied. As a memory-less learning algorithm, the stochastic gradient method is employed. While the first stage, a state-action set series that accmplish the task is stored in memory. In the second stage, the memory-based learning is applied. In this process, only the series that obtained the first stage is used, so that this method is able to reduce the number of required memory effectively.<br>The proposed algorithm are applied three kinds of simulation to be compared with memory-less learning algorithm. Through the computer simulations, it shown that the proposed algorithms works effectively in POMDP than ordinary memory-less learnings.

収録刊行物

電気学会論文誌Ｃ（電子・情報・システム部門誌）

電気学会論文誌Ｃ（電子・情報・システム部門誌） 128 (7), 1123-1130, 2008

一般社団法人電気学会

キーワード

詳細情報詳細情報について

CRID: 1390282679581749632

NII論文ID: 10021133129

NII書誌ID: AN10065950

DOI: 10.1541/ieejeiss.128.1123

ISSN: 13488155; 03854221

NDL書誌ID: 9564339

Web Site: https://ndlsearch.ndl.go.jp/books/R000000004-I9564339; http://www.jstage.jst.go.jp/article/ieejeiss/128/7/128_7_1123/_pdf

本文言語コード: ja

データソース種別

JaLC
NDL
Crossref
CiNii Articles

抄録ライセンスフラグ: 使用不可

確率的傾斜法とメモリベース的な手法を組み合わせた強化学習法

書誌事項

この論文をさがす

抄録

収録刊行物

参考文献 (19)*注記

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

確率的傾斜法とメモリベース的な手法を組み合わせた強化学習法

書誌事項

この論文をさがす

抄録

収録刊行物

参考文献 (19)*注記

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について