Bibliographic information
- Alternative titles
  - A Reinforcement Learning Using a Stochastic Gradient Method with Memory-Based Learning
  - カクリツテキ ケイシャホウ ト メモリベーステキナ シュホウ オ クミアワセタ キョウカ ガクシュウホウ (katakana reading of the Japanese title)
Abstract
This paper proposes a learning algorithm for agents operating in partially observable Markov decision processes (POMDPs) that combines memory-less learning with memory-based learning. In the first stage of the proposed algorithm, memory-less learning is applied, with a stochastic gradient method as the memory-less learning algorithm. During this first stage, the series of state-action pairs that accomplish the task are stored in memory. In the second stage, memory-based learning is applied. Because this stage uses only the series obtained in the first stage, the method effectively reduces the amount of memory required.

The proposed algorithm is applied to three kinds of simulations and compared with a memory-less learning algorithm. The computer simulations show that the proposed algorithm works more effectively in POMDPs than ordinary memory-less learning.
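The two-stage idea in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy POMDP (`OBS`, `NEXT`, `GOAL`), the REINFORCE-style softmax update standing in for the stochastic gradient method, and the history-keyed lookup table standing in for the memory-based learner. The abstract does not give these details.

```python
import math
import random

# Hypothetical toy POMDP (not from the paper): states 1 and 2 emit the same
# observation but need different actions, so no deterministic memory-less
# policy can reach the goal.
OBS = {0: 0, 1: 1, 2: 1}
GOAL = 3
NEXT = {(0, 0): 1, (0, 1): 0, (1, 0): 2, (1, 1): 0, (2, 0): 0, (2, 1): GOAL}


def prob_action1(theta, obs):
    """Probability of action 1 under a two-action softmax policy."""
    return 1.0 / (1.0 + math.exp(theta[obs][0] - theta[obs][1]))


def run_episode(policy, max_steps=20):
    """Roll out policy(history, obs) -> action; return (trajectory, success)."""
    state, traj = 0, []
    for _ in range(max_steps):
        obs = OBS[state]
        action = policy(tuple(traj), obs)
        traj.append((obs, action))
        state = NEXT[(state, action)]
        if state == GOAL:
            return traj, True
    return traj, False


def stage1(episodes=200, lr=0.1, seed=0):
    """Memory-less stage: stochastic policy trained by a gradient update.

    Assumption: a REINFORCE-style update with reward 1 on success.
    Successful observation-action series are stored for stage 2.
    """
    rng = random.Random(seed)
    theta = {0: [0.0, 0.0], 1: [0.0, 0.0]}
    successes = []

    def policy(_history, obs):  # ignores history: memory-less
        return 1 if rng.random() < prob_action1(theta, obs) else 0

    for _ in range(episodes):
        traj, ok = run_episode(policy)
        if not ok:
            continue  # zero reward gives a zero update in this sketch
        successes.append(traj)
        for obs, action in traj:
            p1 = prob_action1(theta, obs)
            probs = [1.0 - p1, p1]
            for a in (0, 1):
                # d log pi(action | obs) / d theta[obs][a]
                grad = (1.0 if a == action else 0.0) - probs[a]
                theta[obs][a] += lr * grad
    return theta, successes


def stage2(successes):
    """Memory-based stage (assumed form): key actions on the episode history,
    using only the shortest stored successful series."""
    best = min(successes, key=len)
    return {(tuple(best[:t]), obs): action
            for t, (obs, action) in enumerate(best)}


def memory_policy(table):
    """Deterministic policy that looks up the action from (history, obs)."""
    return lambda history, obs: table[(history, obs)]
```

Calling `stage1()` and then `run_episode(memory_policy(stage2(successes)))` replays the shortest stored series. Because the table holds entries only for series that already succeeded, its size is bounded by the length of one series rather than by the number of possible histories, which is the memory saving the abstract points to.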
Published in
- 電気学会論文誌C(電子・情報・システム部門誌) (IEEJ Transactions on Electronics, Information and Systems) 128 (7), 1123-1130, 2008
- Publisher: 一般社団法人 電気学会 (The Institute of Electrical Engineers of Japan)
Details
- CRID: 1390282679581749632
- NII article ID: 10021133129
- NII bibliographic ID: AN10065950
- ISSN: 13488155, 03854221
- NDL bibliographic ID: 9564339
- Text language code: ja
- Data source types: JaLC, NDL, Crossref, CiNii Articles
- Abstract license flag: use not permitted