A Meta-Parameter Learning Method in Reinforcement Learning Based on Temporal Difference Error

Mizoue Hiroyuki, Kobayashi Kunikazu, Kuremoto Takashi, Obayashi Masanao

doi:10.1541/ieejeiss.129.1730

Mizoue Hiroyuki
Graduate School of Science and Engineering, Yamaguchi University

Kobayashi Kunikazu
Graduate School of Science and Engineering, Yamaguchi University

Kuremoto Takashi
Graduate School of Science and Engineering, Yamaguchi University

Obayashi Masanao
Graduate School of Science and Engineering, Yamaguchi University

Bibliographic Information

Other Title

ＴＤ誤差に基づく強化学習のメタパラメータ学習法
TD ゴサニモトズクキョウカガクシュウノメタパラメータガクシュウホウ

Abstract

In general, meta-parameters of a reinforcement learning system, such as the learning rate, are determined empirically and kept fixed during learning. Consequently, when the external environment changes, the system cannot adapt to the change. Meanwhile, it has been suggested that the biological brain performs reinforcement learning and adapts to the external environment by controlling neuromodulators that correspond to meta-parameters. Based on this suggestion, the present paper proposes a method that adjusts meta-parameters using the TD error. Computer simulations of a maze problem and an inverted pendulum control problem verify that the meta-parameters are adjusted appropriately according to the magnitude of the TD error.
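
The general idea of tying a meta-parameter to the TD error can be illustrated with a minimal sketch. The abstract does not give the paper's update equations, so everything specific here is an assumption made for illustration: the adaptation rule (a running average of the absolute TD error squashed by tanh into a bounded range), the function names, the constants, and the toy single-state task. It is not the authors' method.

```python
import math


def td_error(reward, gamma, v_next, v_current):
    """One-step temporal difference error: delta = r + gamma * V(s') - V(s)."""
    return reward + gamma * v_next - v_current


def adapt_learning_rate(avg_abs_delta, alpha_min=0.01, alpha_max=0.5, scale=1.0):
    """Hypothetical rule (an assumption, not taken from the paper).

    Maps the running average of |TD error| into [alpha_min, alpha_max]:
    large errors raise the learning rate, small errors let it fall back.
    """
    return alpha_min + (alpha_max - alpha_min) * math.tanh(scale * avg_abs_delta)


def run_self_loop(rewards, gamma=0.9, smoothing=0.1):
    """TD(0) on a toy task: one state that always transitions to itself.

    Returns a list of (alpha, delta, value) tuples, one per step.
    """
    value = 0.0
    avg_abs_delta = 0.0
    history = []
    for reward in rewards:
        # The next state is the same state, so V(s') equals V(s).
        delta = td_error(reward, gamma, value, value)
        # Exponential moving average of the TD error magnitude.
        avg_abs_delta += smoothing * (abs(delta) - avg_abs_delta)
        alpha = adapt_learning_rate(avg_abs_delta)
        value += alpha * delta
        history.append((alpha, delta, value))
    return history


if __name__ == "__main__":
    # The reward changes from 1 to 5 at step 50, imitating an environment change.
    for step, (alpha, delta, value) in enumerate(run_self_loop([1.0] * 50 + [5.0] * 50)):
        if step in (0, 49, 50, 99):
            print(f"step={step:3d} alpha={alpha:.3f} delta={delta:.3f} V={value:.3f}")
```

In this toy run the reward jump at step 50 produces a larger TD error, which in turn raises the learning rate under the assumed rule; as the value estimate catches up, the error and the learning rate shrink again.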

Journal

IEEJ Transactions on Electronics, Information and Systems

IEEJ Transactions on Electronics, Information and Systems 129 (9), 1730-1736, 2009

The Institute of Electrical Engineers of Japan

References (18)

Details

CRID: 1390282679582797824
NII Article ID: 10025102012
NII Book ID: AN10065950
DOI: 10.1541/ieejeiss.129.1730
ISSN: 13488155, 03854221
NDL BIB ID: 10421449
Web Site:
- http://id.ndl.go.jp/bib/10421449
- https://ndlsearch.ndl.go.jp/books/R000000004-I10421449
- http://www.jstage.jst.go.jp/article/ieejeiss/129/9/129_9_1730/_pdf
Text Lang: ja
Data Source:
- JaLC
- NDL
- Crossref
- CiNii Articles
- KAKEN
Abstract License Flag: Disallowed
