主観報酬を用いた強化学習によるイモムシ型ロボットの行動獲得

山科 亮太, 黒田 将史, 藪田 哲郎

doi:10.1299/kikaic.79.366

書誌事項

タイトル別名

Caterpillar Robot Locomotion Based on Reinforcement Learning Using Subjective Reward

抄録

This note presents an application of reinforcement learning to caterpillar robot locomotion. An excellent advantage of reinforcement learning is that an action can be acquired using only a simple reward. In our previous work, the reward was a forward distance measured using a sensor. This reward was completely an “Objective Reward.” On the other hand, this study uses the reward given by the human's subjective judgment, which is defined as “Subjective Reward.” The main purpose of this study is to compare its performance between the “Objective Reward” obtained from the sensor and the “Subjective Reward” given by a human teacher. The results show that the “Subjective Reward” can give better results than that of the “Objective Reward”, because the “Subjective Reward” has more information than the “Objective Reward”. On the other hand, this note discusses the good teacher who gives an excellent “Subjective Reward”

収録刊行物

日本機械学会論文集Ｃ編

日本機械学会論文集Ｃ編 79 (798), 366-370, 2013

一般社団法人日本機械学会

キーワード

詳細情報詳細情報について

CRID: 1390001206388497920

NII論文ID: 130003374931

DOI: 10.1299/kikaic.79.366

ISSN: 18848354; 03875024

Web Site: https://www.jstage.jst.go.jp/article/kikaic/79/798/79_366/_pdf

本文言語コード: ja

データソース種別

JaLC
Crossref
CiNii Articles
KAKEN

抄録ライセンスフラグ: 使用不可

主観報酬を用いた強化学習によるイモムシ型ロボットの行動獲得

書誌事項

抄録

収録刊行物

関連プロジェクト

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

主観報酬を用いた強化学習によるイモムシ型ロボットの行動獲得

書誌事項

抄録

収録刊行物

関連プロジェクト

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について