2P1-F30 連続な状態行動空間において学習可能なQ-learningの提案

山田 和明

doi:10.1299/jsmermd.2010._2p1-f30_1

書誌事項

タイトル別名

2P1-F30 Q-learning in Continuous State and Action Spaces

抄録

This paper proposes the new Q-learning that can learn mapping from continue state spaces to continue action spaces. The proposed method estimates the expectation value of actions on a state by using artificial neural networks, and decides an action according to the distribution of the estimated expectation value. In this paper, we investigate the performance of the proposed method through two types of simple experimentations.

収録刊行物

ロボティクス・メカトロニクス講演会講演概要集

ロボティクス・メカトロニクス講演会講演概要集 2010 (0), _2P1-F30_1-_2P1-F30_4, 2010

一般社団法人日本機械学会

キーワード

詳細情報詳細情報について

CRID: 1390282680914868480

NII論文ID: 110008742202

DOI: 10.1299/jsmermd.2010._2p1-f30_1

ISSN: 24243124

Web Site: https://www.jstage.jst.go.jp/article/jsmermd/2010/0/2010__2P1-F30_1/_pdf

本文言語コード: ja

データソース種別

JaLC
Crossref
CiNii Articles

抄録ライセンスフラグ: 使用不可

2P1-F30 連続な状態行動空間において学習可能なQ-learningの提案

書誌事項

抄録

収録刊行物

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

2P1-F30 連続な状態行動空間において学習可能なQ-learningの提案

書誌事項

抄録

収録刊行物

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について