関数近似手法を用いた強化学習アルゴリズム

釜谷 博行, 藤村 敦子, 工藤 憲昌, 阿部 健一

doi:10.24704/hnctech.43.0_23

書誌事項

タイトル別名

Reinforcement Learning Algorithm with Function Approximation
カンスウキンジシュホウオモチイタキョウカガクシュウアルゴリズム

この論文をさがす

抄録

In this paper, we propose a new RL algorithm with Locally Weighted Partial Least Squares (LWPLS) as a function approximator. LWPLS is a class of techniques from nonparametric statistics that is ideally suited to reduce the computational complexity and to avoid numerical problems. The principle of LWPLS is to fit linear models using a hierarchy of univariate regressions along selected projections in input space. The projections are chosen according to the correlation between input and output data, and the algorithm assures that subsequent projections are orthogonal in input space. This new RL algorithm is compared with the usual way of quantizing the state space with grids in a mobile robot navigation task. The results of the extensive simulation demonstrate that our proposed algorithm is clearly outperforming the usual way.

収録刊行物

八戸工業高等専門学校紀要

八戸工業高等専門学校紀要 43 (0), 23-27, 2008-12-17

国立高等専門学校機構八戸工業高等専門学校

詳細情報詳細情報について

CRID: 1390564238051428224

NII論文ID: 110007126392

NII書誌ID: AN00205099

DOI: 10.24704/hnctech.43.0_23

ISSN: 24332003; 03854124

NDL書誌ID: 10236959

Web Site: https://ndlsearch.ndl.go.jp/books/R000000004-I10236959

本文言語コード: ja

データソース種別

JaLC
NDL
CiNii Articles
KAKEN

抄録ライセンスフラグ: 使用不可

関数近似手法を用いた強化学習アルゴリズム

書誌事項

この論文をさがす

抄録

収録刊行物

関連プロジェクト

詳細情報詳細情報について

書き出し

問題の指摘

関数近似手法を用いた強化学習アルゴリズム

書誌事項

この論文をさがす

抄録

収録刊行物

関連プロジェクト

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について