Adjustment of Discount Rate Using Index for Progress of Learning

    • OGAWA Naoko (尾川 順子)
    • Graduate School of Information Science and Technology, Univ. of Tokyo
    • NAMIKI Akio (並木 明夫)
    • CREST, JST; Graduate School of Information Science and Technology, Univ. of Tokyo

Abstract

We show that adjusting the discount rate in reinforcement learning according to an index of learning progress can be effective. In the proposed strategy, the discount rate is kept small while learning has not yet progressed far, so that immediate rewards are emphasized, and is gradually increased as learning advances, so that future rewards are also taken into account. We also propose three adjustment methods: exponential adjustment, adjustment by TD error, and adjustment by reliability. These are verified by numerical experiments on a windy gridworld task.
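As a rough illustration of the idea in the abstract, the following Python sketch raises the discount rate as learning progresses. The record does not give the paper's actual formulas, so every function name, parameter, and default value here (`gamma_min`, `gamma_max`, `tau`, `scale`) is a hypothetical choice made for illustration, not the authors' method. The reliability-based variant is omitted because the abstract does not define it.

```python
import math


def exponential_discount(episode, gamma_min=0.5, gamma_max=0.95, tau=50.0):
    """Assumed schedule: gamma rises from gamma_min toward gamma_max
    exponentially in the number of completed episodes."""
    return gamma_max - (gamma_max - gamma_min) * math.exp(-episode / tau)


def td_error_discount(mean_abs_td_error, gamma_min=0.5, gamma_max=0.95, scale=1.0):
    """Assumed schedule: a small recent mean |TD error| is treated as a sign
    of progress, so gamma moves toward gamma_max as the error shrinks."""
    progress = math.exp(-mean_abs_td_error / scale)  # equals 1.0 when the error is 0
    return gamma_min + (gamma_max - gamma_min) * progress


if __name__ == "__main__":
    # Early episodes weight immediate reward; later ones look further ahead.
    for episode in (0, 50, 200):
        print(episode, round(exponential_discount(episode), 3))
```

In a Q-learning or Sarsa loop, one of these functions would be called once per episode (or per step) and its return value used in place of a fixed discount rate in the update target.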

Journal

IEICE technical report. Neurocomputing 102(628), 73-78, 2003-01-28

The Institute of Electronics, Information and Communication Engineers

References:  17

Cited by:  7

Codes

  • NII Article ID (NAID) :
    110003232277
  • NII NACSIS-CAT ID (NCID) :
    AN10091178
  • Text Lang :
    JPN
  • Article Type :
    Journal Article
  • ISSN :
    09135685
  • NDL Article ID :
    6505500
  • NDL Source Classification :
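    ZN33 (Science and Technology -- Electrical Engineering / Electrical Machinery Industry -- Electronic Engineering / Telecommunications)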
  • NDL Call No. :
    Z16-940
  • Databases :
    CJP  CJPref  NDL  NII-ELS