Motion Planning of a Mobile Robot Using Reinforcement Learning
-
- Igarashi Harukazu
- School of Engineering, Kinki University.
Bibliographic Information
- Other Title
-
- 強化学習を用いた自律移動型ロボットの行動計画法の提案
- キョウカ ガクシュウ オ モチイタ ジリツ イドウガタ ロボット ノ コウドウ ケイカクホウ ノ テイアン
Search this article
Abstract
In a previous paper, we proposed a solution to navigation of a mobile robot. In our approach, we formulated the following two problems at each time step as discrete optimization problems: 1) estimation of position and direction of a robot, and 2)action decision. While the results of our simulation showed the effectiveness of our approach, the values of weights in the objective functions were given by a heuristic method. This paper presents a theoretical method using reinforcement learning for adjusting the weight parameters in the objective function that includes pieces of heuristic knowledge on the action decision. In our reinforcement learning, the expectation of a reward given to a robot’s trajectory is defined as the value function to maximize. The robot’s trajectories are generated stochastically because we used a probabilistic policy for determining actions of a robot to search for the global optimal trajectory. However, this decision process is not a Markov decision process because the objective function includes an action at the previous time. Thus, Q-learning, which is a conventional method of reinforcement learning, cannot be applied to this problem. In this paper, we applied Williams’s episodic REINFORCE approach to the action decision and derived a learning rule for the weight parameters of the objective function. Moreover, we applied the stochastic hill-climbing method to maximizing the value function to reduce computation time. The learning rule was verified by our experiment.
Journal
-
- Transactions of the Japanese Society for Artificial Intelligence
-
Transactions of the Japanese Society for Artificial Intelligence 16 501-509, 2001
The Japanese Society for Artificial Intelligence
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1390282680083500416
-
- NII Article ID
- 10015770439
- 10019668373
- 30009884202
-
- NII Book ID
- AA11579226
-
- ISSN
- 13468030
- 13460714
-
- NDL BIB ID
- 5987922
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- NDL
- Crossref
- CiNii Articles
-
- Abstract License Flag
- Disallowed