教示者による学習支援に基づくエージェントのオンライン行動獲得 [in Japanese] Online Bahavior Aquisition of an Agent based on Coaching as Learning Assistance [in Japanese]
Access this Article
This paper describes a novel methodology, namely ``Coaching'', which allows humans to give a subjective evaluation to an agent in an iterative manner. This is an interactive learning method to improve the reinforcement learning by modifying a reward function dynamically according to given evaluations by a trainer and the learning situation of the agent. We demonstrate that the agent can learn different reward functions by given instructions such as ``good or bad'' by human's observation, and can also obtain a set of behavior based on the learnt reward functions through several experiments.
- Transactions of the Japanese Society for Artificial Intelligence
Transactions of the Japanese Society for Artificial Intelligence 25(6), 694-702, 2010
The Japanese Society for Artificial Intelligence