On the Effects of Domain Size and Complexity in Empirical Distribution of Reinforcement Learning(Artificial Intelligence and Cognitive Science)

    • IWATA Kazunori
    • the Department of Systems Science, Graduate School of Informatics, Kyoto University
    • IKEDA Kazushi
    • the Department of Systems Science, Graduate School of Informatics, Kyoto University
    • SAKAI Hideaki
    • the Department of Systems Science, Graduate School of Informatics, Kyoto University

Abstract

We regard the events of a Markov decision process as the outputs from a Markov information source in order to analyze the randomness of an empirical sequence by the codeword length of the sequence. The randomness is an important viewpoint in reinforcement learning since the learning is to eliminate the randomness and to find an optimal policy. The occurrence of optimal empirical sequence also depends on the randomness. We then introduce the Lempel-Ziv coding for measuring the randomness which consists of the domain size and the stochastic complexity. In experimental results, we confirm that the learning and the occurrence of optimal empirical sequence depend on the randomness and show the fact that in early stages the randomness is mainly characterized by the domain size and as the number of time steps increases the randomness depends greatly on the complexity of Markov decision processes.

Journal

IEICE transactions on information and systems   [List of Volumes]

IEICE transactions on information and systems E88-D(1), 135-142, 2005-01-01  [Table of Contents]

The Institute of Electronics, Information and Communication Engineers

References:  16

You must have a user ID to see the references.If you already have a user ID, please click "Login" to access the info.New users can click "Sign Up" to register for an user ID.

Preview

Preview

Codes

  • NII Article ID (NAID) :
    110003214146
  • NII NACSIS-CAT ID (NCID) :
    AA10826272
  • Text Lang :
    ENG
  • Article Type :
    ART
  • ISSN :
    09168532
  • Databases :
    CJP  NII-ELS 

Share