TY - JOUR
T1 - R-learning with multiple state-action value tables
AU - Ishikawa, Koichiro
AU - Sakurai, Akito
AU - Fujinami, Tsutomu
AU - Kunifuji, Susumu
N1 - Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.
PY - 2006
Y1 - 2006
N2 - We propose a method to improve the performance of R-learning, a reinforcement learning algorithm, by using multiple state-action value tables. Unlike Q- or Sarsa learning, R-learning learns a policy to maximize undiscounted rewards. Multiple state-action value tables cause substantial explorations as needed and make R-learnings to work well. Efficiency of the proposed method is verified through experiments in simulation environment.
AB - We propose a method to improve the performance of R-learning, a reinforcement learning algorithm, by using multiple state-action value tables. Unlike Q- or Sarsa learning, R-learning learns a policy to maximize undiscounted rewards. Multiple state-action value tables cause substantial explorations as needed and make R-learnings to work well. Efficiency of the proposed method is verified through experiments in simulation environment.
KW - Autonomous mobile robot
KW - R-learning
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=33749571691&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33749571691&partnerID=8YFLogxK
U2 - 10.1541/ieejeiss.126.72
DO - 10.1541/ieejeiss.126.72
M3 - Article
AN - SCOPUS:33749571691
SN - 0385-4221
VL - 126
SP - 72
EP - 82
JO - IEEJ Transactions on Electronics, Information and Systems
JF - IEEJ Transactions on Electronics, Information and Systems
IS - 1
ER -