TY - JOUR
T1 - Learning to reproduce fluctuating time series by inferring their time-dependent stochastic properties
T2 - Application in Robot learning via tutoring
AU - Murata, Shingo
AU - Namikawa, Jun
AU - Arie, Hiroaki
AU - Sugano, Shigeki
AU - Tani, Jun
PY - 2013/12
Y1 - 2013/12
N2 - This study proposes a novel type of dynamic neural network model that can learn to extract stochastic or fluctuating structures hidden in time series data. The network learns to predict not only the mean of the next input state, but also its time-dependent variance. The training method is based on maximum likelihood estimation by using the gradient descent method and the likelihood function is expressed as a function of the estimated variance. Regarding the model evaluation, we present numerical experiments in which training data were generated in different ways utilizing Gaussian noise. Our analysis showed that the network can predict the time-dependent variance and the mean and it can also reproduce the target stochastic sequence data by utilizing the estimated variance. Furthermore, it was shown that a humanoid robot using the proposed network can learn to reproduce latent stochastic structures hidden in fluctuating tutoring trajectories. This learning scheme is essential for the acquisition of sensory-guided skilled behavior.
AB - This study proposes a novel type of dynamic neural network model that can learn to extract stochastic or fluctuating structures hidden in time series data. The network learns to predict not only the mean of the next input state, but also its time-dependent variance. The training method is based on maximum likelihood estimation by using the gradient descent method and the likelihood function is expressed as a function of the estimated variance. Regarding the model evaluation, we present numerical experiments in which training data were generated in different ways utilizing Gaussian noise. Our analysis showed that the network can predict the time-dependent variance and the mean and it can also reproduce the target stochastic sequence data by utilizing the estimated variance. Furthermore, it was shown that a humanoid robot using the proposed network can learn to reproduce latent stochastic structures hidden in fluctuating tutoring trajectories. This learning scheme is essential for the acquisition of sensory-guided skilled behavior.
KW - Dynamical systems approach
KW - humanoid robot
KW - maximum likelihood estimation
KW - recurrent neural network
UR - http://www.scopus.com/inward/record.url?scp=84890921190&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84890921190&partnerID=8YFLogxK
U2 - 10.1109/TAMD.2013.2258019
DO - 10.1109/TAMD.2013.2258019
M3 - Article
AN - SCOPUS:84890921190
SN - 1943-0604
VL - 5
SP - 298
EP - 310
JO - IEEE Transactions on Autonomous Mental Development
JF - IEEE Transactions on Autonomous Mental Development
IS - 4
M1 - 6502665
ER -