TY - GEN
T1 - Knowledge-based recurrent neural networks in reinforcement learning
AU - Le, Tien Dung
AU - Komeda, Takashi
AU - Takagi, Motoki
PY - 2007
Y1 - 2007
N2 - Recurrent Neural Networks (RNNs) have been shown to have a strong ability to solve some hard problems. Learning time for these problems from scratch is typically very long. For supervised learning, several methods have been proposed to reuse existing knowledge in previous similar tasks. However, for unsupervised learning such as Reinforcement Learning (RL), especially for Partially Observable Markov Decision Processes (POMDPs), it is difficult to apply directly these algorithms. This paper presents several methods which have the potential of transferring of knowledge in RL using RNN: Directed Transfer, Cascade-Correlation, Mixture of Expert Systems, and Two-Level Architecture. Preliminary results of experiments in the E maze domain show the potential of these methods. Knowledge based learning time for a new problem is much shorter learning time from scratch even thought the new task looks very different from the previous tasks.
AB - Recurrent Neural Networks (RNNs) have been shown to have a strong ability to solve some hard problems. Learning time for these problems from scratch is typically very long. For supervised learning, several methods have been proposed to reuse existing knowledge in previous similar tasks. However, for unsupervised learning such as Reinforcement Learning (RL), especially for Partially Observable Markov Decision Processes (POMDPs), it is difficult to apply directly these algorithms. This paper presents several methods which have the potential of transferring of knowledge in RL using RNN: Directed Transfer, Cascade-Correlation, Mixture of Expert Systems, and Two-Level Architecture. Preliminary results of experiments in the E maze domain show the potential of these methods. Knowledge based learning time for a new problem is much shorter learning time from scratch even thought the new task looks very different from the previous tasks.
KW - Machine learning
KW - Neural networks
UR - http://www.scopus.com/inward/record.url?scp=54949115850&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=54949115850&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:54949115850
SN - 9780889866935
T3 - Proceedings of the 11th IASTED International Conference on Artificial Intelligence and Soft Computing, ASC 2007
SP - 169
EP - 174
BT - Proceedings of the 11th IASTED International Conference on Artificial Intelligence and Soft Computing, ASC 2007
T2 - 11th IASTED International Conference on Artificial Intelligence and Soft Computing, ASC 2007
Y2 - 29 August 2007 through 31 August 2007
ER -