TY - GEN
T1 - Policy gradient approach for learning of soccer player agents
T2 - International Conference on Advances in Intelligent Control and Computer Engineering
AU - Igarashi, Harukazu
AU - Fukuoka, Hitoshi
AU - Ishihara, Seiji
PY - 2011/1/10
Y1 - 2011/1/10
N2 - This research develops a learning method for the pass selection problem of midfielders in RoboCup Soccer Simulation games. A policy gradient method is applied as a learning method to solve this problem because it can easily represent the various heuristics of pass selection in a policy function. We implement the learning function in the midfielders' programs of a well-known team, UvA Trilearn Base 2003. Experimental results show that our method effectively achieves clever pass selection by midfielders in full games. Moreover, in this method's framework, dribbling is learned as a pass technique, in essence to and from the passer itself. It is also shown that the improvement in pass selection by our learning helps to make a team much stronger.
AB - This research develops a learning method for the pass selection problem of midfielders in RoboCup Soccer Simulation games. A policy gradient method is applied as a learning method to solve this problem because it can easily represent the various heuristics of pass selection in a policy function. We implement the learning function in the midfielders' programs of a well-known team, UvA Trilearn Base 2003. Experimental results show that our method effectively achieves clever pass selection by midfielders in full games. Moreover, in this method's framework, dribbling is learned as a pass technique, in essence to and from the passer itself. It is also shown that the improvement in pass selection by our learning helps to make a team much stronger.
KW - Multi-agent system
KW - Pass selection
KW - Policy gradient method
KW - Reinforcement learning
KW - RoboCup
UR - http://www.scopus.com/inward/record.url?scp=78651546538&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78651546538&partnerID=8YFLogxK
U2 - 10.1007/978-94-007-0286-8_12
DO - 10.1007/978-94-007-0286-8_12
M3 - Conference contribution
AN - SCOPUS:78651546538
SN - 9789400702851
T3 - Lecture Notes in Electrical Engineering
SP - 137
EP - 148
BT - Intelligent Control and Computer Engineering
Y2 - 17 March 2010 through 19 March 2010
ER -