Learning of soccer player agents using a policy gradient method: Coordination between kicker and receiver during free kicks

H. Igarashi, K. Nakamura, S. Ishihara

研究成果: Conference contribution

5 被引用数 (Scopus)

抄録

The RoboCup Simulation League is recognized as a test bed for research on multi-agent learning. As an example of multi-agent learning in a soccer game, we dealt with a learning problem between a kicker and a receiver when a direct free kick is awarded just outside the opponent's penalty area. In such a situation, to which point should the kicker kick the ball? We propose a function that expresses heuristics to evaluate an advantageous target point for safely sending/receiving a pass and scoring. The heuristics includes an interaction term between a kicker and a receiver to intensify their coordination. To calculate the interaction term, we let kicker/receiver agents have a receiver/kicker action decision model to predict his teammate's action. The evaluation function makes it possible to handle a large space of states consisting of the positions of a kicker, a receiver, and their opponents. The target point of the free kick is selected by the kicker using Boltzmann selection with an evaluation function. Parameters in the function can be learned by a kind of reinforcement learning called the policy gradient method. The point to which a receiver should run to receive the ball is simultaneously learned in the same manner. The effectiveness of our solution was shown by experiments.

本文言語English
ホスト出版物のタイトル2008 International Joint Conference on Neural Networks, IJCNN 2008
ページ46-52
ページ数7
DOI
出版ステータスPublished - 2008
イベント2008 International Joint Conference on Neural Networks, IJCNN 2008 - Hong Kong, China
継続期間: 2008 6月 12008 6月 8

出版物シリーズ

名前Proceedings of the International Joint Conference on Neural Networks

Conference

Conference2008 International Joint Conference on Neural Networks, IJCNN 2008
国/地域China
CityHong Kong
Period08/6/108/6/8

ASJC Scopus subject areas

  • ソフトウェア
  • 人工知能

フィンガープリント

「Learning of soccer player agents using a policy gradient method: Coordination between kicker and receiver during free kicks」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル