TY - JOUR
T1 - Route optimization of construction machine by deep reinforcement learning
AU - Tanabe, Shunya
AU - Sun, Zeyuan
AU - Nakatani, Masayuki
AU - Uchimura, Yutaka
N1 - Publisher Copyright:
© 2019 The Institute of Electrical Engineers of Japan.
Copyright:
Copyright 2019 Elsevier B.V., All rights reserved.
PY - 2019
Y1 - 2019
N2 - After it was reported that an AI player scored higher in Atari2600 games than skilled human players by using deep reinforcement learning techniques, many researchers were inspired to apply deep reinforcement leaning in various fields. This paper focuses on the autonomous ground leveling work by a bulldozer, which is expected to optimize the action of the bulldozer. In a previous work, we implemented a deep Q learning method by giving the images as the input data for the network. However, when learning the image using the convolution layer as the input using deep reinforcement learning, it requires a large computational cost for the learning process. If the size of the neural network is shrunken by contriving the data to be supplied to the input, the learning time (duration) will be reduced. This paper describes the comparison results for different orders of input data. the transition of the learning sequence is also evaluated.
AB - After it was reported that an AI player scored higher in Atari2600 games than skilled human players by using deep reinforcement learning techniques, many researchers were inspired to apply deep reinforcement leaning in various fields. This paper focuses on the autonomous ground leveling work by a bulldozer, which is expected to optimize the action of the bulldozer. In a previous work, we implemented a deep Q learning method by giving the images as the input data for the network. However, when learning the image using the convolution layer as the input using deep reinforcement learning, it requires a large computational cost for the learning process. If the size of the neural network is shrunken by contriving the data to be supplied to the input, the learning time (duration) will be reduced. This paper describes the comparison results for different orders of input data. the transition of the learning sequence is also evaluated.
KW - Artificial intelligence
KW - Autonomous control
KW - Deep reinforcement learning
KW - Leveling machine
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85063755922&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85063755922&partnerID=8YFLogxK
U2 - 10.1541/ieejias.139.401
DO - 10.1541/ieejias.139.401
M3 - Article
AN - SCOPUS:85063755922
SN - 0913-6339
VL - 139
SP - 401
EP - 408
JO - IEEJ Transactions on Industry Applications
JF - IEEJ Transactions on Industry Applications
IS - 4
ER -