TY - GEN
T1 - Route optimization for autonomous bulldozer by distributed deep reinforcement learning
AU - Osaka, Yasuhiro
AU - Odajima, Naoya
AU - Uchimura, Yutaka
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/3/7
Y1 - 2021/3/7
N2 - Since the publication showed DQN based reinforcement learning methods exceeds human's score in Atari 2600 video games, various deep reinforcement learning have bee researched. This paper proposes a method to control bulldozer autonomously by learning the sediment leveling route using PPO that enables distributed deep reinforcement learning. The simulator was originally developed that enables to reproduce the behavior of small and uniform sediment. By incorporating an LSTM that processes the input state as time-series data into the agent network, more than 95% of the sediment in the target area on average was achieved. In addition, the generalization performance for unknown condition was evaluated, by giving unlearned conditions were given as initial setups.
AB - Since the publication showed DQN based reinforcement learning methods exceeds human's score in Atari 2600 video games, various deep reinforcement learning have bee researched. This paper proposes a method to control bulldozer autonomously by learning the sediment leveling route using PPO that enables distributed deep reinforcement learning. The simulator was originally developed that enables to reproduce the behavior of small and uniform sediment. By incorporating an LSTM that processes the input state as time-series data into the agent network, more than 95% of the sediment in the target area on average was achieved. In addition, the generalization performance for unknown condition was evaluated, by giving unlearned conditions were given as initial setups.
KW - Artificial intelligence
KW - Deep Reinforcement Learning
KW - Machine Automation
KW - Machine Learning
UR - http://www.scopus.com/inward/record.url?scp=85104136918&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85104136918&partnerID=8YFLogxK
U2 - 10.1109/ICM46511.2021.9385686
DO - 10.1109/ICM46511.2021.9385686
M3 - Conference contribution
AN - SCOPUS:85104136918
T3 - 2021 IEEE International Conference on Mechatronics, ICM 2021
BT - 2021 IEEE International Conference on Mechatronics, ICM 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE International Conference on Mechatronics, ICM 2021
Y2 - 7 March 2021 through 9 March 2021
ER -