TY - JOUR
T1 - Autonomous Vehicular Landings on the Deck of an Unmanned Surface Vehicle using Deep Reinforcement Learning
AU - Polvara, R.
AU - Sharma, Sanjay
AU - Wan, J.
AU - Manning, A.
AU - Sutton, R.
PY - 2019/4/8
Y1 - 2019/4/8
N2 - Autonomous landing on the deck of a boat or an unmanned surface vehicle (USV) is the minimum requirement for increasing the autonomy of water monitoring missions. This paper introduces an end-to-end control technique based on deep reinforcement learning for landing an unmanned aerial vehicle on a visual marker located on the deck of a USV. The solution proposed consists of a hierarchy of Deep Q-Networks (DQNs) used as high-level navigation policies that address the two phases of the flight: the marker detection and the descending manoeuvre. Few technical improvements have been proposed to stabilize the learning process, such as the combination of vanilla and double DQNs, and a partitioned buffer replay. Simulated studies proved the robustness of the proposed algorithm against different perturbations acting on the marine vessel. The performances obtained are comparable with a state-of-the-art method based on template matching.
AB - Autonomous landing on the deck of a boat or an unmanned surface vehicle (USV) is the minimum requirement for increasing the autonomy of water monitoring missions. This paper introduces an end-to-end control technique based on deep reinforcement learning for landing an unmanned aerial vehicle on a visual marker located on the deck of a USV. The solution proposed consists of a hierarchy of Deep Q-Networks (DQNs) used as high-level navigation policies that address the two phases of the flight: the marker detection and the descending manoeuvre. Few technical improvements have been proposed to stabilize the learning process, such as the combination of vanilla and double DQNs, and a partitioned buffer replay. Simulated studies proved the robustness of the proposed algorithm against different perturbations acting on the marine vessel. The performances obtained are comparable with a state-of-the-art method based on template matching.
KW - Deep reinforcement learning
KW - Unmanned aerial vehicle
KW - Autonomous agents
UR - http://www.scopus.com/inward/record.url?eid=2-s2.0-85065254630&partnerID=MN8TOARS
UR - https://www.cambridge.org/core/journals/robotica/article/autonomous-vehicular-landings-on-the-deck-of-an-unmanned-surface-vehicle-using-deep-reinforcement-learning/6B87C450D4D431EC163DBEA45FD60C73
U2 - 10.1017/S0263574719000316
DO - 10.1017/S0263574719000316
M3 - Article
SN - 0263-5747
VL - 37
SP - 1867
EP - 1882
JO - Robotica
JF - Robotica
IS - 11
ER -