Autonomous Vehicular Landings on the Deck of an Unmanned Surface Vehicle using Deep Reinforcement Learning

R. Polvara; Sanjay Sharma; J. Wan; A. Manning; R. Sutton

doi:10.1017/S0263574719000316

Autonomous Vehicular Landings on the Deck of an Unmanned Surface Vehicle using Deep Reinforcement Learning

R. Polvara, Sanjay Sharma, J. Wan, A. Manning, R. Sutton

Mechanical, Biomedical & Design Engineering

Research output: Contribution to journal › Article › peer-review

Abstract

Autonomous landing on the deck of a boat or an unmanned surface vehicle (USV) is the minimum requirement for increasing the autonomy of water monitoring missions. This paper introduces an end-to-end control technique based on deep reinforcement learning for landing an unmanned aerial vehicle on a visual marker located on the deck of a USV. The solution proposed consists of a hierarchy of Deep Q-Networks (DQNs) used as high-level navigation policies that address the two phases of the flight: the marker detection and the descending manoeuvre. Few technical improvements have been proposed to stabilize the learning process, such as the combination of vanilla and double DQNs, and a partitioned buffer replay. Simulated studies proved the robustness of the proposed algorithm against different perturbations acting on the marine vessel. The performances obtained are comparable with a state-of-the-art method based on template matching.

Original language	English
Pages (from-to)	1867-1882
Journal	Robotica
Volume	37
Issue number	11
DOIs	https://doi.org/10.1017/S0263574719000316
Publication status	Published - 8 Apr 2019

Keywords

Deep reinforcement learning
Unmanned aerial vehicle
Autonomous agents

Access to Document

10.1017/S0263574719000316

http://hdl.handle.net/10026.1/13683Licence: Unspecified

Cite this

@article{bcba01f8886b41f68b0e1fdc4ded865b,

title = "Autonomous Vehicular Landings on the Deck of an Unmanned Surface Vehicle using Deep Reinforcement Learning",

abstract = "Autonomous landing on the deck of a boat or an unmanned surface vehicle (USV) is the minimum requirement for increasing the autonomy of water monitoring missions. This paper introduces an end-to-end control technique based on deep reinforcement learning for landing an unmanned aerial vehicle on a visual marker located on the deck of a USV. The solution proposed consists of a hierarchy of Deep Q-Networks (DQNs) used as high-level navigation policies that address the two phases of the flight: the marker detection and the descending manoeuvre. Few technical improvements have been proposed to stabilize the learning process, such as the combination of vanilla and double DQNs, and a partitioned buffer replay. Simulated studies proved the robustness of the proposed algorithm against different perturbations acting on the marine vessel. The performances obtained are comparable with a state-of-the-art method based on template matching.",

keywords = "Deep reinforcement learning, Unmanned aerial vehicle, Autonomous agents",

author = "R. Polvara and Sanjay Sharma and J. Wan and A. Manning and R. Sutton",

year = "2019",

month = apr,

day = "8",

doi = "10.1017/S0263574719000316",

language = "English",

volume = "37",

pages = "1867--1882",

number = "11",

}

TY - JOUR

T1 - Autonomous Vehicular Landings on the Deck of an Unmanned Surface Vehicle using Deep Reinforcement Learning

AU - Polvara, R.

AU - Sharma, Sanjay

AU - Wan, J.

AU - Manning, A.

AU - Sutton, R.

PY - 2019/4/8

Y1 - 2019/4/8

N2 - Autonomous landing on the deck of a boat or an unmanned surface vehicle (USV) is the minimum requirement for increasing the autonomy of water monitoring missions. This paper introduces an end-to-end control technique based on deep reinforcement learning for landing an unmanned aerial vehicle on a visual marker located on the deck of a USV. The solution proposed consists of a hierarchy of Deep Q-Networks (DQNs) used as high-level navigation policies that address the two phases of the flight: the marker detection and the descending manoeuvre. Few technical improvements have been proposed to stabilize the learning process, such as the combination of vanilla and double DQNs, and a partitioned buffer replay. Simulated studies proved the robustness of the proposed algorithm against different perturbations acting on the marine vessel. The performances obtained are comparable with a state-of-the-art method based on template matching.

AB - Autonomous landing on the deck of a boat or an unmanned surface vehicle (USV) is the minimum requirement for increasing the autonomy of water monitoring missions. This paper introduces an end-to-end control technique based on deep reinforcement learning for landing an unmanned aerial vehicle on a visual marker located on the deck of a USV. The solution proposed consists of a hierarchy of Deep Q-Networks (DQNs) used as high-level navigation policies that address the two phases of the flight: the marker detection and the descending manoeuvre. Few technical improvements have been proposed to stabilize the learning process, such as the combination of vanilla and double DQNs, and a partitioned buffer replay. Simulated studies proved the robustness of the proposed algorithm against different perturbations acting on the marine vessel. The performances obtained are comparable with a state-of-the-art method based on template matching.

KW - Deep reinforcement learning

KW - Unmanned aerial vehicle

KW - Autonomous agents

UR - http://www.scopus.com/inward/record.url?eid=2-s2.0-85065254630&partnerID=MN8TOARS

UR - https://www.cambridge.org/core/journals/robotica/article/autonomous-vehicular-landings-on-the-deck-of-an-unmanned-surface-vehicle-using-deep-reinforcement-learning/6B87C450D4D431EC163DBEA45FD60C73

U2 - 10.1017/S0263574719000316

DO - 10.1017/S0263574719000316

M3 - Article

SN - 0263-5747

VL - 37

SP - 1867

EP - 1882

JO - Robotica

JF - Robotica

IS - 11

ER -

Autonomous Vehicular Landings on the Deck of an Unmanned Surface Vehicle using Deep Reinforcement Learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this