Toward End-to-End Control for UAV Autonomous Landing via Deep Reinforcement Learning

R. Polvara; M. Patacchiola; Sanjay Sharma; J. Wan; A. Manning; R. Sutton; A. Cangelosi

doi:10.1109/ICUAS.2018.8453449

Toward End-to-End Control for UAV Autonomous Landing via Deep Reinforcement Learning

R. Polvara, M. Patacchiola, Sanjay Sharma, J. Wan, A. Manning, R. Sutton, A. Cangelosi

Mechanical, Biomedical & Design Engineering

Research output: Chapter in Book/Published conference output › Conference publication

Abstract

The autonomous landing of an unmanned aerial vehicle (UAV) is still an open problem. Previous work focused on the use of hand-crafted geometric features and sensor-data fusion for identifying a fiducial marker and guide the UAV toward it. In this article we propose a method based on deep reinforcement learning that only requires low-resolution images coming from a down looking camera in order to drive the vehicle. The proposed approach is based on a hierarchy of Deep Q-Networks (DQNs) that are used as high-end control policy for the navigation in different phases. We implemented various technical solutions, such as the combination of vanilla and double DQNs trained using a form of prioritized buffer replay that separates experiences in multiple containers. The optimal control policy is learned without any human supervision, providing the agent with a sparse reward feedback indicating the success or failure of the landing. The results show that the quadrotor can autonomously land on a large variety of simulated environments and with relevant noise, proving that the underline DQNs are able to generalise effectively on unseen scenarios. Furthermore, it was proved that in some conditions the network outperformed human pilots.

Original language	English
Title of host publication	2018 International Conference on Unmanned Aircraft Systems (ICUAS)
Publisher	IEEE
Pages	115-123
ISBN (Electronic)	9781538613542
DOIs	https://doi.org/10.1109/ICUAS.2018.8453449
Publication status	Published - 3 Sept 2018
Event	2018 International Conference on Unmanned Aircraft Systems - Dallas, United States Duration: 12 Jun 2018 → 15 Jun 2018

Conference

Conference	2018 International Conference on Unmanned Aircraft Systems
Abbreviated title	ICUAS
Country/Territory	United States
City	Dallas
Period	12/06/18 → 15/06/18

Access to Document

10.1109/ICUAS.2018.8453449

Cite this

@inproceedings{9eebccff9bb04296924d0b96b1fa685a,

title = "Toward End-to-End Control for UAV Autonomous Landing via Deep Reinforcement Learning",

abstract = "The autonomous landing of an unmanned aerial vehicle (UAV) is still an open problem. Previous work focused on the use of hand-crafted geometric features and sensor-data fusion for identifying a fiducial marker and guide the UAV toward it. In this article we propose a method based on deep reinforcement learning that only requires low-resolution images coming from a down looking camera in order to drive the vehicle. The proposed approach is based on a hierarchy of Deep Q-Networks (DQNs) that are used as high-end control policy for the navigation in different phases. We implemented various technical solutions, such as the combination of vanilla and double DQNs trained using a form of prioritized buffer replay that separates experiences in multiple containers. The optimal control policy is learned without any human supervision, providing the agent with a sparse reward feedback indicating the success or failure of the landing. The results show that the quadrotor can autonomously land on a large variety of simulated environments and with relevant noise, proving that the underline DQNs are able to generalise effectively on unseen scenarios. Furthermore, it was proved that in some conditions the network outperformed human pilots.",

author = "R. Polvara and M. Patacchiola and Sanjay Sharma and J. Wan and A. Manning and R. Sutton and A. Cangelosi",

year = "2018",

month = sep,

day = "3",

doi = "10.1109/ICUAS.2018.8453449",

language = "English",

pages = "115--123",

booktitle = "2018 International Conference on Unmanned Aircraft Systems (ICUAS)",

publisher = "IEEE",

address = "United States",

note = "2018 International Conference on Unmanned Aircraft Systems, ICUAS ; Conference date: 12-06-2018 Through 15-06-2018",

}

Polvara, R, Patacchiola, M, Sharma, S, Wan, J, Manning, A, Sutton, R & Cangelosi, A 2018, Toward End-to-End Control for UAV Autonomous Landing via Deep Reinforcement Learning. in 2018 International Conference on Unmanned Aircraft Systems (ICUAS). IEEE, pp. 115-123, 2018 International Conference on Unmanned Aircraft Systems, Dallas, United States, 12/06/18. https://doi.org/10.1109/ICUAS.2018.8453449

TY - GEN

T1 - Toward End-to-End Control for UAV Autonomous Landing via Deep Reinforcement Learning

AU - Polvara, R.

AU - Patacchiola, M.

AU - Sharma, Sanjay

AU - Wan, J.

AU - Manning, A.

AU - Sutton, R.

AU - Cangelosi, A.

PY - 2018/9/3

Y1 - 2018/9/3

N2 - The autonomous landing of an unmanned aerial vehicle (UAV) is still an open problem. Previous work focused on the use of hand-crafted geometric features and sensor-data fusion for identifying a fiducial marker and guide the UAV toward it. In this article we propose a method based on deep reinforcement learning that only requires low-resolution images coming from a down looking camera in order to drive the vehicle. The proposed approach is based on a hierarchy of Deep Q-Networks (DQNs) that are used as high-end control policy for the navigation in different phases. We implemented various technical solutions, such as the combination of vanilla and double DQNs trained using a form of prioritized buffer replay that separates experiences in multiple containers. The optimal control policy is learned without any human supervision, providing the agent with a sparse reward feedback indicating the success or failure of the landing. The results show that the quadrotor can autonomously land on a large variety of simulated environments and with relevant noise, proving that the underline DQNs are able to generalise effectively on unseen scenarios. Furthermore, it was proved that in some conditions the network outperformed human pilots.

AB - The autonomous landing of an unmanned aerial vehicle (UAV) is still an open problem. Previous work focused on the use of hand-crafted geometric features and sensor-data fusion for identifying a fiducial marker and guide the UAV toward it. In this article we propose a method based on deep reinforcement learning that only requires low-resolution images coming from a down looking camera in order to drive the vehicle. The proposed approach is based on a hierarchy of Deep Q-Networks (DQNs) that are used as high-end control policy for the navigation in different phases. We implemented various technical solutions, such as the combination of vanilla and double DQNs trained using a form of prioritized buffer replay that separates experiences in multiple containers. The optimal control policy is learned without any human supervision, providing the agent with a sparse reward feedback indicating the success or failure of the landing. The results show that the quadrotor can autonomously land on a large variety of simulated environments and with relevant noise, proving that the underline DQNs are able to generalise effectively on unseen scenarios. Furthermore, it was proved that in some conditions the network outperformed human pilots.

UR - http://www.scopus.com/inward/record.url?eid=2-s2.0-85053921728&partnerID=MN8TOARS

U2 - 10.1109/ICUAS.2018.8453449

DO - 10.1109/ICUAS.2018.8453449

M3 - Conference publication

SP - 115

EP - 123

BT - 2018 International Conference on Unmanned Aircraft Systems (ICUAS)

PB - IEEE

T2 - 2018 International Conference on Unmanned Aircraft Systems

Y2 - 12 June 2018 through 15 June 2018

ER -

Toward End-to-End Control for UAV Autonomous Landing via Deep Reinforcement Learning

Abstract

Conference

Access to Document

Other files and links

Fingerprint

Cite this