Abstract
Self-supervised monocular depth estimation has been a subject of intense study in recent years, because of its applications in robotics and autonomous driving. Much of the recent work focuses on improving depth estimation by increasing architecture complexity. This paper shows that state-of-the-art performance can also be achieved by improving the learning process rather than increasing model complexity. More specifically, we propose (i) disregarding small potentially dynamic objects when training, and (ii) employing an appearance-based approach to separately estimate object pose for truly dynamic objects. We demonstrate that these simplifications reduce GPU memory usage by 29% and result in qualitatively and quantitatively improved depth maps.
Original language | English |
---|---|
Title of host publication | 2023 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC) |
Editors | Ana C. Lopes, Gabriel Pires, Vitor H. Pinto, Jose L. Lima, Pedro Fonseca |
Publisher | IEEE |
Pages | 10-16 |
Number of pages | 7 |
ISBN (Electronic) | 979-8-3503-0121-2 |
ISBN (Print) | 979-8-3503-0122-9 |
DOIs | |
Publication status | Published - 25 May 2023 |
Event | 2023 IEEE International Conference on Autonomous Robot Systems and Competitions - Tomar, Portugal Duration: 26 Apr 2023 → 27 Apr 2023 |
Publication series
Name | Proceedings of IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC) |
---|---|
Publisher | IEEE |
ISSN (Print) | 2573-9360 |
ISSN (Electronic) | 2573-9387 |
Conference
Conference | 2023 IEEE International Conference on Autonomous Robot Systems and Competitions |
---|---|
Abbreviated title | ICARSC 2023 |
Country/Territory | Portugal |
City | Tomar |
Period | 26/04/23 → 27/04/23 |
Bibliographical note
Funding Information:Most experiments were run on Aston EPS Machine Learning Server, funded by the EPSRC Core Equipment Fund, Grant EP/V036106/1.
Publisher Copyright:
© 2023 IEEE.
Keywords
- Training, Computer vision, Image analysis, Estimation, Graphics processing units, Computer architecture, Complexity theory
- Computer vision, Autonomous vehicles, 3D/stereo scene analysis, Vision and Scene Understanding