TY - JOUR
T1 - Principles and Guidelines for Evaluating Social Robot Navigation Algorithms
AU - Francis, Anthony
AU - Perez D'arpino, Claudia
AU - Li, Chengshu
AU - Xia, Fei
AU - Alahi, Alexandre
AU - Alami, Rachid
AU - Bera, Aniket
AU - Biswas, Abhijat
AU - Biswas, Joydeep
AU - Chandra, Rohan
AU - Chiang, Hao-Tien Lewis
AU - Everett, Michael
AU - Ha, Sehoon
AU - Hart, Justin
AU - How, Jonathan P.
AU - Karnan, Haresh
AU - Lee, Tsang-Wei Edward
AU - Manso, Luis J.
AU - Mirsky, Reuth
AU - Pirk, Soren
AU - Singamaneni, Phani Teja
AU - Stone, Peter
AU - Taylor, Ada V.
AU - Trautman, Peter
AU - Tsoi, Nathan
AU - Vazquez, Marynel
AU - Xiao, Xuesu
AU - Xu, Peng
AU - Yokoyama, Naoki
AU - Toshev, Alexander
AU - Martin-Martin, Roberto
N1 - Copyright © 2025 held by the owner/author(s). This work is licensed under a Creative Commons Attribution International 4.0 License.
PY - 2025/2/20
Y1 - 2025/2/20
N2 - A major challenge to deploying robots widely is navigation in human-populated environments, commonly referred to as social robot navigation. While the field of social navigation has advanced tremendously in recent years, the fair evaluation of algorithms that tackle social navigation remains hard because it involves not just robotic agents moving in static environments but also dynamic human agents and their perceptions of the appropriateness of robot behavior. In contrast, clear, repeatable, and accessible benchmarks have accelerated progress in fields like computer vision, natural language processing and traditional robot navigation by enabling researchers to fairly compare algorithms, revealing limitations of existing solutions and illuminating promising new directions. We believe the same approach can benefit social navigation. In this paper, we pave the road towards common, widely accessible, and repeatable benchmarking criteria to evaluate social robot navigation. Our contributions include (a) a definition of a socially navigating robot as one that respects the principles of safety, comfort, legibility, politeness, social competency, agent understanding, proactivity, and responsiveness to context, (b) guidelines for the use of metrics, development of scenarios, benchmarks, datasets, and simulators to evaluate social navigation, and (c) a design of a social navigation metrics framework to make it easier to compare results from different simulators, robots and datasets.
AB - A major challenge to deploying robots widely is navigation in human-populated environments, commonly referred to as social robot navigation. While the field of social navigation has advanced tremendously in recent years, the fair evaluation of algorithms that tackle social navigation remains hard because it involves not just robotic agents moving in static environments but also dynamic human agents and their perceptions of the appropriateness of robot behavior. In contrast, clear, repeatable, and accessible benchmarks have accelerated progress in fields like computer vision, natural language processing and traditional robot navigation by enabling researchers to fairly compare algorithms, revealing limitations of existing solutions and illuminating promising new directions. We believe the same approach can benefit social navigation. In this paper, we pave the road towards common, widely accessible, and repeatable benchmarking criteria to evaluate social robot navigation. Our contributions include (a) a definition of a socially navigating robot as one that respects the principles of safety, comfort, legibility, politeness, social competency, agent understanding, proactivity, and responsiveness to context, (b) guidelines for the use of metrics, development of scenarios, benchmarks, datasets, and simulators to evaluate social navigation, and (c) a design of a social navigation metrics framework to make it easier to compare results from different simulators, robots and datasets.
KW - social navigation
UR - https://dl.acm.org/doi/10.1145/3700599
U2 - 10.1145/3700599
DO - 10.1145/3700599
M3 - Article
SN - 2573-9522
VL - 14
JO - Transactions on Human-Robot Interaction
JF - Transactions on Human-Robot Interaction
IS - 2
ER -