Integrating Multi-Demonstration Knowledge and Bounded Workspaces for Efficient Deep Reinforcement Learning

Ali Aflakian, Rustam Stolkin, Alireza Rastegarpanah*

*Corresponding author for this work

Research output: Chapter in Book/Published conference outputConference publication

3 Citations (Scopus)

Abstract

We propose a novel approach for boosting deep Reinforcement Learning (RL) using human demonstrations and offline workspace bounding. Our approach involves collecting data from human demonstrations on random surfaces with varying friction and stiffness properties. We then compute a 3D convex hull that encompasses all the paths taken by the demonstrators. By defining the task and the desired parameters as reward functions, we enable the reinforcement learning agent to learn an optimal solution within the bounded space, significantly reducing the search space required for the agent. We compare the training progress and the behavior of the trained policy of our approach with a baseline approach. The results demonstrate that our approach not only expedites learning but also improves the policy's performance and resilience to local minima. Combining our approach with RL also enables the use of imperfect demonstrators as their behavior can be improved during the learning. Our approach has the potential to significantly boost the development of deep RL applications in various domains, including robotics, gaming, and autonomous systems.

Original languageEnglish
Title of host publication2023 IEEE-RAS 22nd International Conference on Humanoid Robots (Humanoids)
PublisherIEEE
Number of pages8
ISBN (Electronic)9798350303278
DOIs
Publication statusPublished - 1 Jan 2024
Event22nd IEEE-RAS International Conference on Humanoid Robots, Humanoids 2023 - Austin, United States
Duration: 12 Dec 202314 Dec 2023

Publication series

NameIEEE-RAS International Conference on Humanoid Robots
ISSN (Print)2164-0572
ISSN (Electronic)2164-0580

Conference

Conference22nd IEEE-RAS International Conference on Humanoid Robots, Humanoids 2023
Country/TerritoryUnited States
CityAustin
Period12/12/2314/12/23

Keywords

  • Contact-rich path following
  • Deep reinforcement learning
  • Human demonstrations
  • Offline workspace bounding

Fingerprint

Dive into the research topics of 'Integrating Multi-Demonstration Knowledge and Bounded Workspaces for Efficient Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this