A novel adaptive weight selection algorithm for multi-objective multi-agent reinforcement learning

Research output: Chapter in Book/Report/Conference proceedingConference contribution

View graph of relations Save citation

Open

Authors

Research units

Abstract

To solve multi-objective problems, multiple reward signals are often scalarized into a single value and further processed using established single-objective problem solving techniques. While the field of multi-objective optimization has made many advances in applying scalarization techniques to obtain good solution trade-offs, the utility of applying these techniques in the multi-objective multi-agent learning domain has not yet been thoroughly investigated. Agents learn the value of their decisions by linearly scalarizing their reward signals at the local level, while acceptable system wide behaviour results. However, the non-linear relationship between weighting parameters of the scalarization function and the learned policy makes the discovery of system wide trade-offs time consuming. Our first contribution is a thorough analysis of well known scalarization schemes within the multi-objective multi-agent reinforcement learning setup. The analysed approaches intelligently explore the weight-space in order to find a wider range of system trade-offs. In our second contribution, we propose a novel adaptive weight algorithm which interacts with the underlying local multi-objective solvers and allows for a better coverage of the Pareto front. Our third contribution is the experimental validation of our approach by learning bi-objective policies in self-organising smart camera networks. We note that our algorithm (i) explores the objective space faster on many problem instances, (ii) obtained solutions that exhibit a larger hypervolume, while (iii) acquiring a greater spread in the objective space.

Documents

  • Weight selection algorithm for multi-objective multi-agent reinforcement learning

    Rights statement: © 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

    Accepted author manuscript, 4 MB, PDF-document

Details

Publication date2014
Publication titleProceedings of the International Joint Conference on Neural Networks
PublisherIEEE
Pages2306-2314
Number of pages9
ISBN (Print)978-1-4799-6627-1
Original languageEnglish
Event2014 International Joint Conference on Neural Networks - Beijing, China

Conference

Conference2014 International Joint Conference on Neural Networks
Abbreviated titleIJCNN 2014
CountryChina
CityBeijing
Period6/07/1411/07/14

Bibliographic note

© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

DOI

Download statistics

No data available

Employable Graduates; Exploitable Research

Copy the text from this field...