Offline and online time in Sequential Decision-Making Problems

Aman Soni, Peter R. Lewis, Anikó Ekárt

Research output: Chapter in Book/Published conference outputConference publication

Abstract

A connection has recently been drawn between Dynamic Optimization Problems (DOPs) and Reinforcement Learning Problems (RLPs) where they can be seen as subsets of a broader class of Sequential Decision-Making Problems (SDMPs). SDMPs require new decisions on an ongoing basis. Typically the underlying environment changes between decisions. The SDMP view is useful as it allows the unified space to be explored. Solutions can be designed for characteristics of problem instances using algorithms from either community. Little has been done on comparing algorithm performance across these communities, particularly under real-world resource constraints.

Original languageEnglish
Title of host publication2016 IEEE Symposium Series on Computational Intelligence, SSCI 2016
PublisherIEEE
Number of pages8
ISBN (Electronic)978-1-5090-4240-1
DOIs
Publication statusPublished - 9 Feb 2017
Event2016 IEEE Symposium Series on Computational Intelligence, SSCI 2016 - Athens, Greece
Duration: 6 Dec 20169 Dec 2016

Conference

Conference2016 IEEE Symposium Series on Computational Intelligence, SSCI 2016
Country/TerritoryGreece
CityAthens
Period6/12/169/12/16

Bibliographical note

-

Fingerprint

Dive into the research topics of 'Offline and online time in Sequential Decision-Making Problems'. Together they form a unique fingerprint.

Cite this