An efficient knowledge transfer solution to a novel SMDP formalization of a broker's decision problem

Rodrigue Talla Kuate; Maria Chli; Hai H. Wang

An efficient knowledge transfer solution to a novel SMDP formalization of a broker's decision problem

Rodrigue Talla Kuate, Maria Chli, Hai H. Wang

Computer Science Research Group

Research output: Chapter in Book/Published conference output › Conference publication

Abstract

This paper introduces a new technique for optimizing the trading strategy of brokers that autonomously trade in re- tail and wholesale markets. Simultaneous optimization of re- tail and wholesale strategies has been considered by existing studies as intractable. Therefore, each of these strategies is optimized separately and their interdependence is generally ignored, with resulting broker agents not aiming for a glob- ally optimal retail and wholesale strategy. In this paper, we propose a novel formalization, based on a semi-Markov deci- sion process (SMDP), which globally and simultaneously op- timizes retail and wholesale strategies. The SMDP is solved using hierarchical reinforcement learning (HRL) in multi- agent environments. To address the curse of dimensionality, which arises when applying SMDP and HRL to complex de- cision problems, we propose an ecient knowledge transfer approach. This enables the reuse of learned trading skills in order to speed up the learning in new markets, at the same time as making the broker transportable across market envi- ronments. The proposed SMDP-broker has been thoroughly evaluated in two well-established multi-agent simulation en- vironments within the Trading Agent Competition (TAC) community. Analysis of controlled experiments shows that this broker can outperform the top TAC-brokers. More- over, our broker is able to perform well in a wide range of environments by re-using knowledge acquired in previously experienced settings.

Original language	English
Title of host publication	AAMAS'15 International Conference on Autonomous Agents and Multi Agent Solutions
Publisher	ACM
Pages	1735-1736
Number of pages	2
Volume	3
ISBN (Print)	978-1-4503-3771-7
Publication status	Published - 4 May 2015
Event	14th International Conference on Autonomous Agents and Multi Agent Systems - Istanbul Congres Center, Istanbul, Turkey Duration: 4 May 2015 → 8 May 2015

Conference

Conference	14th International Conference on Autonomous Agents and Multi Agent Systems
Abbreviated title	AAMAS 2015
Country/Territory	Turkey
City	Istanbul
Period	4/05/15 → 8/05/15

Keywords

artificial intelligence
learning
knowledge transfer
reinforcement learning
MDP
SMDP
broker agent

Access to Document

Efficient knowledge transfer solution to a novel SMDP formalization of a broker's decision problemSubmitted manuscript, 170 KB

Cite this

@inproceedings{ef98a51da62344d7b91bb86966f6a7d8,

title = "An efficient knowledge transfer solution to a novel SMDP formalization of a broker's decision problem",

abstract = "This paper introduces a new technique for optimizing the trading strategy of brokers that autonomously trade in re- tail and wholesale markets. Simultaneous optimization of re- tail and wholesale strategies has been considered by existing studies as intractable. Therefore, each of these strategies is optimized separately and their interdependence is generally ignored, with resulting broker agents not aiming for a glob- ally optimal retail and wholesale strategy. In this paper, we propose a novel formalization, based on a semi-Markov deci- sion process (SMDP), which globally and simultaneously op- timizes retail and wholesale strategies. The SMDP is solved using hierarchical reinforcement learning (HRL) in multi- agent environments. To address the curse of dimensionality, which arises when applying SMDP and HRL to complex de- cision problems, we propose an ecient knowledge transfer approach. This enables the reuse of learned trading skills in order to speed up the learning in new markets, at the same time as making the broker transportable across market envi- ronments. The proposed SMDP-broker has been thoroughly evaluated in two well-established multi-agent simulation en- vironments within the Trading Agent Competition (TAC) community. Analysis of controlled experiments shows that this broker can outperform the top TAC-brokers. More- over, our broker is able to perform well in a wide range of environments by re-using knowledge acquired in previously experienced settings.",

keywords = "artificial intelligence, learning, knowledge transfer, reinforcement learning, MDP, SMDP, broker agent",

author = "{Talla Kuate}, Rodrigue and Maria Chli and Wang, {Hai H.}",

year = "2015",

month = may,

day = "4",

language = "English",

isbn = "978-1-4503-3771-7",

volume = "3",

pages = "1735--1736",

booktitle = "AAMAS'15 International Conference on Autonomous Agents and Multi Agent Solutions",

publisher = "ACM",

address = "United States",

note = "14th International Conference on Autonomous Agents and Multi Agent Systems, AAMAS 2015 ; Conference date: 04-05-2015 Through 08-05-2015",

}

Talla Kuate, R, Chli, M & Wang, HH 2015, An efficient knowledge transfer solution to a novel SMDP formalization of a broker's decision problem. in AAMAS'15 International Conference on Autonomous Agents and Multi Agent Solutions. vol. 3, paper 278, ACM, pp. 1735-1736, 14th International Conference on Autonomous Agents and Multi Agent Systems, Istanbul, Turkey, 4/05/15.

TY - GEN

T1 - An efficient knowledge transfer solution to a novel SMDP formalization of a broker's decision problem

AU - Talla Kuate, Rodrigue

AU - Chli, Maria

AU - Wang, Hai H.

PY - 2015/5/4

Y1 - 2015/5/4

N2 - This paper introduces a new technique for optimizing the trading strategy of brokers that autonomously trade in re- tail and wholesale markets. Simultaneous optimization of re- tail and wholesale strategies has been considered by existing studies as intractable. Therefore, each of these strategies is optimized separately and their interdependence is generally ignored, with resulting broker agents not aiming for a glob- ally optimal retail and wholesale strategy. In this paper, we propose a novel formalization, based on a semi-Markov deci- sion process (SMDP), which globally and simultaneously op- timizes retail and wholesale strategies. The SMDP is solved using hierarchical reinforcement learning (HRL) in multi- agent environments. To address the curse of dimensionality, which arises when applying SMDP and HRL to complex de- cision problems, we propose an ecient knowledge transfer approach. This enables the reuse of learned trading skills in order to speed up the learning in new markets, at the same time as making the broker transportable across market envi- ronments. The proposed SMDP-broker has been thoroughly evaluated in two well-established multi-agent simulation en- vironments within the Trading Agent Competition (TAC) community. Analysis of controlled experiments shows that this broker can outperform the top TAC-brokers. More- over, our broker is able to perform well in a wide range of environments by re-using knowledge acquired in previously experienced settings.

AB - This paper introduces a new technique for optimizing the trading strategy of brokers that autonomously trade in re- tail and wholesale markets. Simultaneous optimization of re- tail and wholesale strategies has been considered by existing studies as intractable. Therefore, each of these strategies is optimized separately and their interdependence is generally ignored, with resulting broker agents not aiming for a glob- ally optimal retail and wholesale strategy. In this paper, we propose a novel formalization, based on a semi-Markov deci- sion process (SMDP), which globally and simultaneously op- timizes retail and wholesale strategies. The SMDP is solved using hierarchical reinforcement learning (HRL) in multi- agent environments. To address the curse of dimensionality, which arises when applying SMDP and HRL to complex de- cision problems, we propose an ecient knowledge transfer approach. This enables the reuse of learned trading skills in order to speed up the learning in new markets, at the same time as making the broker transportable across market envi- ronments. The proposed SMDP-broker has been thoroughly evaluated in two well-established multi-agent simulation en- vironments within the Trading Agent Competition (TAC) community. Analysis of controlled experiments shows that this broker can outperform the top TAC-brokers. More- over, our broker is able to perform well in a wide range of environments by re-using knowledge acquired in previously experienced settings.

KW - artificial intelligence

KW - learning

KW - knowledge transfer

KW - reinforcement learning

KW - MDP

KW - SMDP

KW - broker agent

UR - http://www.scopus.com/inward/record.url?scp=84944710632&partnerID=8YFLogxK

UR - http://www.aamas2015.com/en/AAMAS_2015_USB/aamas/p1735.pdf

M3 - Conference publication

AN - SCOPUS:84944710632

SN - 978-1-4503-3771-7

VL - 3

SP - 1735

EP - 1736

BT - AAMAS'15 International Conference on Autonomous Agents and Multi Agent Solutions

PB - ACM

T2 - 14th International Conference on Autonomous Agents and Multi Agent Systems

Y2 - 4 May 2015 through 8 May 2015

ER -

An efficient knowledge transfer solution to a novel SMDP formalization of a broker's decision problem

Abstract

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this