Fully probabilistic control design in an adaptive critic framework

Randa Herzallah; Miroslav Kárný

doi:10.1016/j.neunet.2011.06.006

Fully probabilistic control design in an adaptive critic framework

Randa Herzallah, Miroslav Kárný

Research output: Contribution to journal › Article › peer-review

Abstract

Optimal stochastic controller pushes the closed-loop behavior as close as possible to the desired one. The fully probabilistic design (FPD) uses probabilistic description of the desired closed loop and minimizes Kullback-Leibler divergence of the closed-loop description to the desired one. Practical exploitation of the fully probabilistic design control theory continues to be hindered by the computational complexities involved in numerically solving the associated stochastic dynamic programming problem. In particular very hard multivariate integration and an approximate interpolation of the involved multivariate functions. This paper proposes a new fully probabilistic contro algorithm that uses the adaptive critic methods to circumvent the need for explicitly evaluating the optimal value function, thereby dramatically reducing computational requirements. This is a main contribution of this short paper.

Original language	English
Pages (from-to)	1128-1135
Number of pages	8
Journal	Neural Networks
Volume	24
Issue number	10
Early online date	22 Jun 2011
DOIs	https://doi.org/10.1016/j.neunet.2011.06.006
Publication status	Published - Dec 2011

Bibliographical note

NOTICE: this is the author’s version of a work that was accepted for publication in Neural networks. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Herzallah, R & Kárný, M, 'Fully probabilistic control design in an adaptive critic framework' Neural networks, vol. 24, no. 10 (2011) DOI http://dx.doi.org/10.1016/j.neunet.2011.06.006

Keywords

stochastic control design
fully probabilistic design
adaptive control
adaptive critic

Access to Document

10.1016/j.neunet.2011.06.006

Fully probabilistic control design in an adaptive critic framework
NOTICE: this is the author’s version of a work that was accepted for publication in Neural networks. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Herzallah, R & Kárný, M, 'Fully probabilistic control design in an adaptive critic framework' Neural networks, vol. 24, no. 10 (2011) DOI http://dx.doi.org/10.1016/j.neunet.2011.06.006
Accepted author manuscript, 229 KB

Cite this

@article{24ef67198f914e4b9c796a813bad8567,

title = "Fully probabilistic control design in an adaptive critic framework",

abstract = "Optimal stochastic controller pushes the closed-loop behavior as close as possible to the desired one. The fully probabilistic design (FPD) uses probabilistic description of the desired closed loop and minimizes Kullback-Leibler divergence of the closed-loop description to the desired one. Practical exploitation of the fully probabilistic design control theory continues to be hindered by the computational complexities involved in numerically solving the associated stochastic dynamic programming problem. In particular very hard multivariate integration and an approximate interpolation of the involved multivariate functions. This paper proposes a new fully probabilistic contro algorithm that uses the adaptive critic methods to circumvent the need for explicitly evaluating the optimal value function, thereby dramatically reducing computational requirements. This is a main contribution of this short paper.",

keywords = "stochastic control design, fully probabilistic design, adaptive control, adaptive critic",

author = "Randa Herzallah and Miroslav K{\'a}rn{\'y}",

note = "NOTICE: this is the author{\textquoteright}s version of a work that was accepted for publication in Neural networks. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Herzallah, R & K{\'a}rn{\'y}, M, 'Fully probabilistic control design in an adaptive critic framework' Neural networks, vol. 24, no. 10 (2011) DOI http://dx.doi.org/10.1016/j.neunet.2011.06.006",

year = "2011",

month = dec,

doi = "10.1016/j.neunet.2011.06.006",

language = "English",

volume = "24",

pages = "1128--1135",

journal = "Neural Networks",

issn = "0893-6080",

publisher = "Elsevier",

number = "10",

}

TY - JOUR

T1 - Fully probabilistic control design in an adaptive critic framework

AU - Herzallah, Randa

AU - Kárný, Miroslav

N1 - NOTICE: this is the author’s version of a work that was accepted for publication in Neural networks. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Herzallah, R & Kárný, M, 'Fully probabilistic control design in an adaptive critic framework' Neural networks, vol. 24, no. 10 (2011) DOI http://dx.doi.org/10.1016/j.neunet.2011.06.006

PY - 2011/12

Y1 - 2011/12

N2 - Optimal stochastic controller pushes the closed-loop behavior as close as possible to the desired one. The fully probabilistic design (FPD) uses probabilistic description of the desired closed loop and minimizes Kullback-Leibler divergence of the closed-loop description to the desired one. Practical exploitation of the fully probabilistic design control theory continues to be hindered by the computational complexities involved in numerically solving the associated stochastic dynamic programming problem. In particular very hard multivariate integration and an approximate interpolation of the involved multivariate functions. This paper proposes a new fully probabilistic contro algorithm that uses the adaptive critic methods to circumvent the need for explicitly evaluating the optimal value function, thereby dramatically reducing computational requirements. This is a main contribution of this short paper.

AB - Optimal stochastic controller pushes the closed-loop behavior as close as possible to the desired one. The fully probabilistic design (FPD) uses probabilistic description of the desired closed loop and minimizes Kullback-Leibler divergence of the closed-loop description to the desired one. Practical exploitation of the fully probabilistic design control theory continues to be hindered by the computational complexities involved in numerically solving the associated stochastic dynamic programming problem. In particular very hard multivariate integration and an approximate interpolation of the involved multivariate functions. This paper proposes a new fully probabilistic contro algorithm that uses the adaptive critic methods to circumvent the need for explicitly evaluating the optimal value function, thereby dramatically reducing computational requirements. This is a main contribution of this short paper.

KW - stochastic control design

KW - fully probabilistic design

KW - adaptive control

KW - adaptive critic

U2 - 10.1016/j.neunet.2011.06.006

DO - 10.1016/j.neunet.2011.06.006

M3 - Article

SN - 0893-6080

VL - 24

SP - 1128

EP - 1135

JO - Neural Networks

JF - Neural Networks

IS - 10

ER -

Fully probabilistic control design in an adaptive critic framework

Abstract

Bibliographical note

Keywords

Access to Document

Fingerprint

Cite this