Fully probabilistic control design in an adaptive critic framework

Randa Herzallah, Miroslav Kárný

    Research output: Contribution to journalArticlepeer-review

    Abstract

    Optimal stochastic controller pushes the closed-loop behavior as close as possible to the desired one. The fully probabilistic design (FPD) uses probabilistic description of the desired closed loop and minimizes Kullback-Leibler divergence of the closed-loop description to the desired one. Practical exploitation of the fully probabilistic design control theory continues to be hindered by the computational complexities involved in numerically solving the associated stochastic dynamic programming problem. In particular very hard multivariate integration and an approximate interpolation of the involved multivariate functions. This paper proposes a new fully probabilistic contro algorithm that uses the adaptive critic methods to circumvent the need for explicitly evaluating the optimal value function, thereby dramatically reducing computational requirements. This is a main contribution of this short paper.
    Original languageEnglish
    Pages (from-to)1128-1135
    Number of pages8
    JournalNeural Networks
    Volume24
    Issue number10
    Early online date22 Jun 2011
    DOIs
    Publication statusPublished - Dec 2011

    Bibliographical note

    NOTICE: this is the author’s version of a work that was accepted for publication in Neural networks. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Herzallah, R & Kárný, M, 'Fully probabilistic control design in an adaptive critic framework' Neural networks, vol. 24, no. 10 (2011) DOI http://dx.doi.org/10.1016/j.neunet.2011.06.006

    Keywords

    • stochastic control design
    • fully probabilistic design
    • adaptive control
    • adaptive critic

    Fingerprint

    Dive into the research topics of 'Fully probabilistic control design in an adaptive critic framework'. Together they form a unique fingerprint.

    Cite this