Adaptive dynamic programming

Murray, J.; Cox, C.; Lendaris, G.G.; Saeks, R.

doi:10.1109/tsmcc.2002.801727

articleIEEE Transactions on Systems Man and Cybernetics Part C (Applications and Reviews)May 1, 2002Closed access

Adaptive dynamic programming

JMJ. Murray CCC. Cox GLG.G. Lendaris RSR. Saeks

State University of New York · Stony Brook University · +2 more institutions

Indexed incrossref

Abstract

Unlike the many soft computing applications where it suffices to achieve a "good approximation most of the time," a control system must be stable all of the time. As such, if one desires to learn a control law in real-time, a fusion of soft computing techniques to learn the appropriate control law with hard computing techniques to maintain the stability constraint and guarantee convergence is required. The objective of the paper is to describe an adaptive dynamic programming algorithm (ADPA) which fuses soft computing techniques to learn the optimal cost (or return) functional for a stabilizable nonlinear system with unknown dynamics and hard computing techniques to verify the stability and convergence of the…

Citation impact

674

total citations

FWCI: 4.55
Percentile: 100%
References: 35

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Dynamic programming
Computer science
Convergence (economics)
Optimal control
Mathematical optimization
Bellman equation
Nonlinear system
Stability (learning theory)

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.