Adaptive dynamic programming
State University of New York · Stony Brook University · +2 more institutions
Abstract
Unlike the many soft computing applications where it suffices to achieve a "good approximation most of the time," a control system must be stable all of the time. As such, if one desires to learn a control law in real-time, a fusion of soft computing techniques to learn the appropriate control law with hard computing techniques to maintain the stability constraint and guarantee convergence is required. The objective of the paper is to describe an adaptive dynamic programming algorithm (ADPA) which fuses soft computing techniques to learn the optimal cost (or return) functional for a stabilizable nonlinear system with unknown dynamics and hard computing techniques to verify the stability and convergence of the…
Citation impact
- FWCI
- 4.55
- Percentile
- 100%
- References
- 35
Authors
4Topics & keywords
- Dynamic programming
- Computer science
- Convergence (economics)
- Optimal control
- Mathematical optimization
- Bellman equation
- Nonlinear system
- Stability (learning theory)
- Peace, Justice and strong institutions