Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems

Shandong Institute of Automation · Chinese Academy of Sciences · +1 more institution

PubMed
Indexed incrossrefpubmed

Abstract

This paper is concerned with a new discrete-time policy iteration adaptive dynamic programming (ADP) method for solving the infinite horizon optimal control problem of nonlinear systems. The idea is to use an iterative ADP technique to obtain the iterative control law, which optimizes the iterative performance index function. The main contribution of this paper is to analyze the convergence and stability properties of policy iteration method for discrete-time nonlinear systems for the first time. It shows that the iterative performance index function is nonincreasingly convergent to the optimal solution of the Hamilton-Jacobi-Bellman equation. It is also proven that any of the iterative control laws can…

Citation impact

737
total citations
FWCI
45.06
Percentile
100%
References
53
Citations per year

Authors

2

Topics & keywords

Keywords
  • Convergence (economics)
  • Iterative method
  • Optimal control
  • Dynamic programming
  • Nonlinear system
  • Mathematical optimization
  • Function (biology)
  • Computer science
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.

Funding