Volume 33, Issue 6 pp. 3471-3490
RESEARCH ARTICLE

Optimal trajectory tracking control for a class of nonlinear nonaffine systems via generalized N-step value gradient learning

Mingming Zhao

Mingming Zhao

Faculty of Information Technology, Beijing University of Technology, Beijing, China

Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China

Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, China

Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, China

Search for more papers by this author
Ding Wang

Corresponding Author

Ding Wang

Faculty of Information Technology, Beijing University of Technology, Beijing, China

Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China

Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, China

Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, China

Correspondence Ding Wang, Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China.

Email: [email protected]

Search for more papers by this author
Junfei Qiao

Junfei Qiao

Faculty of Information Technology, Beijing University of Technology, Beijing, China

Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China

Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, China

Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, China

Search for more papers by this author
Lingzhi Hu

Lingzhi Hu

Faculty of Information Technology, Beijing University of Technology, Beijing, China

Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing, China

Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, China

Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing, China

Search for more papers by this author
First published: 27 December 2022
Citations: 3

Funding information: National Key Research and Development Program of China, Grant/Award Number: 2021ZD0112302; Beijing Natural Science Foundation, Grant/Award Number: JQ19013; National Natural Science Foundation of China, Grant/Award Numbers: 62222301; 61890930-5; 62021003

Summary

In this paper, the tracking control problem of unknown nonlinear systems is solved by using the generalized N-step value gradient learning algorithm with parameter λ $$ \lambda $$ [GNSVGL( λ $$ \lambda $$ )]. The GNSVGL( λ $$ \lambda $$ ) algorithm can provide optimal tracking decisions faster than traditional ones. Initialized by different positive semi-definite functions, the monotonicity and convergence properties of the proposed algorithm are proven. Under some conditions, the stability analysis of the value-iteration-based algorithm is provided. The one-return and λ $$ \lambda $$ -return critic neural networks are constructed to approximate the gradient of the one-return and λ $$ \lambda $$ -return cost functions. The action neural network is employed to approximate the control law of the error system. It is emphasized that one-return and λ $$ \lambda $$ -return critic networks are combined to train the action neural network. Finally, via conducting simulation studies and comparisons, the excellent tracking performance of the proposed algorithm is confirmed.

CONFLICT OF INTEREST

We confirm that there are no conflict of interests for this article.

DATA AVAILABILITY STATEMENT

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.