Volume 32, Issue 18 pp. 10180-10200
RESEARCH ARTICLE

Performance-guaranteed containment control for pure-feedback multi agent systems via reinforcement learning algorithm

Ao Luo

Ao Luo

School of Automation and Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, China

Search for more papers by this author
Wenbin Xiao

Corresponding Author

Wenbin Xiao

School of Automation and Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, China

School of Information and Electrical Engineering, Hunan University of Science and Technology, Xiangtan, China

Correspondence Wenbin Xiao, School of Information and Electrical Engineering, Hunan University of Science and Technology, Xiangtan 411201, China.

Email: [email protected]

Search for more papers by this author
Xiao-Meng Li

Xiao-Meng Li

School of Automation and Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, China

Search for more papers by this author
Deyin Yao

Deyin Yao

School of Automation and Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, China

Search for more papers by this author
Qi Zhou

Qi Zhou

School of Automation and Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, China

Search for more papers by this author
First published: 20 September 2022
Citations: 3

Funding information: National Natural Science Foundation of China, Grant/Award Numbers: 61973091; 62003098; The China Postdoctoral Science Foundation, Grant/Award Numbers: 2021TQ0079; 2021M700883; The Key Area Research and Development Program of Guangdong Province, Grant/Award Number: 2021B0101410005; The Local Innovative and Research Teams Project of Guangdong Special Support Program, Grant/Award Number: 2019BT02X353

Abstract

In this article, a performance-guaranteed containment control scheme based on reinforcement learning (RL) algorithm is proposed for a class of pure-feedback multi agent systems (MASs) with unmeasurable states. The unknown nonlinear functions are approximated by the neural networks (NNs) and an adaptive NN state observer is designed for the states estimation. Based on estimated states, the algebraic loop problem can be removed by introducing filtered signals, and the actor-critic architecture of RL algorithm is employed to acquire the optimal controller in the framework of backstepping. Different from many optimal strategies, this article proposes a simpler mechanism based on the uniqueness of the optimal solution to obtain the actor and critic updating laws instead of gradient descent algorithm with complicated calculation. In addition, predefined performance function and an improved error transformation technique are utilized to guarantee the containment error within a prescribed boundary. By using Lyapunov stability theory and graph theory, the stability of the closed-loop system can be demonstrated. Finally, the effectiveness of the method proposed in this article is verified by a simulation example.

CONFLICT OF INTEREST

The authors declare that there is no conflict of interest.

DATA AVAILABILITY STATEMENT

Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.