# American Institute of Mathematical Sciences

April 2021, 14(4): 1495-1518. doi: 10.3934/dcdss.2020377

## Optimal synchronization control of multiple Euler-Lagrange systems via event-triggered reinforcement learning

 The Key Laboratory of Advanced Control and Optimization for Chemical Processes, Ministry of Education, East China University of Science and Technology, Shanghai 200237, China

* Corresponding author: Yang Tang

Received January 2020; Revised January 2020; Published May 2020

In this paper, an event-triggered reinforcement learning-based method is developed for model-based optimal synchronization control of multiple Euler-Lagrange systems (MELSs) under a directed graph. The event-triggered optimal control strategy is derived by establishing the Hamilton-Jacobi-Bellman (HJB) equation, and the triggering condition is then proposed. An event-triggered policy iteration (PI) algorithm, borrowed from reinforcement learning, is then used to find the optimal solution. A single neural network represents the value function in order to find the analytical solution of the event-triggered HJB equation, and its weights are updated aperiodically. It is proved that both the synchronization error and the weight estimation error are uniformly ultimately bounded (UUB). Zeno behavior is also excluded. Finally, an example of multiple 2-DOF prototype manipulators is presented to validate the effectiveness of the method.
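As a rough illustration of the event-triggered idea in the abstract, the Python sketch below holds the control input constant between triggering instants and resamples the state only when a gap condition is violated. It is not the paper's method: the scalar plant `x' = a*x + b*u`, the gain `k`, the threshold `sigma`, and the relative-error triggering rule are all hypothetical choices made for illustration, and the function name `simulate_event_triggered` is an assumption.

```python
def simulate_event_triggered(x0=1.0, a=-0.5, b=1.0, k=2.0,
                             sigma=0.2, dt=0.001, steps=10000):
    """Simulate x' = a*x + b*u with u = -k * x_hat.

    x_hat is the state held from the most recent triggering instant,
    so the control is piecewise constant and updated aperiodically.
    All parameter values are illustrative assumptions.
    """
    x = x0
    x_hat = x0        # state sampled at the last triggering instant
    triggers = [0]    # step indices at which the control was updated
    for i in range(1, steps + 1):
        u = -k * x_hat                 # control uses the held sample
        x += dt * (a * x + b * u)      # forward-Euler integration step
        # Hypothetical triggering rule: resample once the gap between
        # the held sample and the current state exceeds sigma * |x|.
        if abs(x - x_hat) > sigma * abs(x):
            x_hat = x
            triggers.append(i)
    return x, triggers


if __name__ == "__main__":
    x_final, triggers = simulate_event_triggered()
    gaps = [t2 - t1 for t1, t2 in zip(triggers, triggers[1:])]
    print("final state:", x_final)
    print("control updates:", len(triggers), "of 10000 steps")
    print("smallest inter-event gap (steps):", min(gaps))
```

In this toy setting the control is updated far less often than once per integration step, and consecutive triggering instants stay separated by many steps, which is the discrete-time analogue of the Zeno-free property mentioned in the abstract.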

Citation: Yuan Xu, Xin Jin, Saiwei Wang, Yang Tang. Optimal synchronization control of multiple Euler-Lagrange systems via event-triggered reinforcement learning. Discrete & Continuous Dynamical Systems - S, 2021, 14(4): 1495-1518. doi: 10.3934/dcdss.2020377
##### Figures:

• Communication graph of MELSs
• Triggering instants for all agents
• Position trajectories of the first and second component of each EL agent
• Velocity trajectories of the first and second component of each EL agent
• Synchronization errors of the first and second component of each EL agent
• Control policies of the first and second component of each EL agent under event-triggered mechanism
• Norm of estimated weights of the critic neural network
• Validation of Assumption 6 for agent 1

Table: Notations, values and units of the corresponding physical parameters

| Notation | Value | Unit |
|---|---|---|
| $m_a$ | 1.2 | $kg$ |
| $m_b$ | 1 | $kg$ |
| $l_{ca}$ | 0.75 | $m$ |
| $l_{cb}$ | 0.75 | $m$ |
| $l_a$ | 0.26 | $m$ |
| $l_b$ | 0.5 | $m$ |
| $I_{ca}$ | 0.125 | $kg\cdot m^2$ |
| $I_{cb}$ | 0.188 | $kg\cdot m^2$ |
| $g$ | 9.81 | $m/s^2$ |
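
To show how the tabulated parameters could enter an Euler-Lagrange model, the sketch below evaluates the inertia matrix of a planar two-link manipulator in Python. This page does not give the manipulator dynamics, so the textbook two-link form used here is an assumption, as are the interpretation of $l_{ca}$, $l_{cb}$ as centre-of-mass distances and the names `PARAMS` and `inertia_matrix`. The parameters $l_b$ and $g$ do not appear in the inertia matrix and are listed only for completeness.

```python
import math

# Values copied from the parameter table; the dictionary itself is illustrative.
PARAMS = {
    "m_a": 1.2, "m_b": 1.0,        # link masses [kg]
    "l_ca": 0.75, "l_cb": 0.75,    # assumed: distances to centres of mass [m]
    "l_a": 0.26, "l_b": 0.5,       # link lengths [m]
    "I_ca": 0.125, "I_cb": 0.188,  # moments of inertia [kg*m^2]
    "g": 9.81,                     # gravitational acceleration [m/s^2]
}


def inertia_matrix(q2, p=PARAMS):
    """Inertia matrix M(q) of an assumed standard planar two-link arm.

    M depends only on the second joint angle q2 (in radians).
    Returns a symmetric 2x2 matrix as a nested list.
    """
    c2 = math.cos(q2)
    m11 = (p["m_a"] * p["l_ca"] ** 2
           + p["m_b"] * (p["l_a"] ** 2 + p["l_cb"] ** 2
                         + 2.0 * p["l_a"] * p["l_cb"] * c2)
           + p["I_ca"] + p["I_cb"])
    m12 = p["m_b"] * (p["l_cb"] ** 2 + p["l_a"] * p["l_cb"] * c2) + p["I_cb"]
    m22 = p["m_b"] * p["l_cb"] ** 2 + p["I_cb"]
    return [[m11, m12], [m12, m22]]


if __name__ == "__main__":
    M = inertia_matrix(0.0)
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    print("M(q2=0) =", M)
    print("det M =", det)  # positive for a physically valid inertia matrix
```

Under these assumptions the matrix is symmetric and positive definite, which is the structural property that Euler-Lagrange synchronization analyses typically rely on.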
