Model-free Optimal Control for Underactuated Quadrotor Aircraft via Reinforcement Learning

Quynh Nga Duong, Ngoc Trung Dang

Abstract


The control of Unmanned Aerial Vehicles (UAVs), especially quadrotor aircraft, has many practical applications such as transporting, mapping, rescue, and agricultural applications. This paper investigates solving the optimal tracking control problem for a quadrotor system. First, an underactuated quadrotor system is considered a highly nonlinear system with six degrees of freedom and four inputs. Second, a hierarchical control structure consisting of position and attitude controller is adopted to address the underactuated problem, the position controller to achieve the desired tracking and generates the references for the attitude controller, and the attitude controller to achieve the reference attitude tracking. Third, to achieve optimal trajectory tracking, two Data- based Reinforcement Learning (RL) algorithms are applied to both position and attitude controllers to find the optimal control input by using the input- output quadrotor system data. Compared with the traditional optimal algorithms which require directly solving the Algebraic Ricatti Equation (ARE) or the Hamilton-Jacobi-Bellman (HJB) equation. It is impossible or difficult to implement due to the high nonlinear dynamic nature of the quadrotor system. By using RL in the proposed method, optimal policies can be learned without the knowledge of quadrotor dynamic information. Applying the learning control policies to the quadrotor system, the vehicle achieves optimal trajectory tracking. Finally, a simulation result is conducted to verify the optimal trajectory tracking for quadrotor with the proposed controller.

Keywords


Quadrotor System; Reinforcement Learning; Optimal Tracking Control; Data-based Control.

Full Text:

PDF

References


G. Hoffmann, S. Waslander, and C. Tomlin, "Quadrotor helicopter trajectory tracking control," In AIAA guidance, navigation and control conference and exhibit, p. 7410, 2008, doi: 10.2514/6.2008-7410.

A. Das, F. Lewis, and K. Subbarao, "Backstepping approach for controlling a quadrotor using lagrange form dynamics," Journal of Intelligent and Robotic Systems, vol. 56, pp. 127-151, 2009, doi: 10.1007/s10846-009-9331-0.

Z. Zuo, "Trajectory tracking control design with command-filtered compensation for a quadrotor," IET control theory & applications, vol. 4, no. 11, pp. 2343-2355, 2010, doi: 10.1049/iet-cta.2009.0336.

M. Huang, B. Xian, C. Diao, K. Yang, and Y. Feng, "Adaptive tracking control of underactuated quadrotor unmanned aerial vehicles via backstepping," Proceedings of the 2010 American Control Conference, pp. 2076-2081, 2010, doi: 10.1109/ACC.2010.5r531424.

Y. C. Liu and T. W. Ou, "Non‐linear adaptive tracking control for quadrotor aerial robots under uncertain dynamics," IET Control Theory & Applications, vol. 15, no. 8, pp. 1126-1139, 2021, doi: 10.1049/cth2.12112.

O. Mofid and S. Mobayen, "Adaptive sliding mode control for finite-time stability of quad-rotor UAVs with parametric uncertainties," ISA transactions, vol. 72, pp. 1-14, 2018, doi: 10.1016/j.isatra.2017.11.010.

M. Labbadi and M. Cherkaoui, "Robust adaptive nonsingular fast terminal sliding-mode tracking control for an uncertain quadrotor UAV subjected to disturbances," ISA transactions, vol. 99, pp. 290-304, 2020, doi: 10.1016/j.isatra.2019.10.012.

M. Labbadi and M. Cherkaoui, "Robust adaptive backstepping fast terminal sliding mode controller for uncertain quadrotor UAV," Aerospace Science and Technology, vol. 93, p. 105306, 2019, doi: 10.1016/j.ast.2019.105306.

T. Huang et al., "Robust tracking control of a quadrotor UAV based on adaptive sliding mode controller," Complexity, vol. 2019, no. 1, p. 7931632, 2019, doi: 10.1155/2019/7931632.

F. L. Lewis, D. Vrabie, and V. L. Syrmos. Optimal control. John Wiley & Sons, 2012, doi: 10.1002/9781118122631.

B. Kiumarsi et al., "Optimal and autonomous control using reinforcement learning: A survey," IEEE transactions on neural networks and learning systems, vol. 29, no. 6, pp. 2042-2062, 2017.

N. O. Lambert, D. S. Drew, J. Yaconelli, S. Levine, R. Calandra, and K. S. J. Pister, "Low-Level Control of a Quadrotor With Deep Model-Based Reinforcement Learning," in IEEE Robotics and Automation Letters, vol. 4, no. 4, pp. 4224-4230, 2019, doi: 10.1109/LRA.2019.2930489.

J. Hwangbo, I. Sa, R. Siegwart, and M. Hutter, "Control of a Quadrotor With Reinforcement Learning," in IEEE Robotics and Automation Letters, vol. 2, no. 4, pp. 2096-2103, 2017, doi: 10.1109/LRA.2017.2720851.

G. Wen, W. Hao, W. Feng and K. Gao, "Optimized Backstepping Tracking Control Using R,einforcement Learning for Quadrotor Unmanned Aerial Vehicle System," in IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 52, no. 8, pp. 5004-5015, Aug. 2022, doi: 10.1109/TSMC.2021.3112688.

S. Li, P. Durdevic, and Z. Yang, “Optimal tracking control based on integral reinforcement learning for an underactuated drone,” IFAC-PapersOnLine, vol. 52, no. 8, pp. 55-60. 2019, doi: 10.1016/j.ifacol.2019.08.048.

H. Zhang, F. L. Lewis, and A. Das, "Optimal Design for Synchronization of Cooperative Systems: State Feedback, Observer and Output Feedback," in IEEE Transactions on Automatic Control, vol. 56, no. 8, pp. 1948-1952, 2011, doi: 10.1109/TAC.2011.2139510.

Nusawardhana, S. H. Zak, and W. A. Crossley, “Nonlinear synergetic optimal controllers,” Journal of guidance, control, and dynamics, vol. 30, no. 4, pp. 1134-1147, 2007, doi: 10.2514/1.27829.

H. Liu, D. Li, Z. Zuo, and Y. Zhong, "Robust Three-Loop Trajectory Tracking Control for Quadrotors With Multiple Uncertainties," in IEEE Transactions on Industrial Electronics, vol. 63, no. 4, pp. 2263-2274, 2016, doi: 10.1109/TIE.2016.2514339.

H. Liu, Y. Wang, and J. Xi, "Completely distributed formation control for networked quadrotors under switching communication topologies," Systems & Control Letters, vol. 147, p. 104841, 2021, doi: 10.1016/j.sysconle.2020.104841.

F. A. Yaghmaie and D. J. Braun, "Reinforcement learning for a class of continuous-time input constrained optimal control problems," Automatica, vol. 99, pp. 221-227, 2019, doi: 10.1016/j.automatica.2018.10.038.

Y. Jiang and Z. P. Jiang "Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics," Automatica, vol. 48, no. 10, pp. 2699-2704, 2012, doi: 10.1016/j.automatica.2012.06.096.

H. -N. Wu and B. Luo, "Neural Network Based Online Simultaneous Policy Update Algorithm for Solving the HJI Equation in Nonlinear H∞ Control," in IEEE Transactions on Neural Networks and Learning Systems, vol. 23, no. 12, pp. 1884-1895, 2012, doi: 10.1109/TNNLS.2012.2217349.

H. Modares and F. L. Lewis, "Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning," Automatica vol. 50, no. 7, pp. 1780-1792, 2014, doi: 10.1016/j.automatica.2014.05.011.

H. Du, W. Zhu, G. Wen, Z. Duan, and J. Lü, "Distributed Formation Control of Multiple Quadrotor Aircraft Based on Nonsmooth Consensus Algorithms," in IEEE Transactions on Cybernetics, vol. 49, no. 1, pp. 342-353, 2019, doi: 10.1109/TCYB.2017.2777463.

H. S. Nguyen, D. T. Le, V. T. Dang, D. H. Nguyen, A. T. Le, and T. L. Nguyen, "Advanced Motion Control of a Quadrotor Unmanned Aerial Vehicle based on Extended State Observer," 2023 International Conference on System Science and Engineering (ICSSE), pp. 381-386, 2023, doi: 10.1109/ICSSE58758.2023.10227212.

X. Shao, G. Sun, W. Yao, J. Liu, and L. Wu, "Adaptive Sliding Mode Control for Quadrotor UAVs With Input Saturation," in IEEE/ASME Transactions on Mechatronics, vol. 27, no. 3, pp. 1498-1509, 2022, doi: 10.1109/TMECH.2021.3094575.

S. Lian et al., "Adaptive Attitude Control of a Quadrotor Using Fast Nonsingular Terminal Sliding Mode," in IEEE Transactions on Industrial Electronics, vol. 69, no. 2, pp. 1597-1607, 2022, doi: 10.1109/TIE.2021.3057015.

Z. Liu, X. Liu, J. Chen, and C. Fang, "Altitude Control for Variable Load Quadrotor via Learning Rate Based Robust Sliding Mode Controller," in IEEE Access, vol. 7, pp. 9736-9744, 2019, doi: 10.1109/ACCESS.2018.2890450.

L. Yu, G. He, X. Wang, and L. Shen, "A Novel Fixed-Time Sliding Mode Control of Quadrotor With Experiments and Comparisons," in IEEE Control Systems Letters, vol. 6, pp. 770-775, 2022, doi: 10.1109/LCSYS.2021.3086389.

J. Xiong, J. Pan, G. Chen, X. Zhang, and F. Ding, "Sliding Mode Dual-Channel Disturbance Rejection Attitude Control for a Quadrotor," in IEEE Transactions on Industrial Electronics, vol. 69, no. 10, pp. 10489-10499, 2022, doi: 10.1109/TIE.2021.3137600.

B. Li, W. Gong, Y. Yang, B. Xiao, and D. Ran, "Appointed Fixed Time Observer-Based Sliding Mode Control for a Quadrotor UAV Under External Disturbances," in IEEE Transactions on Aerospace and Electronic Systems, vol. 58, no. 1, pp. 290-303, 2022, doi: 10.1109/TAES.2021.3101562.

A. Eltayeb, M. F. Rahmat, M. A. M. Basri, M. A. M. Eltoum, and M. S. Mahmoud, "Integral Adaptive Sliding Mode Control for Quadcopter UAV Under Variable Payload and Disturbance," in IEEE Access, vol. 10, pp. 94754-94764, 2022, doi: 10.1109/ACCESS.2022.3203058.

N. P. Nguyen, H. Oh, and J. Moon, "Continuous Nonsingular Terminal Sliding-Mode Control With Integral-Type Sliding Surface for Disturbed Systems: Application to Attitude Control for Quadrotor UAVs Under External Disturbances," in IEEE Transactions on Aerospace and Electronic Systems, vol. 58, no. 6, pp. 5635-5660, 2022, doi: 10.1109/TAES.2022.3177580.

O. Mofid and S. Mobayen, "Adaptive Finite-Time Backstepping Global Sliding Mode Tracker of Quad-Rotor UAVs Under Model Uncertainty, Wind Perturbation, and Input Saturation," in IEEE Transactions on Aerospace and Electronic Systems, vol. 58, no. 1, pp. 140-151, 2022, doi: 10.1109/TAES.2021.3098168.

B. Liu, Y. Wang, O. Mofid, S. Mobayen, and M. H. Khooban, "Barrier Function-Based Backstepping Fractional-Order Sliding Mode Control for Quad-Rotor Unmanned Aerial Vehicle Under External Disturbances," in IEEE Transactions on Aerospace and Electronic Systems, vol. 60, no. 1, pp. 716-728, 2024, doi: 10.1109/TAES.2023.3328801.

W. Liu, M. Chen, and P. Shi, "Fixed-Time Disturbance Observer-Based Control for Quadcopter Suspension Transportation System," in IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 69, no. 11, pp. 4632-4642, 2022, doi: 10.1109/TCSI.2022.3193878.

V. K. Tripathi, A. K. Kamath, L. Behera, N. K. Verma, and S. Nahavandi, "An Adaptive Fast Terminal Sliding-Mode Controller With Power Rate Proportional Reaching Law for Quadrotor Position and Altitude Tracking," in IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 52, no. 6, pp. 3612-3625, 2022, doi: 10.1109/TSMC.2021.3072099.

Q. Han, Z. Liu, H. Su, and X. Liu, "Filter-Based Disturbance Observer and Adaptive Control for Euler–Lagrange Systems With Application to a Quadrotor UAV," in IEEE Transactions on Industrial Electronics, vol. 70, no. 8, pp. 8437-8445, 2023, doi: 10.1109/TIE.2022.3224167.

H. Maqsood and Y. Qu. "Nonlinear disturbance observer based sliding mode control of quadrotor helicopter," Journal of Electrical Engineering & Technology, vol. 15, pp. 1453-1461, 2020, doi: 10.1007/s42835-020-00421-w.

M. Moussa and M. Cherkaoui, "Robust integral terminal sliding mode control for quadrotor UAV with external disturbances," International Journal of Aerospace Engineering, vol. 2019, no.1, p. 2016416, 2019, doi: 10.1155/2019/2016416.

H. Razmi and S. Afshinfar, "Neural network-based adaptive sliding mode control design for position and attitude control of a quadrotor UAV," Aerospace Science and technology, vol. 91, pp. 12-27, 2019, doi: 10.1016/j.ast.2019.04.055.

X. Lin, Y. Wang, and Y. Liu, "Neural‐network‐based robust terminal sliding‐mode control of quadrotor," Asian Journal of Control, vol. 24, no. 1, pp. 427-438, 2022, doi: 10.1002/asjc.2478.

R. Falcón, H. Ríos, and A. Dzul, "Comparative analysis of continuous sliding-modes control strategies for quad-rotor robust tracking," Control Engineering Practice, vol. 90, pp. 241-256, 2019, doi: 10.1016/j.conengprac.2019.06.013.

M. Vahdanipour and M. Khodabandeh, "Adaptive fractional order sliding mode control for a quadrotor with a varying load," Aerospace Science and Technology, vol. 86, pp. 737-747, 2019, doi: 10.1016/j.ast.2019.105306.

X. Wang et al., "Quadrotor fault tolerant incremental sliding mode control driven by sliding mode disturbance observers," Aerospace Science and Technology, vol. 87, pp. 417-430, 2019, doi: 10.1016/j.ast.2019.03.001.

Z. Hou, Zhiwei, P. Lu, and Z. Tu, "Nonsingular terminal sliding mode control for a quadrotor UAV with a total rotor failure," Aerospace Science and Technology, vol. 98, p. 105716, 2020, doi: 10.1016/j.ast.2020.105716.

G. Guoqiang et al., "Output Feedback Adaptive Dynamic Surface Sliding‐Mode Control for Quadrotor UAVs with Tracking Error Constraints," Complexity, vol. 2020, no. 1, p. 8537198, 2020, doi: 10.1155/2020/8537198.

L. Chen et al., "Robust trajectory tracking control for a quadrotor using recursive sliding mode control and nonlinear extended state observer," Aerospace Science and Technology, vol. 128, p. 107749, 2022, doi: 10.1016/j.ast.2022.107749.

A. Noordin et al., "Adaptive PID controller using sliding mode control approaches for quadrotor UAV attitude and position stabilization," Arabian Journal for Science and Engineering, vol. 46, pp. 963-981, 2021, doi: 10.1007/s13369-020-04742-w.

X. Ai and J. Yu. "Fixed-time trajectory tracking for a quadrotor with external disturbances: A flatness-based sliding mode control approach," Aerospace Science and Technology, vol. 89, pp. 58-76, 2019, doi: 10.1016/j.ast.2019.03.059.

H. Hassani, A. Mansouri, and A. Ahaitouf, "Robust autonomous flight for quadrotor UAV based on adaptive nonsingular fast terminal sliding mode control," International Journal of Dynamics and Control, vol. 9, pp. 619-635, 2021, doi: 10.1007/s40435-020-00666-3.

S. Ullah et al., "Robust integral sliding mode control design for stability enhancement of under-actuated quadcopter," International Journal of Control, Automation and Systems, vol. 18, pp. 1671-1678, 2020, doi: 10.1007/s12555-019-0302-3.

M. Labbadi and M. Cherkaoui, "Novel robust super twisting integral sliding mode controller for a quadrotor under external disturbances," International Journal of Dynamics and Control, vol. 8, no. 3, pp. 805-815, 2020, doi: 10.1007/s40435-019-00599-6.

H. Ghadiri, M. Emami, and H. Khodadadi, "Adaptive super-twisting non-singular terminal sliding mode control for tracking of quadrotor with bounded disturbances," Aerospace Science and Technology, vol. 112, p. 106616, 2021, doi: 10.1016/j.ast.2021.106616.

S. C. Yogi, V. K. Tripathi, and L. Behera, "Adaptive Integral Sliding Mode Control Using Fully Connected Recurrent Neural Network for Position and Attitude Control of Quadrotor," in IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 12, pp. 5595-5609, 2021, doi: 10.1109/TNNLS.2021.3071020.

J. Pan et al., "Attitude control of quadrotor UAVs based on adaptive sliding mode," International Journal of Control, Automation and Systems, vol. 21, no. 8, pp. 2698-2707, 2023, doi: 10.1007/s12555-022-0189-2.

L. -X. Xu, H. -J. Ma, D. Guo, A. -H. Xie, and D. -L. Song, "Backstepping Sliding-Mode and Cascade Active Disturbance Rejection Control for a Quadrotor UAV," in IEEE/ASME Transactions on Mechatronics, vol. 25, no. 6, pp. 2743-2753, 2020, doi: 10.1109/TMECH.2020.2990582.

M. Pouzesh and S. Mobayen, "Event-triggered fractional-order sliding mode control technique for stabilization of disturbed quadrotor unmanned aerial vehicles," Aerospace Science and Technology, vol. 121, p. 107337, 2022, doi: 10.1016/j.ast.2022.107337.

Z. Zhu and S. Cao. "Back‐stepping sliding mode control method for quadrotor UAV with actuator failure," The Journal of Engineering, vol. 2019, no. 22, pp. 8374-8377, 2019, doi: 10.1049/joe.2019.1084.

X. Wu, B. Xiao, and Y. Qu, "Modeling and sliding mode-based attitude tracking control of a quadrotor UAV with time-varying mass," ISA transactions, vol. 124, pp. 436-443, 2022, doi: 10.1016/j.isatra.2019.08.017.

O. Mofid et al., "Desired tracking of delayed quadrotor UAV under model uncertainty and wind disturbance using adaptive super-twisting terminal sliding mode control," ISA transactions, vol. 123, pp. 455-471, 2022, doi: 10.1016/j.isatra.2021.06.002.

K. Alattas et al., "Barrier function adaptive nonsingular terminal sliding mode control approach for quad-rotor unmanned aerial vehicles," Sensors vol. 22, no. 3, p. 909, 2022, doi: 10.3390/s22030909.

W. Gong et al., "Fixed-time integral-type sliding mode control for the quadrotor UAV attitude stabilization under actuator failures," Aerospace Science and Technology vol. 95, p. 105444, 2019, doi: 10.1016/j.ast.2019.105444.

L. Chen et al., "Robust adaptive recursive sliding mode attitude control for a quadrotor with unknown disturbances," ISA transactions, vol. 122, pp. 114-125, 2022, doi: 10.1016/j.isatra.2021.04.046.

V. K. Tripathi et al., "Finite‐time super twisting sliding mode controller based on higher‐order sliding mode observer for real‐time trajectory tracking of a quadrotor," IET Control Theory & Applications, vol. 14, no. 16, pp. 2359-2371, 2020, doi: 10.1049/iet-cta.2020.0348.

Z. Zhao et al., "High-order sliding mode observer-based trajectory tracking control for a quadrotor UAV with uncertain dynamics," Nonlinear Dynamics, vol. 102, pp. 2583-2596, 2020, doi: 10.1007/s11071-020-06050-2.

T. Jiang, T. Song, and D. Lin, "Integral sliding mode based control for quadrotors with disturbances: Simulations and experiments," International Journal of Control, Automation and Systems, vol. 17, pp. 1987-1998, 2019, doi: 10.1007/s12555-018-0500-4.

A. Najafi et al., "Adaptive barrier fast terminal sliding mode actuator fault tolerant control approach for quadrotor UAVs," Mathematics, vol. 10, no. 16, p. 3009, 2022, doi: 10.3390/math10163009.

W. Alqaisi et al., "Three-loop uncertainties compensator and sliding mode quadrotor control," Computers & Electrical Engineering, vol. 81, p. 106507, 2020, doi: 10.1016/j.compeleceng.2019.106507.

Z. Hou, X. Yu, and P. Lu, "Terminal sliding mode control for quadrotors with chattering reduction and disturbances estimator: Theory and application," Journal of Intelligent & Robotic Systems, vol. 105, no. 4, p. 71, 2022, doi: 10.1007/s10846-022-01679-0.

G. Xu et al., "Adaptive prescribed performance terminal sliding mode attitude control for quadrotor under input saturation," IET Control Theory & Applications, vol. 14, no. 17, pp. 2473-2480, 2020, doi: 10.1049/iet-cta.2019.0488.

Y. Nettari, M. Labbadi, and S. Kurt, "Adaptive backstepping integral sliding mode control combined with super-twisting algorithm for nonlinear UAV quadrotor system," IFAC-PapersOnLine, vol. 55, no. 12, pp. 264-269, 2022, doi: 10.1016/j.ifacol.2022.07.322.

S. Ullah et al., "Neuro-adaptive fast integral terminal sliding mode control design with variable gain robust exact differentiator for under-actuated quadcopter UAV," ISA transactions, vol. 120, pp. 293-304, 2022, doi: 10.1016/j.isatra.2021.02.045.

T. Kusznir and J. Smoczek, "Sliding mode‐based control of a UAV quadrotor for suppressing the cable‐suspended payload vibration," Journal of Control Science and Engineering, vol. 2020, no. 1, p. 5058039, 2020, doi: 10.1155/2020/5058039.

G. Xu et al., "Adaptive sliding mode disturbance observer–based funnel trajectory tracking control of quadrotor with external disturbances," IET Control Theory & Applications, vol. 15, no. 13, pp. 1778-1788, 2021, doi: 10.1049/cth2.12159.

S. C. Yogi, L. Behera, and S. Nahavandi, "Adaptive Intelligent Minimum Parameter Singularity Free Sliding Mode Controller Design for Quadrotor," in IEEE Transactions on Automation Science and Engineering, vol. 21, no. 2, pp. 1805-1823, 2024, doi: 10.1109/TASE.2023.3243660.

Z. Li, X. Ma, and Y. Li, "Robust tracking control strategy for a quadrotor using RPD-SMC and RISE," Neurocomputing, vol. 331, pp. 312-322, 2019, doi: 10.1016/j.neucom.2018.11.070.

B. Gao, Y. J. Liu, and L. Liu, "Adaptive neural fault-tolerant control of a quadrotor UAV via fast terminal sliding mode," Aerospace Science and Technology, vol. 129, p. 107818, 2022, doi: 10.1016/j.ast.2022.107818.

Ö. Bingöl and H. M. Güzey, "Finite-time neuro-sliding-mode controller design for quadrotor uavs carrying suspended payload," Drones vol. 6, no. 10, p. 311, 2022, doi: 10.3390/drones6100311.

B. Wang, et al., "An adaptive sliding mode fault‐tolerant control of a quadrotor unmanned aerial vehicle with actuator faults and model uncertainties," International Journal of Robust and Nonlinear Control, vol. 33, no. 17, pp. 10182-10198, 2023, doi: 10.1002/rnc.6631.




DOI: https://doi.org/10.18196/jrc.v5i6.23585

Refbacks

  • There are currently no refbacks.


Copyright (c) 2024 Quynh Nga Duong, Ngoc Trung Dang

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

 


Journal of Robotics and Control (JRC)

P-ISSN: 2715-5056 || E-ISSN: 2715-5072
Organized by Peneliti Teknologi Teknik Indonesia
Published by Universitas Muhammadiyah Yogyakarta in collaboration with Peneliti Teknologi Teknik Indonesia, Indonesia and the Department of Electrical Engineering
Website: http://journal.umy.ac.id/index.php/jrc
Email: jrcofumy@gmail.com


Kuliah Teknik Elektro Terbaik