Learning stabilization control of quadrotor in near-ground setting using reinforcement learning

Mantas Briliauskas

doi:10.5755/j01.itc.53.1.35135

Title	Learning stabilization control of quadrotor in near-ground setting using reinforcement learning
Authors	Briliauskas, Mantas
DOI	10.5755/j01.itc.53.1.35135
Full Text
Is Part of	Informacinės technologijos ir valdymas = Information technology and control.. Kaunas : Technologija. 2024, vol. 53, no. 1, p. 237-242.. ISSN 1392-124X. eISSN 2335-884X
Keywords [eng]	quadrotor ; stabilization ; reinforcement learning ; PPO ; reward function for near ground flight
Abstract [eng]	With the development of intelligent systems, the popularity of using micro aerial vehicles (MAV) increases significantly in the fields of rescue, photography, security, agriculture, and warfare. New modern solutions of machine learning like ChatGPT that are fine-tuned using reinforcement learning (RL) provides evidence of new trends in seeking general artificial intelligence. RL has already been proven to work as a flight controller for MAV performing better than Proportional Integral Derivative (PID)-based solutions. However, using negative Euclidean distance to the target point as the reward function is sufficient in obstacle-free spaces, e.g. in the air, but fails in special cases, e.g. when training near the ground. In this work, we address this issue by proposing a new reward function with early termination. It not only allows to successfully train Proximal Policy Optimization (PPO) algorithm to stabilize the quadrotor in the near-ground setting, but also achieves lower Euclidean distance error compared to the baseline setup.
Published	Kaunas : Technologija
Type	Journal article
Language	English
Publication date	2024
CC license

„Learning stabilization control of quadrotor in near-ground setting using reinforcement learning“