Title Learning stabilization control of quadrotor in near-ground setting using reinforcement learning /
Authors Briliauskas, Mantas
DOI 10.5755/j01.itc.53.1.35135
Full Text Download
Is Part of InformacinÄ—s technologijos ir valdymas = Information technology and control.. Kaunas : Technologija. 2024, vol. 53, no. 1, p. 237-242.. ISSN 1392-124X. eISSN 2335-884X
Keywords [eng] quadrotor ; stabilization ; reinforcement learning ; PPO ; reward function for near ground flight
Abstract [eng] With the development of intelligent systems, the popularity of using micro aerial vehicles (MAV) increases significantly in the fields of rescue, photography, security, agriculture, and warfare. New modern solutions of machine learning like ChatGPT that are fine-tuned using reinforcement learning (RL) provides evidence of new trends in seeking general artificial intelligence. RL has already been proven to work as a flight controller for MAV performing better than Proportional Integral Derivative (PID)-based solutions. However, using negative Euclidean distance to the target point as the reward function is sufficient in obstacle-free spaces, e.g. in the air, but fails in special cases, e.g. when training near the ground. In this work, we address this issue by proposing a new reward function with early termination. It not only allows to successfully train Proximal Policy Optimization (PPO) algorithm to stabilize the quadrotor in the near-ground setting, but also achieves lower Euclidean distance error compared to the baseline setup.
Published Kaunas : Technologija
Type Journal article
Language English
Publication date 2024
CC license CC license description