Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Comparative Evaluation of Reinforcement Learning and Model Predictive Control for 6DoF Position Control of an Autonomous Underwater Vehicle

  • Sümer Tunçay*
  • , Alain Andres
  • , Ignacio Carlucho
  • *Autor correspondiente de este trabajo

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

Autonomous Underwater Vehicles (AUVs) require precise and robust control strategies for 3D pose regulation in dynamic underwater environments. In this study, we present a comparative evaluation of model-free and model-based control methods for AUV position control. Specifically, we analyze the performance of neural network controllers trained by three Reinforcement Learning (RL) algorithms—Proximal Policy Optimization (PPO), Twin Delayed Deep Deterministic Policy Gradient (TD3), and Soft Actor-Critic (SAC)—alongside a Model Predictive Control (MPC) baseline. We train our RL methods in a simplified AUV simulator implemented in PyTorch, while our evaluation is done in a realistic marine robotics simulator called Stonefish. Controllers are evaluated on the basis of tracking accuracy, robustness to disturbances, and generalization capabilities. Our results show that, MPC suffers from unmodeled dynamics such as disturbances, whereas RL demonstrates adaptation capabilities to disturbances. Also, although MPC demonstrates strong control performance, it requires an accurate model, high compute power and a careful implementation to run in real-time whereas the control frequency of RL policies is only bound by the inference time of the policy network. Among RL-based controllers, PPO achieves the best overall performance, both in terms of training stability and control accuracy. This study provides insight into the feasibility of RL-based controllers for AUV position control, offering guidance for selecting suitable control strategies in real-world marine robotics applications.

Idioma originalInglés
Título de la publicación alojadaTowards Autonomous Robotic Systems - 26th Annual Conference, TAROS 2025, Proceedings
EditoresAna Cavalcanti, Simon Foster, Robert Richardson
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas381-394
Número de páginas14
ISBN (versión impresa)9783032014856
DOI
EstadoPublicada - 2026
Evento26th Annual Conference on Towards Autonomous Robotic Systems, TAROS 2025 - York, Reino Unido
Duración: 20 ago 202522 ago 2025

Serie de la publicación

NombreLecture Notes in Computer Science
Volumen16045 LNAI
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia26th Annual Conference on Towards Autonomous Robotic Systems, TAROS 2025
País/TerritorioReino Unido
CiudadYork
Período20/08/2522/08/25

Huella

Profundice en los temas de investigación de 'Comparative Evaluation of Reinforcement Learning and Model Predictive Control for 6DoF Position Control of an Autonomous Underwater Vehicle'. En conjunto forman una huella única.

Citar esto