Comparative Evaluation of Reinforcement Learning and Model Predictive Control for 6DoF Position Control of an Autonomous Underwater Vehicle

  • Sümer Tunçay*
  • , Alain Andres
  • , Ignacio Carlucho
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Autonomous Underwater Vehicles (AUVs) require precise and robust control strategies for 3D pose regulation in dynamic underwater environments. In this study, we present a comparative evaluation of model-free and model-based control methods for AUV position control. Specifically, we analyze the performance of neural network controllers trained by three Reinforcement Learning (RL) algorithms—Proximal Policy Optimization (PPO), Twin Delayed Deep Deterministic Policy Gradient (TD3), and Soft Actor-Critic (SAC)—alongside a Model Predictive Control (MPC) baseline. We train our RL methods in a simplified AUV simulator implemented in PyTorch, while our evaluation is done in a realistic marine robotics simulator called Stonefish. Controllers are evaluated on the basis of tracking accuracy, robustness to disturbances, and generalization capabilities. Our results show that, MPC suffers from unmodeled dynamics such as disturbances, whereas RL demonstrates adaptation capabilities to disturbances. Also, although MPC demonstrates strong control performance, it requires an accurate model, high compute power and a careful implementation to run in real-time whereas the control frequency of RL policies is only bound by the inference time of the policy network. Among RL-based controllers, PPO achieves the best overall performance, both in terms of training stability and control accuracy. This study provides insight into the feasibility of RL-based controllers for AUV position control, offering guidance for selecting suitable control strategies in real-world marine robotics applications.

Original languageEnglish
Title of host publicationTowards Autonomous Robotic Systems - 26th Annual Conference, TAROS 2025, Proceedings
EditorsAna Cavalcanti, Simon Foster, Robert Richardson
PublisherSpringer Science and Business Media Deutschland GmbH
Pages381-394
Number of pages14
ISBN (Print)9783032014856
DOIs
Publication statusPublished - 2026
Event26th Annual Conference on Towards Autonomous Robotic Systems, TAROS 2025 - York, United Kingdom
Duration: 20 Aug 202522 Aug 2025

Publication series

NameLecture Notes in Computer Science
Volume16045 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference26th Annual Conference on Towards Autonomous Robotic Systems, TAROS 2025
Country/TerritoryUnited Kingdom
CityYork
Period20/08/2522/08/25

Keywords

  • Autonomous Underwater Vehicles
  • Optimal Control
  • Reinforcement Learning

Fingerprint

Dive into the research topics of 'Comparative Evaluation of Reinforcement Learning and Model Predictive Control for 6DoF Position Control of an Autonomous Underwater Vehicle'. Together they form a unique fingerprint.

Cite this