Advancing towards Safe Reinforcement Learning over Sparse Environments with Out-of-Distribution Observations: Detection and Adaptation Strategies

Aitor Martinez-Seras*, Alain Andres, Javier Del Ser

*Autor correspondiente de este trabajo

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

Safety in AI-based systems is among the highest research priorities, particularly when such systems are deployed in real-world scenarios subject to uncertainties and unpredictable inputs. Among them, the presence of long-tailed stimuli (Out-of-Distribution data, OoD) has captured much interest in recent times, giving rise to many proposals over the years to detect unfamiliar inputs to the model and adapt its knowledge accordingly. This work analyzes several OoD detection and adaptation strategies for Reinforcement Learning agents over environments with sparse reward signals. The sparsity of rewards and the impact of OoD objects on the state transition distribution learned by the agent are shown to be crucial when it comes to the design of effective knowledge transfer methods once OoD objects are detected. Furthermore, different approaches to detect OoD elements within the observation of the agent are also outlined, stressing on their benefits and potential downsides. Experiments with procedurally generated environments are performed to assess the performance of the considered OoD detection techniques, and to gauge the impact of the adaptation strategies on the generalization capability of the RL agent. The results pave the way towards further research around the provision of safety guarantees in sparse open-world Reinforcement Learning environments.

Idioma originalInglés
Título de la publicación alojada2024 International Joint Conference on Neural Networks, IJCNN 2024 - Proceedings
EditorialInstitute of Electrical and Electronics Engineers Inc.
ISBN (versión digital)9798350359312
DOI
EstadoPublicada - 2024
Evento2024 International Joint Conference on Neural Networks, IJCNN 2024 - Yokohama, Japón
Duración: 30 jun 20245 jul 2024

Serie de la publicación

NombreProceedings of the International Joint Conference on Neural Networks

Conferencia

Conferencia2024 International Joint Conference on Neural Networks, IJCNN 2024
País/TerritorioJapón
CiudadYokohama
Período30/06/245/07/24

Huella

Profundice en los temas de investigación de 'Advancing towards Safe Reinforcement Learning over Sparse Environments with Out-of-Distribution Observations: Detection and Adaptation Strategies'. En conjunto forman una huella única.

Citar esto