Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Leveraging Driver Attention for an End-To-End Explainable Decision-Making from Frontal Images

  • Javier Araluce*
  • , Luis M. Bergasa
  • , Manuel Ocana
  • , Angel Llamazares
  • , Elena Lopez-Guillen
  • *Autor correspondiente de este trabajo
  • University of Alcalá

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

11 Citas (Scopus)

Resumen

Explaining the decision made by end-To-end autonomous driving is a difficult task. These approaches take raw sensor data and compute the decision as a black box with large deep learning models. Understanding the output of deep learning is a complex challenge due to the complicated nature of explainability; as data passes through the network, it becomes untraceable, making it difficult to understand. Explainability increases confidence in the decision by making the black box that drives the vehicle transparent to the user inside. Achieving a Level 5 autonomous vehicle necessitates the resolution of that challenging task. In this work, we propose a model that leverages the driver's attention to obtain explainable decisions based on an attention map and the scene context. Our novel architecture addresses the task of obtaining a decision and its explanation from a single RGB sequence of the driving scene ahead. We base this architecture on the Transformer architecture with some efficiency tricks in order to use it at a reasonable frame rate. Moreover, we integrate in this proposal our previous ARAGAN model, which obtains SOTA attention maps, to improve the performance of the model thanks to understand the sequence as a human does. We train and validate our proposal on the BDD-OIA dataset, achieving on-pair results or even better than other state-of-The-Art methods. Additionally, we present a simulation-based proof of concept demonstrating the model's performance as a copilot in a close-loop vehicle to driver interaction.

Idioma originalInglés
Páginas (desde-hasta)10091-10102
Número de páginas12
PublicaciónIEEE Transactions on Intelligent Transportation Systems
Volumen25
N.º8
DOI
EstadoPublicada - 2024
Publicado de forma externa

Huella

Profundice en los temas de investigación de 'Leveraging Driver Attention for an End-To-End Explainable Decision-Making from Frontal Images'. En conjunto forman una huella única.

Citar esto