Leveraging Driver Attention for an End-To-End Explainable Decision-Making from Frontal Images

  • Javier Araluce*
  • , Luis M. Bergasa
  • , Manuel Ocana
  • , Angel Llamazares
  • , Elena Lopez-Guillen
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)

Abstract

Explaining the decision made by end-To-end autonomous driving is a difficult task. These approaches take raw sensor data and compute the decision as a black box with large deep learning models. Understanding the output of deep learning is a complex challenge due to the complicated nature of explainability; as data passes through the network, it becomes untraceable, making it difficult to understand. Explainability increases confidence in the decision by making the black box that drives the vehicle transparent to the user inside. Achieving a Level 5 autonomous vehicle necessitates the resolution of that challenging task. In this work, we propose a model that leverages the driver's attention to obtain explainable decisions based on an attention map and the scene context. Our novel architecture addresses the task of obtaining a decision and its explanation from a single RGB sequence of the driving scene ahead. We base this architecture on the Transformer architecture with some efficiency tricks in order to use it at a reasonable frame rate. Moreover, we integrate in this proposal our previous ARAGAN model, which obtains SOTA attention maps, to improve the performance of the model thanks to understand the sequence as a human does. We train and validate our proposal on the BDD-OIA dataset, achieving on-pair results or even better than other state-of-The-Art methods. Additionally, we present a simulation-based proof of concept demonstrating the model's performance as a copilot in a close-loop vehicle to driver interaction.

Original languageEnglish
Pages (from-to)10091-10102
Number of pages12
JournalIEEE Transactions on Intelligent Transportation Systems
Volume25
Issue number8
DOIs
Publication statusPublished - 2024
Externally publishedYes

Keywords

  • Driver attention
  • decision-making
  • deep learning
  • explainability
  • self-driving

Fingerprint

Dive into the research topics of 'Leveraging Driver Attention for an End-To-End Explainable Decision-Making from Frontal Images'. Together they form a unique fingerprint.

Cite this