Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

A BI-Directional Deep Learning Interface for Gaze-Controlled Wheelchair Navigation: Overcoming the Midas Touch Problem

  • Gianni Bremer*
  • , Joseph McIntyre
  • , Je Hyung Jung
  • , Stefano Ellero
  • , Issa Mouawad
  • , Davide Di Gloria
  • , Markus Lappe
  • *Autor correspondiente de este trabajo
  • University of Münster
  • Stam S.r.l.

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

We present a gaze-based augmented reality control interface for electric wheelchairs, addressing the challenges faced by individuals with mobility impairments. The development transitions through three stages: model training with offline evaluation, Virtual Reality (VR) simulations, and physical deployment. First, we trained deep learning models, comparing Transformers and LSTMs, to predict locomotion intentions based on gaze data. While gaze predicts steering intentions well, it sometimes diverges from locomotion goals. To tackle this, we classify gaze movements as either indicative of locomotor intention or not. This novel approach addresses the Midas Touch Problem of gaze. Datasets were collected in controlled VR environments featuring different tasks. We find that data sets with tasks that encouraged diverse navigation and gaze behaviors enable strong generalization. The online VR simulation evaluation phase enabled safe and immersive testing, allowing the assessment of system performance and the integration of feedback for user guidance. Our approach provided smoother navigation control compared to traditional 'Where-You-Look-Is-Where-You-Go' methods. Feedback improved user ratings of the system. In the final stage, the system was deployed on a physical wheelchair equipped with an augmented reality (AR) device to provide feedback about the predictions to the user, allowing real-world evaluation. Despite differences in user behavior between VR and physical environments, the system successfully translated gaze inputs into precise and safe navigation commands. Users were able to steer the wheelchair solely using their eyes while simultaneously being able to look at destinations at the side of the path.

Idioma originalInglés
Título de la publicación alojadaProceedings - 2025 IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2025
EditoresUlrich Eck, Gun Lee, Alexander Plopski, Missie Smith, Qi Sun, Markus Tatzgern
EditorialInstitute of Electrical and Electronics Engineers Inc.
Páginas867-877
Número de páginas11
ISBN (versión digital)9798331587611
DOI
EstadoPublicada - 2025
Evento24th IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2025 - Daejeon, República de Corea
Duración: 8 oct 202512 oct 2025

Serie de la publicación

NombreProceedings - 2025 IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2025

Conferencia

Conferencia24th IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2025
País/TerritorioRepública de Corea
CiudadDaejeon
Período8/10/2512/10/25

Huella

Profundice en los temas de investigación de 'A BI-Directional Deep Learning Interface for Gaze-Controlled Wheelchair Navigation: Overcoming the Midas Touch Problem'. En conjunto forman una huella única.

Citar esto