Unsupervised data labeling and incremental cross-domain training for enhanced hybrid eye gaze estimation

Abstract

This paper aims to advance the fields of unsupervised data labeling and incremental cross-domain training techniques. We apply these innovative methods to develop a model tailored for the Augmentative and Alternative Communication (AAC) application domain, introducing a new perspective in hybrid eye Gaze Estimation (GE). These hybrid eye GE models combine the generalization strengths of appearance-based models with the scene understanding capabilities inherent in geometrical reconstruction. The use of open eye tracking datasets for the AAC domain introduces domain shift, while accurately labeling gaze vectors is challenging without specialized hardware for proper 3D dimensional reconstruction. We propose an approach to solve this challenges by conducting standardized unsupervised gaze vector labeling across multiple open GE datasets and subsequently performing incremental training to adapt to the target domain. Using a proprietary dataset we were able to reduce the gaze error from 4.87º to 3.95º, compared to a traditional single-step training.

Publication
Proceedings of the 2024 Symposium on Eye Tracking Research and Applications, ETRA 2024, Glasgow, United Kingdom, June 4-7, 2024
Unai Elordi
Unai Elordi
Assistant Professor

My research interests include computer vision, pattern recognition, and artificial intelligence for intelligent video analytics.