Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Pupil: An Open Source Platform for Pervasive Eye Tracking and Mobile Gaze-based Interaction (1405.0006v1)

Published 30 Apr 2014 in cs.CV and cs.HC

Abstract: Commercial head-mounted eye trackers provide useful features to customers in industry and research but are expensive and rely on closed source hardware and software. This limits the application areas and use of mobile eye tracking to expert users and inhibits user-driven development, customisation, and extension. In this paper we present Pupil -- an accessible, affordable, and extensible open source platform for mobile eye tracking and gaze-based interaction. Pupil comprises 1) a light-weight headset with high-resolution cameras, 2) an open source software framework for mobile eye tracking, as well as 3) a graphical user interface (GUI) to playback and visualize video and gaze data. Pupil features high-resolution scene and eye cameras for monocular and binocular gaze estimation. The software and GUI are platform-independent and include state-of-the-art algorithms for real-time pupil detection and tracking, calibration, and accurate gaze estimation. Results of a performance evaluation show that Pupil can provide an average gaze estimation accuracy of 0.6 degree of visual angle (0.08 degree precision) with a latency of the processing pipeline of only 0.045 seconds.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Moritz Kassner (1 paper)
  2. William Patera (1 paper)
  3. Andreas Bulling (81 papers)
Citations (777)

Summary

  • The paper introduces Pupil, an open-source framework that leverages affordable hardware and versatile software to make eye tracking accessible for research.
  • The paper details a lightweight headset design featuring an infrared eye camera and high-resolution scene camera, achieving 0.6° accuracy and 0.08° precision.
  • The paper demonstrates low-latency performance with a 0.045s eye processing delay and a 0.124s overall system delay, ensuring real-time gaze-based interaction.

Pupil: An Open Source Platform for Pervasive Eye Tracking and Mobile Gaze-based Interaction

The paper "Pupil: An Open Source Platform for Pervasive Eye Tracking and Mobile Gaze-based Interaction" by Kassner et al. presents a comprehensive framework designed to advance eye tracking research through the deployment of a low-cost, accessible, and extensible solution. This essay provides an analytical synopsis of their work, emphasizing the technical merits, system architecture, performance metrics, and prospective contributions to the field.

Introduction

The technical landscape in eye tracking has evolved substantially since the early cumbersome devices of the 20th century. Today's head-mounted eye trackers offer advanced capabilities but remain largely proprietary and expensive, restricting broader use and innovation. The authors introduce Pupil, an open-source platform aimed at democratizing eye tracking technology. Pupil encompasses a portable headset, a versatile software framework, and an intuitive graphical user interface, catering to the needs of both novice users and advanced researchers.

System Architecture

Hardware Components

Pupil's hardware architecture includes a lightweight, ergonomically designed headset featuring high-resolution cameras tailored for eye tracking. Key components encompass:

  • Eye Camera: Infrared spectrum camera designed for dark pupil detection, boasting a resolution of 800x600 pixels at 30Hz.
  • Scene Camera: High-resolution (1920x1080 pixels) camera providing a 90-degree field of view to capture the user’s field of vision.

Standard consumer electronics (e.g., USB cameras) and rapid fabrication techniques (e.g., Selective Laser Sintering) ensure affordability and modularity, allowing for rapid iterations and user-driven customization.

Software Components

The Pupil software suite is platform-independent and includes:

  • Pupil Capture: Real-time capture and processing of video streams, pupil detection, gaze mapping, and frame recording.
  • Pupil Player: Tool for playback and visualization of recorded data.

Both components are equipped with state-of-the-art algorithms for accurate pupil detection and gaze estimation. The software leverages Python and libraries such as OpenCV, FFMPEG, and NumPy for robustness and extensibility.

Performance Evaluation

Spatial Accuracy and Precision

Pupil demonstrates superior accuracy and precision, essential metrics for effective eye tracking:

  • Accuracy: 0.6 degrees of visual angle.
  • Precision: 0.08 degrees of visual angle.

These metrics were obtained through a meticulous evaluation process involving multiple subjects and state-of-the-art algorithms. Pupil's detection algorithm shows competitive performance when benchmarked against other known algorithms, achieving a detection error of fewer than 2 pixels for the majority of test scenarios.

Temporal Characteristics

The authors pay substantial attention to temporal synchronization and system latency:

  • Eye Processing Pipeline Latency: 0.045 seconds.
  • Overall System Latency: 0.124 seconds.

These figures illustrate Pupil's efficiency in providing low-latency, real-time processing capabilities, which are critical for pervasive eye-based interactions.

Implications and Future Work

Pupil stands out due to its open-source nature, allowing researchers to extend and customize the platform for diverse applications ranging from cognitive psychology to human-computer interaction. The adoption of modular design principles ensures that both hardware and software can evolve, spurred by community-driven innovations.

Key limitations, such as parallax error and performance under IR-rich conditions, have been acknowledged. Future developments will focus on enhancing mobility, improving real-time pose tracking, simplifying user experience, and increasing tracking robustness in varying environmental conditions.

Conclusion

The work presented in this paper exemplifies a significant stride towards making sophisticated eye tracking technology accessible to a broader audience. By offering an open-source, affordable, and extensible solution, Pupil facilitates advancements across various fields while fostering a collaborative research ecosystem. The ongoing improvements and active development promise to continually enhance the functionality and application scope of Pupil, heralding a new era of pervasive, eye-based human-computer interaction research.