- The paper presents a hybrid tracking framework that integrates asynchronous event data with synchronous frame data to achieve robust feature tracking in dynamic conditions.
- The method minimizes a photometric error that aligns temporally dense event data with frame data, yielding precise feature localization and alignment.
- Experiments demonstrate improved tracking accuracy and low-latency operation, particularly under fast motion and changing illumination.
Asynchronous, Photometric Feature Tracking using Events and Frames
The paper "Asynchronous, Photometric Feature Tracking using Events and Frames" by Daniel Gehrig, Henri Rebecq, Guillermo Gallego, and Davide Scaramuzza explores an innovative approach in computer vision, leveraging the unique capabilities of event cameras for feature tracking. This research presents a solution that integrates both asynchronous event data and more traditional synchronous frame data to achieve robust feature tracking, aiming to exploit the advantages of each sensory input.
Overview
Event cameras, such as the Dynamic Vision Sensor (DVS), differ fundamentally from conventional cameras: rather than capturing full frames at a fixed rate, each pixel asynchronously reports changes in brightness with microsecond-level latency. This yields low latency, high dynamic range, and minimal motion blur, making event cameras a compelling choice for high-speed and dynamic environments.
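To make this data model concrete, here is a minimal sketch of how event-camera output is commonly represented and aggregated. The field names and the fixed contrast threshold are illustrative assumptions, not the paper's code.

```python
import numpy as np
from dataclasses import dataclass

@dataclass
class Event:
    """A single event: pixel location, timestamp, and polarity."""
    x: int
    y: int
    t: float       # timestamp in seconds, with microsecond resolution
    polarity: int  # +1 for a brightness increase, -1 for a decrease

def accumulate_events(events, height, width, contrast_threshold=0.1):
    """Sum signed events per pixel into a brightness-increment image.

    Each event indicates that log intensity changed by roughly +/- the
    contrast threshold, so the accumulated image approximates the total
    log-brightness change over the event window.
    """
    delta_L = np.zeros((height, width))
    for e in events:
        delta_L[e.y, e.x] += e.polarity * contrast_threshold
    return delta_L
```

Because events carry microsecond timestamps, the accumulation window can be chosen adaptively, for example a fixed number of events per feature, rather than being tied to a fixed frame rate.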
The proposed method combines the strengths of event-based and frame-based vision in a novel feature tracking framework. This framework is particularly advantageous where purely frame-based systems struggle, such as rapid-motion scenarios and scenes with significant lighting variations.
Key Contributions
The primary contributions of this paper are as follows:
- Hybrid Tracking Framework: The authors introduce a tracking algorithm that integrates both events and frames, offering a robust solution that better handles complex and dynamic visual environments compared to purely frame-based or event-based methods.
- Photometric Error Minimization: By minimizing a photometric error, the method aligns temporally dense event data with frame data, improving the accuracy and reliability of feature tracking (a sketch of this error follows the list).
- Asynchronous Processing: The methodology emphasizes asynchronous processing, taking full advantage of the high temporal resolution of event cameras to provide low-latency feature tracking.
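To illustrate the photometric error being minimized, the sketch below compares the brightness increment accumulated from events against the increment predicted from a frame's spatial gradient under the linearized event generation model, ΔL ≈ -∇L · v Δt. The single global flow vector and the function names are simplifying assumptions for illustration, not the authors' implementation.

```python
import numpy as np

def predicted_increment(log_frame, flow, dt):
    """Predict the brightness-increment image from a frame.

    Linearized event generation model: the change in log intensity at
    a pixel over a short interval dt is approximately -grad(L) . v * dt,
    where v is the optic flow. A single global flow vector is assumed
    here for simplicity.
    """
    gy, gx = np.gradient(log_frame)  # spatial gradients of log intensity
    return -(gx * flow[0] + gy * flow[1]) * dt

def photometric_error(event_increment, log_frame, flow, dt):
    """Sum of squared residuals between observed and predicted increments."""
    r = event_increment - predicted_increment(log_frame, flow, dt)
    return np.sum(r ** 2)
```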
Methodology and Results
The authors detail an algorithm that detects features on the frames and then tracks them asynchronously using the event stream. They employ a photometric alignment strategy that minimizes the residual between the brightness changes predicted from the frame and those actually observed as events, leading to precise feature tracking even in challenging conditions such as fast motion or abrupt lighting changes. A simplified version of this alignment is sketched below.
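As a rough, self-contained illustration of the alignment step, the following sketch estimates a 2D flow vector for one feature patch by least squares on the residual described above. The real method jointly optimizes a warp and flow per patch with a normalized objective, so treat this as a simplified stand-in, with hypothetical names.

```python
import numpy as np

def estimate_patch_flow(event_increment, log_patch, dt, n_iters=5):
    """Estimate a 2D flow vector for one patch by minimizing the
    photometric residual || event_increment - (-grad(L) . v * dt) ||^2.

    The model is linear in v, so a single least-squares solve would
    suffice; the loop mirrors the iterative optimizers needed when a
    full warp is estimated jointly with the flow.
    """
    gy, gx = np.gradient(log_patch)
    # Jacobian of the predicted increment with respect to (vx, vy)
    J = np.stack([-gx.ravel() * dt, -gy.ravel() * dt], axis=1)
    flow = np.zeros(2)
    for _ in range(n_iters):
        residual = event_increment.ravel() - J @ flow
        step, *_ = np.linalg.lstsq(J, residual, rcond=None)
        if np.linalg.norm(step) < 1e-9:
            break
        flow += step
    return flow
```

In an asynchronous tracker, a solve like this would be re-run for a patch whenever a fresh batch of events falls inside it, rather than at fixed frame intervals.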
Experimentally, the system demonstrates significant improvements over traditional frame-based tracking methods, particularly in dynamic scenes. Its robustness, in terms of alignment accuracy and tracking speed, is quantified through evaluations on several datasets.
Implications and Future Work
This work has implications for both theory and practice in real-time video processing and robotics. The successful integration of event-driven and frame-based vision highlights the potential for robust, adaptive computer vision systems that operate effectively under conditions that challenge traditional methods.
Future research may extend this hybrid approach to computer vision tasks beyond feature tracking, such as object recognition or scene reconstruction. Optimizing computational efficiency and exploring deeper integration with machine learning models could further enhance the effectiveness and applicability of event-frame hybrid systems, and continued development in this area may inspire architectures capable of even more sophisticated real-time visual processing.