- The paper introduces a framework defining active perception as the dynamic assessment of why, what, how, when, and where to sense.
- It surveys historical and modern methodologies, linking early robotics and perceptual theories to current AI applications.
- It analyzes computational challenges, including NP-completeness, underscoring the need for selective attention in perceptual processing.
Revisiting Active Perception
The paper, "Revisiting Active Perception," authored by Ruzena Bajcsy, Yiannis Aloimonos, and John K. Tsotsos, provides a comprehensive survey of the concept of active perception, highlighting its historical progression and current relevance in the fields of robotics, artificial intelligence, and computer vision. The authors posit that active perception remains essential for the development of complete artificial agents, particularly in conjunction with modern computational tools.
Active perception rests on the view that perceptual systems should not be passive recipients of sensory input; instead, they should actively seek out information relevant to their tasks. The paper chronicles the evolution of this perspective, from J.J. Gibson's pioneering work on affordances and optic flow, through developments such as the SHAKEY robot, to modern theories and applications in robotics.
The authors define active perception as the process by which an agent dynamically decides why it needs to sense, what to perceive, and how, when, and where to carry out these perceptual actions. This is encapsulated in the "active pentuple" framework, whose components are Why, What, How, When, and Where; the paper emphasizes how each component contributes to a cohesive, adaptive perception-action loop.
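To make the pentuple concrete, the following is a minimal Python sketch of how the five questions might be organized around a perception-action loop. It is illustrative only: `ActivePentuple`, `perception_action_step`, and the `state` dictionary are hypothetical names, not an API or formalism defined in the paper.

```python
from dataclasses import dataclass
from typing import Any, Callable

@dataclass
class ActivePentuple:
    """Illustrative container for the five questions an active perceiver must answer."""
    why: str                       # task goal motivating sensing (e.g., "locate the mug")
    what: str                      # entity or property to perceive (e.g., "mug pose")
    how: Callable[[Any], Any]      # sensing/processing routine to apply
    when: Callable[[dict], bool]   # predicate deciding whether sensing should fire now
    where: Callable[[dict], Any]   # policy choosing the next viewpoint or sensor placement

def perception_action_step(p: ActivePentuple, state: dict) -> dict:
    """One pass of a perception-action loop driven by the pentuple."""
    if p.when(state):                   # WHEN: is sensing warranted in this state?
        viewpoint = p.where(state)      # WHERE: pick where to sense from
        observation = p.how(viewpoint)  # HOW: execute the perceptual action
        state[p.what] = observation     # WHAT: record the sought quantity
    return state                        # WHY is carried by the task that owns the loop
```

The point of the sketch is the coupling: the decision of when and where to sense is conditioned on the agent's current state and task, rather than being fixed in advance.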
In particular, the discussion draws attention to the computational challenges of translating this active paradigm into practical systems. From a theoretical standpoint, the authors present a complexity analysis, noting that key perceptual tasks are NP-complete under various formulations. These results underscore the need for selective attention mechanisms in active perception systems to keep computational demands tractable.
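As a toy illustration of why attention matters computationally (not the paper's formal argument), the sketch below contrasts an unconstrained search over combinations of image regions, whose cost grows exponentially with the number of regions, with an attention-limited variant that prunes candidates to a fixed budget. Here `regions`, `saliency`, `fits_target`, and the budget value are hypothetical stand-ins.

```python
import itertools

def unbounded_visual_search(regions, fits_target):
    """Unconstrained search: test every combination of regions that could form the target.
    The candidate set grows exponentially in len(regions), mirroring the intractability result."""
    for k in range(1, len(regions) + 1):
        for candidate in itertools.combinations(regions, k):
            if fits_target(candidate):
                return candidate
    return None

def attended_visual_search(regions, saliency, fits_target, budget=8):
    """Attention-limited search: a saliency prior shortlists a fixed number of regions,
    trading completeness for tractable, task-directed processing."""
    shortlist = sorted(regions, key=saliency, reverse=True)[:budget]
    for k in range(1, len(shortlist) + 1):
        for candidate in itertools.combinations(shortlist, k):
            if fits_target(candidate):
                return candidate
    return None
```

The attended variant can miss targets the exhaustive one would find; the trade-off it makes explicit is exactly the one selective attention mechanisms are meant to manage.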
The paper reviews a variety of methodologies for implementing active perception, ranging from early experimental robotic setups that explored mechanical and sensor alignment to modern computational models employing attentional strategies to limit processing costs. The contrast between passive and active vision is examined in depth, with empirical evidence suggesting that situationally aware, task-directed sensing can markedly improve the efficiency and effectiveness of perception.
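One common way such situational awareness is operationalized in practice, though not a specific method prescribed by the paper, is greedy next-best-view selection: choose the viewpoint whose expected observation most reduces uncertainty about the current hypothesis. The sketch below assumes a discrete belief over hypotheses and a hypothetical `observation_model(viewpoint)` interface returning per-observation likelihoods.

```python
import math

def entropy(belief):
    """Shannon entropy (bits) of a discrete belief over hypotheses, e.g. candidate object poses."""
    return -sum(p * math.log(p, 2) for p in belief.values() if p > 0)

def bayes_update(belief, likelihoods):
    """Posterior belief after an observation with the given per-hypothesis likelihoods."""
    posterior = {h: belief[h] * likelihoods.get(h, 1e-9) for h in belief}
    z = sum(posterior.values()) or 1e-9
    return {h: p / z for h, p in posterior.items()}

def next_best_view(belief, viewpoints, observation_model):
    """Greedy active step: pick the viewpoint whose expected posterior entropy is lowest.
    observation_model(v) -> {observation: {hypothesis: likelihood}} for viewpoint v."""
    def expected_entropy(v):
        total = 0.0
        for _, likelihoods in observation_model(v).items():
            # Probability of this observation from v under the current belief.
            p_obs = sum(belief[h] * likelihoods.get(h, 0.0) for h in belief)
            if p_obs > 0:
                total += p_obs * entropy(bayes_update(belief, likelihoods))
        return total
    return min(viewpoints, key=expected_entropy)
```

A passive system, by contrast, would process whatever views arrive in a fixed order; the active loop spends its sensing budget where it is expected to be most informative.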
Several theoretical implications follow from this paper. By revisiting foundational ideas and integrating them with current technological capabilities, the authors argue that active perception can substantially shape the design of future AI systems. The discussion also invites speculation on the evolutionary trajectory of artificial perception, pointing toward humanoid robot companions and collaborative teams with diverse sensory and action affordances.
In summary, this paper presents a detailed and structured perspective on active perception, arguing for its necessity and untapped potential in creating intelligent, adaptable artificial agents. The synthesis of historical insights with contemporary research not only reinforces active perception's validity but also sets the stage for future innovations in the domain of AI and robotics.