Dice Question Streamline Icon: https://streamlinehq.com

Extending the dual-stream approach to other Gestalt relations

Ascertain whether the dual-stream recurrent neural network that combines glimpse contents and gaze positions to learn spatial structure for numerosity can be extended to account for human judgments of Gestalt relations—including proximity, similarity, enclosure, symmetry, and continuity—when the objects are wholly novel.

Information Square Streamline Icon: https://streamlinehq.com

Background

The model is proposed as a computational instantiation of a dorsal–ventral factorization of scene structure and content, respectively. It successfully captures enumeration behavior and neural coding motifs associated with posterior parietal cortex, supporting the view that integrating what and where enables zero-shot structural reasoning.

The authors explicitly note that general relational perception in vision encompasses a broader set of Gestalt principles. Whether the same dual-stream mechanism can generalize beyond numerosity to these relations, particularly for novel objects and configurations, remains an open question.

References

Another question is whether our approach extends to other Gestalt principles, and can be deployed to explain the human ability to judge relations of proximity, similarity, enclosure, symmetry and continuity with wholly novel objects.

Zero-shot counting with a dual-stream neural network model (2405.09953 - Thompson et al., 16 May 2024) in Discussion, final paragraph