
On the use of Cortical Magnification and Saccades as Biological Proxies for Data Augmentation (2112.07173v1)

Published 14 Dec 2021 in cs.CV, cs.AI, cs.NE, and q-bio.NC

Abstract: Self-supervised learning is a powerful way to learn useful representations from natural data. It has also been suggested as one possible means of building visual representations in humans, but the specific objective and algorithm are unknown. Currently, most self-supervised methods encourage the system to learn a representation that is invariant across different transformations of the same image, in contrast to those of other images. However, such transformations are generally not biologically plausible, and often consist of contrived perceptual schemes such as random cropping and color jittering. In this paper, we attempt to reverse-engineer these augmentations to be more biologically or perceptually plausible while still conferring the same benefits for encouraging robust representations. Critically, we find that random cropping can be substituted by cortical magnification, and that saccade-like sampling of the image can also assist representation learning. The feasibility of these transformations suggests a potential way that biological visual systems could implement self-supervision. Further, they break the widely accepted spatially-uniform processing assumption used in many computer vision algorithms, suggesting a role for spatially-adaptive computation in humans and machines alike. Our code and demo can be found here.
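
To make the two augmentations named in the abstract concrete, here is a minimal NumPy sketch of how they might be combined. It is not the authors' released code. It assumes a common log-type model of cortical magnification, in which cortical distance grows as ln(1 + e/e0) with eccentricity e, and it draws fixation points uniformly at random as a stand-in for saccades. The function names (`magnified_view`, `saccade_views`) and the parameters `e0`, `e_max`, and `out_size` are hypothetical and chosen only for illustration.

```python
import numpy as np


def magnified_view(img, fixation, out_size=64, e0=8.0, e_max=None):
    """Resample `img` around `fixation` (row, col) with a foveated warp.

    Assumed model (not from the paper): cortical distance c = ln(1 + e / e0),
    so an output pixel at normalized radius rho reads the input at
    eccentricity e = e0 * (exp(rho * c_max) - 1).
    """
    h, w = img.shape[:2]
    fy, fx = fixation
    if e_max is None:
        e_max = min(h, w) / 2.0

    # Output grid in [-1, 1] x [-1, 1], centered on the fixation point.
    lin = np.linspace(-1.0, 1.0, out_size)
    v, u = np.meshgrid(lin, lin, indexing="ij")
    rho = np.sqrt(u ** 2 + v ** 2)

    # Map output radius to input eccentricity (in pixels).
    c_max = np.log1p(e_max / e0)
    ecc = e0 * np.expm1(rho * c_max)

    # Keep the direction of each output pixel; guard the center (rho == 0).
    safe_rho = np.where(rho > 0, rho, 1.0)
    ys = fy + ecc * v / safe_rho
    xs = fx + ecc * u / safe_rho

    # Nearest-neighbor lookup; samples outside the image clamp to the border.
    yi = np.clip(np.rint(ys), 0, h - 1).astype(int)
    xi = np.clip(np.rint(xs), 0, w - 1).astype(int)
    return img[yi, xi]


def saccade_views(img, n_views=2, rng=None, **kwargs):
    """Return `n_views` foveated views of `img` at random fixation points."""
    rng = np.random.default_rng(rng)
    h, w = img.shape[:2]
    views = []
    for _ in range(n_views):
        fixation = (rng.uniform(0, h - 1), rng.uniform(0, w - 1))
        views.append(magnified_view(img, fixation, **kwargs))
    return views
```

In a contrastive setup, the views returned by `saccade_views` for one image would play the role that two random crops usually play, serving as the positive pair. Because the warp samples densely near the fixation point and sparsely in the periphery, each view is a spatially non-uniform rendering of the whole scene rather than a rectangular sub-window.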

Authors (5)
  1. Binxu Wang (10 papers)
  2. David Mayo (5 papers)
  3. Arturo Deza (13 papers)
  4. Andrei Barbu (35 papers)
  5. Colin Conwell (6 papers)
Citations (11)