Dice Question Streamline Icon: https://streamlinehq.com

Duplication across neuroscience data repositories

Determine the extent of duplication across major neuroscience data repositories (e.g., OpenNeuro, DANDI, Brain Image Library, EBRAINS, FigShare, Allen Institute, Zenodo) to improve data accessibility, integration, and effective use in brain emulation research.

Information Square Streamline Icon: https://streamlinehq.com

Background

The data management section highlights ongoing fragmentation and heterogeneity across repositories, impeding standardized access and integration.

Quantifying duplication would help streamline data discovery, reduce redundant storage, and enable more comprehensive, consistent training and validation datasets for emulation efforts.

References

It is unclear how much duplication exists across repositories.

State of Brain Emulation Report 2025 (2510.15745 - Zanichelli et al., 17 Oct 2025) in Data management: storage, standardization, analysis