An Open-World, Diverse, Cross-Spatial-Temporal Benchmark for Dynamic Wild Person Re-Identification (2403.15119v1)
Abstract: Person re-identification (ReID) has made great strides thanks to the data-driven deep learning techniques. However, the existing benchmark datasets lack diversity, and models trained on these data cannot generalize well to dynamic wild scenarios. To meet the goal of improving the explicit generalization of ReID models, we develop a new Open-World, Diverse, Cross-Spatial-Temporal dataset named OWD with several distinct features. 1) Diverse collection scenes: multiple independent open-world and highly dynamic collecting scenes, including streets, intersections, shopping malls, etc. 2) Diverse lighting variations: long time spans from daytime to nighttime with abundant illumination changes. 3) Diverse person status: multiple camera networks in all seasons with normal/adverse weather conditions and diverse pedestrian appearances (e.g., clothes, personal belongings, poses, etc.). 4) Protected privacy: invisible faces for privacy critical applications. To improve the implicit generalization of ReID, we further propose a Latent Domain Expansion (LDE) method to develop the potential of source data, which decouples discriminative identity-relevant and trustworthy domain-relevant features and implicitly enforces domain-randomized identity feature space expansion with richer domain diversity to facilitate domain invariant representations. Our comprehensive evaluations with most benchmark datasets in the community are crucial for progress, although this work is far from the grand goal toward open-world and dynamic wild applications.
- Y. Bai, J. Jiao, W. Ce, J. Liu, Y. Lou, X. Feng, and L.-Y. Duan, “Person30k: A dual-meta generalization network for person re-identification,” in CVPR, 2021, pp. 2123–2132.
- S. Bak, P. Carr, and J.-F. Lalonde, “Domain adaptation through synthesis for unsupervised person re-identification,” in ECCV, 2018, pp. 189–205.
- D. Baltieri, R. Vezzani, and R. Cucchiara, “3dpes: 3d people dataset for surveillance and forensics,” in ACM workshop on Human gesture and behavior understanding, 2011, pp. 59–64.
- I. B. Barbosa, M. Cristani, B. Caputo, A. Rognhaugen, and T. Theoharis, “Looking beyond appearances: Synthetic training data for deep cnns in re-identification,” Computer Vision and Image Understanding, vol. 167, pp. 50–62, 2018.
- G. Blanchard, G. Lee, and C. Scott, “Generalizing from several related classification tasks to a new unlabeled sample,” in NeurIPS, vol. 24, 2011, pp. 2178–2186.
- P. Chen, P. Dai, J. Liu, F. Zheng, M. Xu, Q. Tian, and R. Ji, “Dual distribution alignment network for generalizable person re-identification,” in AAAI, vol. 35, no. 2, 2021, pp. 1054–1062.
- X. Chen, C. Fu, Y. Zhao, F. Zheng, J. Song, R. Ji, and Y. Yang, “Salience-guided cascaded suppression network for person re-identification,” in CVPR, 2020, pp. 3300–3310.
- D. S. Cheng, M. Cristani, M. Stoppa, L. Bazzani, and V. Murino, “Custom pictorial structures for re-identification.” in BMVC, vol. 1, no. 2. Citeseer, 2011, p. 6.
- S. Choi, T. Kim, M. Jeong, H. Park, and C. Kim, “Meta batch-instance normalization for generalizable person re-identification,” in CVPR, 2021, pp. 3425–3435.
- Y. Dai, X. Li, J. Liu, Z. Tong, and L.-Y. Duan, “Generalizable person re-identification with relevance-aware mixture of experts,” in CVPR, 2021, pp. 16 145–16 154.
- D. Fu, D. Chen, J. Bao, H. Yang, L. Yuan, L. Zhang, H. Li, and D. Chen, “Unsupervised pre-training for person re-identification,” in CVPR, 2021, pp. 14 750–14 759.
- D. Gray and H. Tao, “Viewpoint invariant pedestrian recognition with an ensemble of localized features,” in ECCV, 2008.
- K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in CVPR, 2016, pp. 770–778.
- L. He, X. Liao, W. Liu, X. Liu, P. Cheng, and T. Mei, “Fastreid: a pytorch toolbox for real-world person re-identification,” arXiv preprint arXiv:2006.02631, vol. 1, no. 6, 2020.
- A. Hermans, L. Beyer, and B. Leibe, “In defense of the triplet loss for person re-identification,” arXiv preprint arXiv:1703.07737, 2017.
- M. Hirzer, C. Beleznai, P. M. Roth, and H. Bischof, “Person re-identification by descriptive and discriminative classification,” in Scandinavian conference on Image analysis, 2011, pp. 91–102.
- J. Hu, L. Shen, and G. Sun, “Squeeze-and-excitation networks,” in CVPR, 2018, pp. 7132–7141.
- Q. Huang, Y. Xiong, and D. Lin, “Unifying identification and context learning for person recognition,” in CVPR, 2018, pp. 2217–2225.
- Y. Huang, Q. Wu, J. Xu, and Y. Zhong, “Celebrities-reid: A benchmark for clothes variation in long-term person re-identification,” in IJCNN. IEEE, 2019, pp. 1–8.
- Y. Huang, J. Xu, Q. Wu, Y. Zhong, P. Zhang, and Z. Zhang, “Beyond scalar neuron: Adopting vector-neuron capsules for long-term person re-identification,” IEEE TCSVT, vol. 30, no. 10, pp. 3459–3471, 2019.
- Y. Huang, X. Fu, L. Li, and Z.-J. Zha, “Learning degradation-invariant representation for robust real-world person re-identification,” International Journal of Computer Vision, vol. 130, no. 11, pp. 2770–2796, 2022.
- J. Jia, Q. Ruan, and T. M. Hospedales, “Frustratingly easy person re-identification: Generalizing person re-id in practice,” arXiv preprint arXiv:1905.03422, 2019.
- X. Jin, C. Lan, W. Zeng, Z. Chen, and L. Zhang, “Style normalization and restitution for generalizable person re-identification,” in CVPR, 2020, pp. 3140–3149.
- ——, “Style normalization and restitution for generalizable person re-identification,” in CVPR, 2020, pp. 3143–3152.
- D. Li, Y. Yang, Y.-Z. Song, and T. M. Hospedales, “Learning to generalize: Meta-learning for domain generalization,” in AAAI, vol. 32, no. 1, 2018.
- P. Li, D. Li, W. Li, S. Gong, Y. Fu, and T. M. Hospedales, “A simple feature augmentation for domain generalization,” in ICCV, 2021, pp. 8886–8895.
- W. Li, R. Zhao, T. Xiao, and X. Wang, “Deepreid: Deep filter pairing neural network for person re-identification,” in CVPR, 2014, pp. 152–159.
- W. Li and X. Wang, “Locally aligned feature transforms across views,” in CVPR, 2013, pp. 3594–3601.
- W. Li, R. Zhao, and X. Wang, “Human reidentification with transferred metric learning,” in ACCV, 2012, pp. 31–44.
- W. Li, X. Zhu, and S. Gong, “Scalable person re-identification by harmonious attention,” International Journal of Computer Vision, vol. 128, no. 6, pp. 1635–1653, 2020.
- Y. Li, J. Song, H. Ni, and H. T. Shen, “Style-controllable generalized person re-identification,” in ACMMM, 2023, pp. 7912–7921.
- S. Liao and L. Shao, “Interpretable and generalizable person re-identification with query-adaptive convolution and temporal lifting,” in ECCV. Springer, 2020, pp. 456–474.
- ——, “Graph sampling based deep metric learning for generalizable person re-identification,” in CVPR, 2022, pp. 7359–7368.
- C. C. Loy, C. Liu, and S. Gong, “Person re-identification by manifold ranking,” in ICIP, 2013, pp. 3567–3571.
- H. Luo, Y. Gu, X. Liao, S. Lai, and W. Jiang, “Bag of tricks and a strong baseline for deep person re-identification,” in CVPRW, 2019, pp. 0–0.
- P. Luo, R. Zhang, J. Ren, Z. Peng, and J. Li, “Switchable normalization for learning-to-normalize deep representation,” IEEE TPAMI, vol. 43, no. 2, pp. 712–728, 2019.
- L. Ma, H. Liu, L. Hu, C. Wang, and Q. Sun, “Orientation driven bag of appearances for person re-identification,” arXiv preprint arXiv:1605.02464, 2016.
- N. Martinel and C. Micheloni, “Re-identify people in wide area camera network,” in CVPRW. IEEE, 2012, pp. 31–36.
- H. Ni, J. Song, X. Luo, F. Zheng, W. Li, and H. T. Shen, “Meta distribution alignment for generalizable person re-identification,” in CVPR, 2022, pp. 2477–2486.
- A. Nichol, J. Achiam, and J. Schulman, “On first-order meta-learning algorithms,” arXiv preprint arXiv:1803.02999, 2018.
- X. Pan, P. Luo, J. Shi, and X. Tang, “Two at once: Enhancing learning and generalization capacities via ibn-net,” in ECCV, 2018.
- X. Qian, W. Wang, L. Zhang, F. Zhu, Y. Fu, T. Xiang, Y.-G. Jiang, and X. Xue, “Long-term cloth-changing person re-identification,” in ACCV, 2020.
- S. Qiao, C. Liu, W. Shen, and A. L. Yuille, “Few-shot image recognition by predicting parameters from activations,” in CVPR, 2018, pp. 7229–7238.
- Y. Rao, J. Lu, and J. Zhou, “Learning discriminative aggregation network for video-based face recognition and person re-identification,” International Journal of Computer Vision, vol. 127, pp. 701–718, 2019.
- O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein et al., “Imagenet large scale visual recognition challenge,” International Journal of Computer Vision, vol. 115, no. 3, pp. 211–252, 2015.
- R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, “Grad-cam: Visual explanations from deep networks via gradient-based localization,” in ICCV, 2017, pp. 618–626.
- S. Shankar, V. Piratla, S. Chakrabarti, S. Chaudhuri, P. Jyothi, and S. Sarawagi, “Generalizing across domains via cross-gradient training,” arXiv preprint arXiv:1804.10745, 2018.
- X. Shu, X. Wang, X. Zang, S. Zhang, Y. Chen, G. Li, and Q. Tian, “Large-scale spatio-temporal person re-identification: Algorithms and benchmark,” IEEE TCSVT, vol. 32, no. 7, pp. 4390–4403, 2021.
- Y. Shu, Z. Cao, C. Wang, J. Wang, and M. Long, “Open domain generalization with domain-augmented meta-learning,” in CVPR, 2021, pp. 9624–9633.
- J. Song, Y. Yang, Y.-Z. Song, T. Xiang, and T. M. Hospedales, “Generalizable person re-identification by domain-invariant mapping network,” in CVPR, 2019, pp. 719–728.
- X. Sun and L. Zheng, “Dissecting person re-identification from the viewpoint of viewpoint,” in CVPR, 2019, pp. 608–617.
- Y. Sun, L. Zheng, Y. Yang, Q. Tian, and S. Wang, “Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline),” in ECCV, 2018, pp. 480–496.
- M. Tamura and T. Murakami, “Augmented hard example mining for generalizable person re-identification,” arXiv preprint arXiv:1910.05280, 2019.
- L. Van der Maaten and G. Hinton, “Visualizing data using t-sne.” Journal of Machine Learning Research, vol. 9, no. 11, 2008.
- G. Wang, G. Wang, X. Zhang, J. Lai, Z. Yu, and L. Lin, “Weakly supervised person re-id: Differentiable graphical learning and a new benchmark,” IEEE TNNLS, vol. 32, no. 5, pp. 2142–2156, 2020.
- G. Wang, Y. Yuan, X. Chen, J. Li, and X. Zhou, “Learning discriminative features with multiple granularities for person re-identification,” in ACMMM, 2018, pp. 274–282.
- Y. Wang, X. Liang, and S. Liao, “Cloning outfits from real-world images to 3d characters for generalizable person re-identification,” in CVPR, 2022, pp. 4900–4909.
- Y. Wang, S. Liao, and L. Shao, “Surpassing real-world source training data: Random 3d characters for generalizable person re-identification,” in ACMMM, 2020, pp. 3422–3430.
- Y. Wang, X. Pan, S. Song, H. Zhang, G. Huang, and C. Wu, “Implicit semantic data augmentation for deep networks,” in NeurIPS, vol. 32, 2019.
- L. Wei, S. Zhang, W. Gao, and Q. Tian, “Person transfer gan to bridge domain gap for person re-identification,” in CVPR, 2018, pp. 79–88.
- S. Woo, J. Park, J. Y. Lee, and I. S. Kweon, “Cbam: Convolutional block attention module,” in ECCV, 2018, pp. 3–19.
- T. Xiao, S. Li, B. Wang, L. Lin, and X. Wang, “Joint detection and identification feature learning for person search,” in CVPR, 2017, pp. 3415–3424.
- P. Xu and X. Zhu, “Deepchange: A long-term person re-identification benchmark with clothes change,” in ICCV, 2023, pp. 11 196–11 205.
- Q. Yang, A. Wu, and W.-S. Zheng, “Person re-identification by contour sketch under moderate clothing change,” IEEE TPAMI, vol. 43, no. 6, pp. 2029–2046, 2019.
- M. Ye, J. Shen, G. Lin, T. Xiang, L. Shao, and S. C. H. Hoi, “Deep learning for person re-identification: A survey and outlook,” arXiv preprint arXiv:2001.04193, 2020.
- J. Yin, A. Wu, and W.-S. Zheng, “Fine-grained person re-identification,” International Journal of Computer Vision, vol. 128, pp. 1654–1672, 2020.
- Y. Zhai, P. Peng, M. Jia, S. Li, W. Chen, X. Gao, and Y. Tian, “Population-based evolutionary gaming for unsupervised person re-identification,” International Journal of Computer Vision, vol. 131, no. 1, pp. 1–25, 2023.
- J. Zhang, Y. Yuan, and Q. Wang, “Night person re-identification and a benchmark,” IEEE Access, vol. PP, no. 99, pp. 1–1, 2019.
- L. Zhang, Z. Deng, K. Kawaguchi, A. Ghorbani, and J. Zou, “How does mixup help with robustness and generalization?” arXiv preprint arXiv:2010.04819, 2020.
- N. Zhang, M. Paluri, Y. Taigman, R. Fergus, and L. Bourdev, “Beyond frontal faces: Improving person recognition using multiple cues,” in CVPR, 2015, pp. 4804–4813.
- T. Zhang, L. Xie, L. Wei, Z. Zhuang, Y. Zhang, B. Li, and Q. Tian, “Unrealperson: An adaptive pipeline towards costless person re-identification,” in CVPR, 2021, pp. 11 506–11 515.
- X. Zhang, H. Luo, X. Fan, W. Xiang, Y. Sun, Q. Xiao, W. Jiang, C. Zhang, and J. Sun, “Alignedreid: Surpassing human-level performance in person re-identification,” arXiv preprint arXiv:1711.08184, 2017.
- Z. Zhang, C. Lan, W. Zeng, X. Jin, and Z. Chen, “Relation-aware global attention for person re-identification,” in CVPR, 2020, pp. 3186–3195.
- J. Zhao, Y. Zhao, X. Chen, and J. Li, “Revisiting stochastic learning for generalizable person re-identification,” in ACMMM, 2022, pp. 1758–1768.
- Y. Zhao, Z. Zhong, F. Yang, Z. Luo, Y. Lin, S. Li, and S. Nicu, “Learning to generalize unseen domains via memory-based multi-source meta-learning for person re-identification,” in CVPR, 2021, pp. 6277–6286.
- L. Zheng, L. Shen, L. Tian, S. Wang, J. Wang, and Q. Tian, “Scalable person re-identification: A benchmark,” in ICCV, 2015, pp. 1116–1124.
- L. Zheng, H. Zhang, S. Sun, M. Chandraker, Y. Yang, and Q. Tian, “Person re-identification in the wild,” in CVPR, 2017, pp. 1367–1376.
- M. Zheng, S. Karanam, and R. J. Radke, “Rpifield: A new dataset for temporally evaluating person re-identification,” in CVPRW, 2018, pp. 1893–1895.
- W.-S. Zheng, S. Gong, and T. Xiang, “Associating groups of people.” in BMVC, vol. 2, no. 6, 2009, pp. 1–11.
- Z. Zheng, L. Zheng, and Y. Yang, “Unlabeled samples generated by gan improve the person re-identification baseline in vitro,” in ICCV, 2017, pp. 3754–3762.
- K. Zhou, Y. Yang, A. Cavallaro, and T. Xiang, “Learning generalisable omni-scale representations for person re-identification,” IEEE TPAMI, vol. 44, no. 9, pp. 5056–5069, 2021.
- K. Zhou, Y. Yang, Y. Qiao, and T. Xiang, “Domain generalization with mixstyle,” arXiv preprint arXiv:2104.02008, 2021.
- X. Zhu, X. Zhu, M. Li, V. Murino, and S. Gong, “Intra-camera supervised person re-identification: A new benchmark,” in ICCVW, 2019, pp. 0–0.
- Z. Zhuang, L. Wei, L. Xie, T. Zhang, H. Zhang, H. Wu, H. Ai, and Q. Tian, “Rethinking the distribution gap of person re-identification with camera-based batch normalization,” in ECCV. Springer, 2020, pp. 140–157.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Collections
Sign up for free to add this paper to one or more collections.