Papers
Topics
Authors
Recent
2000 character limit reached

Diverse Deep Feature Ensemble Learning for Omni-Domain Generalized Person Re-identification (2410.08460v1)

Published 11 Oct 2024 in cs.CV

Abstract: Person Re-identification (Person ReID) has progressed to a level where single-domain supervised Person ReID performance has saturated. However, such methods experience a significant drop in performance when trained and tested across different datasets, motivating the development of domain generalization techniques. However, our research reveals that domain generalization methods significantly underperform single-domain supervised methods on single dataset benchmarks. An ideal Person ReID method should be effective regardless of the number of domains involved, and when test domain data is available for training it should perform as well as state-of-the-art (SOTA) fully supervised methods. This is a paradigm that we call Omni-Domain Generalization Person ReID (ODG-ReID). We propose a way to achieve ODG-ReID by creating deep feature diversity with self-ensembles. Our method, Diverse Deep Feature Ensemble Learning (D2FEL), deploys unique instance normalization patterns that generate multiple diverse views and recombines these views into a compact encoding. To the best of our knowledge, our work is one of few to consider omni-domain generalization in Person ReID, and we advance the study of applying feature ensembles in Person ReID. D2FEL significantly improves and matches the SOTA performance for major domain generalization and single-domain supervised benchmarks.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (40)
  1. DEX: Domain Embedding Expansion for Generalized Person Re-identification. In The 32nd British Machine Vision Conference. 14.
  2. Beyond triplet loss: A deep quadruplet network for person re-identification. In Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Vol. 2017-Janua. 1320–1329. https://doi.org/10.1109/CVPR.2017.145 arXiv:1704.01719
  3. Meta Batch-Instance Normalization for Generalizable Person Re-Identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
  4. Generalizable Person Re-identification with Relevance-aware Mixture of Experts. In 2021 Conference on Computer Vision and Pattern Recognition (CVPR). arXiv:2105.09156
  5. Sanjoy Dasgupta and Anupam Gupta. 2002. An Elementary Proof of a Theorem of Johnson and Lindenstrauss. In Random Structures and Algorithms. 1–10. https://doi.org/10.1002/rsa.10073
  6. ImageNet: A large-scale hierarchical image database. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 248–255.
  7. Karl Pearson F.R.S. 1901. LIII. On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 2, 11 (1901), 559–572. https://doi.org/10.1080/14786440109462720
  8. Deep residual learning for image recognition. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 2016-Decem. 770–778. arXiv:1512.03385
  9. In Defense of the Triplet Loss for Person Re-Identification. In arXiv preprint. arXiv:1703.07737 http://arxiv.org/abs/1703.07737
  10. Harold Hotelling. 1933. Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology 24 (1933), 498–520. https://api.semanticscholar.org/CorpusID:144828484
  11. Multi-pseudo regularized label for generated data in person re-identification. IEEE Transactions on Image Processing 28, 3 (2019), 1391–1403. https://doi.org/10.1109/TIP.2018.2874715 arXiv:1801.06742
  12. Frustratingly easy person re-identification: Generalizing person Re-ID in practice. In 30th British Machine Vision Conference 2019, BMVC 2019. arXiv:1905.03422
  13. Style Normalization and Restitution for Generalizable Person Re-Identification. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 3140–3149. arXiv:2005.11037
  14. William Johnson and Joram Lindenstrauss. 1984. Extensions of Lipschitz maps into a Hilbert space. Contemp. Math. 26 (1984), 189–206. https://doi.org/10.1090/conm/026/737400
  15. Knowledge Distillation by On-the-Fly Native Ensemble. In Proceedings of the 32nd International Conference on Neural Information Processing Systems (NIPS’18). Curran Associates Inc., Red Hook, NY, USA, 7528–7538.
  16. DeepReID: Deep Filter Pairing Neural Network for Person Re-identification. In 2014 IEEE Conference on Computer Vision and Pattern Recognition. 152–159.
  17. Reliability Exploration with Self-Ensemble Learning for Domain Adaptive Person Re-identification. Proceedings of the AAAI Conference on Artificial Intelligence 36, 2 (2022), 1527–1535. https://doi.org/10.1609/aaai.v36i2.20043
  18. Shengcai Liao and Ling Shao. 2020. Interpretable and Generalizable Deep Image Matching with Adaptive Convolutions. European Conference on Computer Vision (ECCV) abs/1904.1 (2020). arXiv:1904.10424
  19. Shan Lin and Chang Tsun Li. 2017. End-to-End Correspondence and Relationship Learning of Mid-Level Deep Features for Person Re-Identification. In DICTA 2017 - 2017 International Conference on Digital Image Computing: Techniques and Applications, Vol. 2017-Decem. 1–6. https://doi.org/10.1109/DICTA.2017.8227426
  20. Multi-Domain Adversarial Feature Generalization for Person Re-Identification. IEEE Transactions on Image Processing (nov 2020). arXiv:2011.12563
  21. Bag of tricks and a strong baseline for deep person re-identification. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Vol. 2019-June. 1487–1495. arXiv:1903.07071
  22. Part-Aware Transformer for Generalizable Person Re-identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 11280–11289.
  23. Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 11208 LNCS. 484–500. https://doi.org/10.1007/978-3-030-01225-0_29 arXiv:1807.09441
  24. A Novel Mix-Normalization Method for Generalizable Multi-Source Person Re-Identification. IEEE Transactions on Multimedia (2022), 1–12. https://doi.org/10.1109/TMM.2022.3183393
  25. Generalizable person re-identification by domain-invariant mapping network. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 2019-June. 719–728.
  26. Style Interleaved Learning for Generalizable Person Re-identification. IEEE Transactions on Multimedia (2023). https://doi.org/10.1109/TMM.2023.3283878
  27. Learning discriminative features with multiple granularities for person re-identification. In MM 2018 - Proceedings of the 2018 ACM Multimedia Conference. 274–282. https://doi.org/10.1145/3240508.3240552 arXiv:1804.01438
  28. Person Transfer GAN to Bridge Domain Gap for Person Re-identification. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 79–88. https://doi.org/10.1109/CVPR.2018.00016 arXiv:1711.08565
  29. A Discriminative Feature Learning Approach for Deep Face Recognition. In Proc. European Conference on Computer Vision (ECCV). Vol. 9911 LNCS. Springer Science+Business Media, 499–515.
  30. Mimic Embedding via Adaptive Aggregation: Learning Generalizable Person Re-identification. In Computer Vision – ECCV 2022, Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, and Tal Hassner (Eds.). Springer Nature Switzerland, Cham, 372–388.
  31. Deep Learning for Person Re-identification: A Survey and Outlook. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).
  32. Multiple Domain Experts Collaborative Learning: Multi-Source Domain Generalization For Person Re-Identification. arXiv:2105.12355 [cs.CV]
  33. Adaptive Cross-domain Learning for Generalizable Person Re-identification. In Computer Vision – ECCV 2022, Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, and Tal Hassner (Eds.). Springer Nature Switzerland, Cham, 215–232.
  34. Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 6273–6282. https://doi.org/10.1109/CVPR46437.2021.00621 arXiv:2012.00417
  35. Scalable person re-identification: A benchmark. In Proceedings of the IEEE International Conference on Computer Vision, Vol. 2015 Inter. 1116–1124.
  36. Person Re-identification: Past, Present and Future. arXiv preprint (2016). arXiv:1610.02984 http://arxiv.org/abs/1610.02984
  37. Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in Vitro. In Proceedings of the IEEE International Conference on Computer Vision, Vol. 2017-Octob. 3774–3782. arXiv:1701.07717
  38. Random erasing data augmentation. In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence. 13001–13008. https://doi.org/10.1609/aaai.v34i07.7000 arXiv:1708.04896
  39. Learning Generalisable Omni-Scale Representations for Person Re-Identification. In IEEE Transactions on Pattern Analysis and Machine Intelligence. https://doi.org/10.1109/TPAMI.2021.3069237 arXiv:1910.06827
  40. Rethinking the Distribution Gap of Person Re-identification with Camera-Based Batch Normalization. In European Conference on Computer Vision. Springer, 140–157.

Summary

We haven't generated a summary for this paper yet.

Whiteboard

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.