Spatial Cascaded Clustering and Weighted Memory for Unsupervised Person Re-identification (2403.00261v1)
Abstract: Recent unsupervised person re-identification (re-ID) methods achieve high performance by leveraging fine-grained local context. These methods are referred to as part-based methods. However, most part-based methods obtain local contexts through horizontal division, which suffer from misalignment due to various human poses. Additionally, the misalignment of semantic information in part features restricts the use of metric learning, thus affecting the effectiveness of part-based methods. The two issues mentioned above result in the under-utilization of part features in part-based methods. We introduce the Spatial Cascaded Clustering and Weighted Memory (SCWM) method to address these challenges. SCWM aims to parse and align more accurate local contexts for different human body parts while allowing the memory module to balance hard example mining and noise suppression. Specifically, we first analyze the foreground omissions and spatial confusions issues in the previous method. Then, we propose foreground and space corrections to enhance the completeness and reasonableness of the human parsing results. Next, we introduce a weighted memory and utilize two weighting strategies. These strategies address hard sample mining for global features and enhance noise resistance for part features, which enables better utilization of both global and part features. Extensive experiments on Market-1501 and MSMT17 validate the proposed method's effectiveness over many state-of-the-art methods.
- Unsupervised learning of visual features by contrasting cluster assignments. Advances in Neural Information Processing Systems, 33:9912–9924, 2020.
- Mixed high-order attention network for person re-identification. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 2019a.
- Ice: Inter-instance contrastive encoding for unsupervised person re-identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 14960–14969, 2021a.
- Joint generative and contrastive learning for unsupervised person re-identification. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), page 8. IEEE, 2021b.
- Abd-net: Attentive but diverse person re-identification. In Proceedings of the IEEE/CVF international conference on computer vision, pages 8351–8361, 2019b.
- A simple framework for contrastive learning of visual representations. In International conference on machine learning, pages 1597–1607, 2020.
- Beyond appearance: a semantic controllable self-supervised learning framework for human-centric visual tasks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15050–15061, 2023.
- Part-based pseudo label refinement for unsupervised person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7308–7318, 2022.
- Cluster contrast for unsupervised person re-identification. In Proceedings of the Asian Conference on Computer Vision, pages 1142–1160, 2022.
- A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD, page 6, 1996.
- Unsupervised person re-identification. ACM Transactions on Multimedia Computing, Communications, and Applications, 14(4):1–18, 2018.
- Mutual mean-teaching: Pseudo label refinery for unsupervised domain adaptation on person re-identification. arXiv preprint arXiv:2001.01526, 2020a.
- Self-paced contrastive learning with hybrid memory for domain adaptive object re-id. Advances in Neural Information Processing Systems, 33:11309–11321, 2020b.
- Beyond human parts: Dual part-aligned representations for person re-identification. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 2019.
- Dimensionality reduction by learning an invariant mapping. In 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2 (CVPR’06), pages 1735–1742. IEEE, 2006.
- Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), number 6. IEEE, 2016.
- Momentum contrast for unsupervised visual representation learning. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 9729–9738. IEEE, 2020.
- Learning deep context-aware features over body and latent parts for person re-identification. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 384–393, 2017.
- Harmonious attention network for person re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2285–2294, 2018.
- A bottom-up clustering approach to unsupervised person re-identification. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01):8738–8745, 2019.
- Pose transferrable person re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4099–4108, 2018.
- Frequency information matters for image matting. In Asian Conference on Pattern Recognition, pages 81–94. Springer, 2023.
- Encoding based saliency detection for videos and images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2494–2502, 2015.
- Pose-guided feature alignment for occluded person re-identification. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 542–551, 2019.
- Ishan Misra and Laurens Van Der Maaten. Self-supervised learning of pretext-invariant representations. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 6707–6717. IEEE, 2020.
- A pose-sensitive embedding for person re-identification with expanded cross neighborhood re-ranking. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 420–429, 2018.
- Dual attention matching network for context-aware feature sequence based person re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5363–5372, 2018.
- Mask-guided contrastive attention model for person re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1179–1188, 2018.
- Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In Proceedings of the European conference on computer vision (ECCV), pages 480–496, 2018.
- Contrastive multiview coding. In Computer Vision – ECCV 2020, pages 776–794. Springer International Publishing, 2020.
- Laurens Van Der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of Machine Learning Research (JMLR), 4:7, 2008.
- Unsupervised person re-identification via multi-label classification. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10981–10990. IEEE, 2020.
- Learning discriminative features with multiple granularities for person re-identification. In Proceedings of the 26th ACM international conference on Multimedia, pages 274–282, 2018.
- Camera-aware proxies for unsupervised person re-identification. Proceedings of the AAAI Conference on Artificial Intelligence, 35(4):2764–2772, 2021.
- Person transfer gan to bridge domain gap for person re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, page 5. IEEE, 2018.
- Discover cross-modality nuances for visible-infrared person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4330–4339, 2021.
- Multi-centroid representation network for domain adaptive person re-id. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 2750–2758, 2022.
- Unsupervised feature learning via non-parametric instance discrimination. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, 2018.
- Intra-inter camera similarity for unsupervised person re-identification. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 11926–11935. IEEE, 2021.
- Towards rich feature discovery with class activation maps augmentation for person re-identification. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 1389–1398, 2019.
- Deep representation learning with part loss for person re-identification. IEEE Transactions on Image Processing, 28:2860–2871, 2019.
- Unsupervised embedding learning via invariant and spreading instance feature. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 6210–6219. IEEE, 2019.
- Hierarchical clustering with hard-batch triplet loss for person re-identification. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2008.
- Refining pseudo labels with clustering consensus over generations for unsupervised object re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3436–3445, 2021.
- Implicit sample extension for unsupervised person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7369–7378, 2022.
- Pyramidal person re-identification via multi-loss dynamic training. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 8514–8522, 2019a.
- Scalable person re-identification: A benchmark. In 2015 IEEE International Conference on Computer Vision (ICCV), number 5. IEEE, 2015.
- Camera style and identity disentangling network for person re-identification. In BMVC, page 66, 2019b.
- Online pseudo label generation by hierarchical cluster dynamics for adaptive person re-identification. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 8371–8381. IEEE, 2021.
- Identity-guided human semantic parsing for person re-identification. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part III 16, pages 346–363. Springer, 2020.
- Plip: Language-image pre-training for person representation learning, 2023.
- Jiahao Hong (4 papers)
- Jialong Zuo (22 papers)
- Chuchu Han (13 papers)
- Ruochen Zheng (3 papers)
- Ming Tian (5 papers)
- Changxin Gao (76 papers)
- Nong Sang (86 papers)