Implicit Discriminative Knowledge Learning for Visible-Infrared Person Re-Identification (2403.11708v3)
Abstract: Visible-Infrared Person Re-identification (VI-ReID) is a challenging cross-modal pedestrian retrieval task, due to significant intra-class variations and cross-modal discrepancies among different cameras. Existing works mainly focus on embedding images of different modalities into a unified space to mine modality-shared features. They only seek distinctive information within these shared features, while ignoring the identity-aware useful information that is implicit in the modality-specific features. To address this issue, we propose a novel Implicit Discriminative Knowledge Learning (IDKL) network to uncover and leverage the implicit discriminative information contained within the modality-specific. First, we extract modality-specific and modality-shared features using a novel dual-stream network. Then, the modality-specific features undergo purification to reduce their modality style discrepancies while preserving identity-aware discriminative knowledge. Subsequently, this kind of implicit knowledge is distilled into the modality-shared feature to enhance its distinctiveness. Finally, an alignment loss is proposed to minimize modality discrepancy on enhanced modality-shared features. Extensive experiments on multiple public datasets demonstrate the superiority of IDKL network over the state-of-the-art methods. Code is available at https://github.com/1KK077/IDKL.
- Structure-aware positional transformer for visible-infrared person re-identification. IEEE Transactions on Image Processing, 31:2352–2364, 2022.
- Neural feature search for rgb-infrared person re-identification. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 587–597, 2021.
- Visible-infrared person re-identification via semantic alignment and affinity inference. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 11270–11279, 2023.
- Cm-nas: Cross-modality neural architecture search for visible-infrared person re-identification. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 11803–11812, 2021.
- Cross-modal cross-domain dual alignment network for rgb-infrared person re-identification. IEEE Transactions on Circuits and Systems for Video Technology, 32(10):6874–6887, 2022.
- Unsupervised domain adaptation by backpropagation. In International conference on machine learning, pages 1180–1189. PMLR, 2015.
- Cross-modality person re-identification via modality confusion and center aggregation. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 16383–16392, 2021.
- Hsme: Hypersphere manifold embedding for visible thermal person re-identification. In Proceedings of the AAAI conference on artificial intelligence, pages 8385–8392, 2019.
- Transreid: Transformer-based object re-identification. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 14993–15002, 2021.
- Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 7132–7141, 2018.
- Arbitrary style transfer in real-time with adaptive instance normalization. In Proceedings of the IEEE international conference on computer vision, pages 1501–1510, 2017.
- Modality-adaptive mixup and invariant decomposition for rgb-infrared person re-identification. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 1034–1042, 2022.
- Frustratingly easy person re-identification: Generalizing person re-id in practice. arXiv preprint arXiv:1905.03422, 2019.
- Cross-modality transformer for visible-infrared person re-identification. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XIV, pages 480–496. Springer, 2022.
- Style normalization and restitution for generalizable person re-identification. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 3143–3152, 2020.
- Counterfactual intervention feature transfer for visible-infrared person re-identification. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXVI, pages 381–398. Springer, 2022.
- Homogeneous-to-heterogeneous: Unsupervised learning for rgb-infrared person re-identification. IEEE Transactions on Image Processing, 30:6392–6407, 2021.
- Parameter sharing exploration and hetero-center triplet loss for visible-thermal person re-identification. IEEE Transactions on Multimedia, 23:4414–4425, 2021.
- Learning memory-augmented unidirectional metrics for cross-modality person re-identification. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 19344–19353, 2022a.
- Neural image parts group search for person re-identification. IEEE Transactions on Circuits and Systems for Video Technology, 2022b.
- Cross-modality person re-identification with shared-specific feature transfer. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 13376–13386, 2020.
- Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors, 17(3):605, 2017.
- Two at once: Enhancing learning and generalization capacities via ibn-net. In Proceedings of the European Conference on Computer Vision (ECCV), pages 464–479, 2018.
- Fine-tuning cnn image retrieval with no human annotation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(7):1655–1668, 2019.
- Learning instance-level spatial-temporal patterns for person re-identification. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 14910–14919, 2021.
- Grad-cam: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE international conference on computer vision, pages 618–626, 2017.
- Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In Proceedings of the European conference on computer vision (ECCV), pages 480–496, 2018.
- Rgb-infrared cross-modality person re-identification via joint pixel and feature alignment. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pages 3622–3631, 2019a.
- Cross-modality paired-images generation for rgb-infrared person re-identification. In Proceedings of the AAAI conference on artificial intelligence, pages 12144–12151, 2020.
- Learning to reduce dual-level discrepancy for infrared-visible person re-identification. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 618–626, 2019b.
- Syncretic modality collaborative learning for visible infrared person re-identification. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 225–234, 2021.
- Rgb-infrared cross-modality person re-identification. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 5390–5399, 2017.
- Discover cross-modality nuances for visible-infrared person re-identification. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 4328–4337, 2021.
- Cross-modality paired-images generation and augmentation for rgb-infrared person re-identification. Neural Networks, 128:294–304, 2020.
- Cross-modality person re-identification via modality-aware collaborative ensemble learning. IEEE Transactions on Image Processing, 29:9387–9399, 2020a.
- Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XVII 16, pages 229–247. Springer, 2020b.
- Channel augmented joint learning for visible-infrared recognition. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 13547–13556, 2021a.
- Visible-infrared person re-identification via homogeneous augmented tri-modal learning. IEEE Transactions on Information Forensics and Security, 16:728–739, 2021b.
- Deep learning for person re-identification: A survey and outlook. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(6):2872–2893, 2022.
- Modality unifying network for visible-infrared person re-identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 11185–11195, 2023.
- Style uncertainty based self-paced meta learning for generalizable person re-identification. IEEE Transactions on Image Processing, 32:2107–2119, 2023.
- Fmcnet: Feature-level modality compensation for visible-infrared person re-identification. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 7339–7348, 2022a.
- Diverse embedding expansion network and low-light cross-modality benchmark for visible-infrared person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2153–2162, 2023.
- Deep mutual learning. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4320–4328, 2018.
- Modality synergy complement learning with cascaded aggregation for visible-infrared person re-identification. In Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XIV, pages 462–479. Springer, 2022b.
- Spatial-channel enhanced transformer for visible-infrared person re-identification. IEEE Transactions on Multimedia, pages 1–1, 2022.
- Visible-infrared person re-identification via partially interactive collaboration. IEEE Transactions on Image Processing, 31:6951–6963, 2022.
- Omni-scale feature learning for person re-identification. In Proceedings of the IEEE/CVF international conference on computer vision, pages 3702–3712, 2019.
- Hetero-center loss for cross-modality person re-identification. Neurocomputing, 386:97–109, 2020.
- Kaijie Ren (1 paper)
- Lei Zhang (1689 papers)