
Implicit Discriminative Knowledge Learning for Visible-Infrared Person Re-Identification (2403.11708v3)

Published 18 Mar 2024 in cs.CV

Abstract: Visible-Infrared Person Re-identification (VI-ReID) is a challenging cross-modal pedestrian retrieval task, due to significant intra-class variations and cross-modal discrepancies among different cameras. Existing works mainly focus on embedding images of different modalities into a unified space to mine modality-shared features. They seek distinctive information only within these shared features, ignoring the identity-aware information that is implicit in the modality-specific features. To address this issue, we propose a novel Implicit Discriminative Knowledge Learning (IDKL) network to uncover and leverage the implicit discriminative information contained within the modality-specific features. First, we extract modality-specific and modality-shared features using a novel dual-stream network. Then, the modality-specific features undergo purification to reduce their modality style discrepancies while preserving identity-aware discriminative knowledge. Subsequently, this implicit knowledge is distilled into the modality-shared features to enhance their distinctiveness. Finally, an alignment loss is proposed to minimize the modality discrepancy on the enhanced modality-shared features. Extensive experiments on multiple public datasets demonstrate the superiority of the IDKL network over state-of-the-art methods. Code is available at https://github.com/1KK077/IDKL.
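
The three steps named in the abstract (purification, distillation, alignment) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the IDKL code: instance normalization stands in for the purification step, a temperature-softened KL divergence stands in for the distillation step, and a squared distance between modality mean features stands in for the alignment loss. The function names are hypothetical, and the abstract does not specify the authors' actual formulations.

```python
import math

def instance_norm(feat, eps=1e-5):
    """Stand-in for 'purification' (assumption): strip the per-sample
    mean and variance, which are often treated as style statistics."""
    mean = sum(feat) / len(feat)
    var = sum((x - mean) ** 2 for x in feat) / len(feat)
    return [(x - mean) / math.sqrt(var + eps) for x in feat]

def softmax(logits, temperature=1.0):
    """Numerically stable softmax with an optional temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(specific_logits, shared_logits, temperature=2.0, eps=1e-12):
    """Stand-in for 'distillation' (assumption): KL(teacher || student),
    with the modality-specific branch as teacher and the shared branch
    as student, both softened by the same temperature."""
    p = softmax(specific_logits, temperature)
    q = softmax(shared_logits, temperature)
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def alignment_loss(visible_feats, infrared_feats):
    """Stand-in for the 'alignment loss' (assumption): squared Euclidean
    distance between the mean shared features of the two modalities."""
    dim = len(visible_feats[0])
    mean_v = [sum(f[d] for f in visible_feats) / len(visible_feats) for d in range(dim)]
    mean_i = [sum(f[d] for f in infrared_feats) / len(infrared_feats) for d in range(dim)]
    return sum((a - b) ** 2 for a, b in zip(mean_v, mean_i))
```

In this sketch the distillation term is zero when the two branches produce identical logits and positive otherwise, and the alignment term is zero when both modalities share the same mean feature. For the method as actually implemented, the linked repository is the authoritative source.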

Authors (2)
  1. Kaijie Ren (1 paper)
  2. Lei Zhang (1689 papers)
Citations (2)
