Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Parameter Hierarchical Optimization for Visible-Infrared Person Re-Identification (2404.07930v1)

Published 11 Apr 2024 in cs.CV and cs.AI

Abstract: Visible-infrared person re-identification (VI-reID) aims at matching cross-modality pedestrian images captured by disjoint visible or infrared cameras. Existing methods alleviate the cross-modality discrepancies via designing different kinds of network architectures. Different from available methods, in this paper, we propose a novel parameter optimizing paradigm, parameter hierarchical optimization (PHO) method, for the task of VI-ReID. It allows part of parameters to be directly optimized without any training, which narrows the search space of parameters and makes the whole network more easier to be trained. Specifically, we first divide the parameters into different types, and then introduce a self-adaptive alignment strategy (SAS) to automatically align the visible and infrared images through transformation. Considering that features in different dimension have varying importance, we develop an auto-weighted alignment learning (AAL) module that can automatically weight features according to their importance. Importantly, in the alignment process of SAS and AAL, all the parameters are immediately optimized with optimization principles rather than training the whole network, which yields a better parameter training manner. Furthermore, we establish the cross-modality consistent learning (CCL) loss to extract discriminative person representations with translation consistency. We provide both theoretical justification and empirical evidence that our proposed PHO method outperform existing VI-reID approaches.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (62)
  1. Structure-aware positional transformer for visible-infrared person re-identification. IEEE Transactions on Image Processing, 31:2352–2364, 2022.
  2. Neural feature search for rgb-infrared person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 587–597, 2021.
  3. Hi-cmd: Hierarchical cross-modality disentanglement for visible-infrared person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10257–10266, 2020.
  4. Cross-modality person re-identification with generative adversarial training. In Proceedings of the International Joint Conference on Artificial Intelligence, volume 1, page 6, 2018.
  5. C. Eom and B. Ham. Learning disentangled representation for robust person re-identification. Advances in Neural Information Processing Systems, 32, 2019.
  6. Learning modality-specific representations for visible-infrared person re-identification. IEEE Transactions on Image Processing, 29:579–590, 2019.
  7. Cm-nas: Cross-modality neural architecture search for visible-infrared person re-identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 11823–11832, 2021.
  8. Hsme: Hypersphere manifold embedding for visible thermal person re-identification. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 8385–8392, 2019.
  9. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 770–778, 2016.
  10. Automated variable weighting in k-means type clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(5):657–668, 2005.
  11. Modality-adaptive mixup and invariant decomposition for rgb-infrared person re-identification. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 1034–1042, 2022.
  12. Mmd-reid: A simple but effective solution for visible-thermal person reid. arXiv preprint arXiv:2111.05059, 2021.
  13. Sdl: Spectrum-disentangled representation learning for visible-infrared person re-identification. IEEE Transactions on Circuits and Systems for Video Technology, 30(10):3422–3432, 2020.
  14. D. P. Kingma and J. Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
  15. Infrared-visible cross-modal person re-identification with an x modality. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 4610–4617, 2020.
  16. Intermediary-guided bidirectional spatial-temporal aggregation network for video-based visible-infrared person re-identification. IEEE Transactions on Circuits and Systems for Video Technology, 2023. doi: 10.1109/TCSVT.2023.3246091.
  17. Visible-infrared person re-identification with modality-specific memory network. IEEE Transactions on Image Processing, 31:7165–7178, 2022.
  18. S. Liao and L. Shao. Transmatcher: Deep image matching through transformers for generalizable person re-identification. Advances in Neural Information Processing Systems, 34:1992–2003, 2021.
  19. Learning modal-invariant and temporal-memory for video-based visible-infrared person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 20973–20982, 2022.
  20. A multi-constraint similarity learning with adaptive weighting for visible-thermal person re-identification. In Proceedings of the International Joint Conference on Artificial Intelligence, pages 845–851, 2021.
  21. Parameter sharing exploration and hetero-center triplet loss for visible-thermal person re-identification. IEEE Transactions on Multimedia, 23:4414–4425, 2020.
  22. Learning memory-augmented unidirectional metrics for cross-modality person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 19366–19375, 2022.
  23. Cross-modality person re-identification with shared-specific feature transfer. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 13379–13389, 2020.
  24. Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors, 17(3):605, 2017.
  25. Learning by aligning: Visible-infrared person re-identification using cross-modal correspondences. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 12046–12055, 2021.
  26. Dual gaussian-based variational subspace disentanglement for visible-infrared person re-identification. In Proceedings of the 28th ACM International Conference on Multimedia, pages 2149–2158, 2020.
  27. Unsupervised generalizable multi-source person re-identification: A domain-specific adaptive framework. Pattern Recognition, 140:109546, 2023.
  28. Fine-tuning cnn image retrieval with no human annotation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(7):1655–1668, 2018.
  29. On the convergence of adam and beyond. arXiv preprint arXiv:1904.09237, 2019.
  30. Tri-modality consistency optimization with heterogeneous augmented images for visible-infrared person re-identification. Neurocomputing, 523:170–181, 2023.
  31. Z. Sun and F. Zhao. Counterfactual attention alignment for visible-infrared cross-modality person re-identification. Pattern Recognition Letters, 168:79–85, 2023.
  32. Aanet: Attribute attention network for person re-identifications. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7134–7143, 2019.
  33. Farewell to mutual information: Variational distillation for cross-modal person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1522–1531, 2021.
  34. L. Van der Maaten and G. Hinton. Visualizing data using t-sne. Journal of Machine Learning Research, 9(11), 2008.
  35. G2da: Geometry-guided dual-alignment learning for rgb-infrared person re-identification. Pattern Recognition, 135:109150, 2023.
  36. Rgb-infrared cross-modality person re-identification via joint pixel and feature alignment. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 3623–3632, 2019.
  37. Cross-modality paired-images generation for rgb-infrared person re-identification. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 12144–12151, 2020.
  38. Nformer: Robust person re-identification with neighbor transformer. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7297–7307, 2022.
  39. Deep multi-patch matching network for visible thermal person re-identification. IEEE Transactions on Multimedia, 23:1474–1488, 2020.
  40. Learning to reduce dual-level discrepancy for infrared-visible person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 618–626, 2019.
  41. Flexible body partition-based adversarial learning for visible infrared person re-identification. IEEE Transactions on Neural Networks and Learning Systems, 33(9):4676–4687, 2021.
  42. Rgb-infrared cross-modality person re-identification. In Proceedings of the IEEE International Conference on Computer Vision, pages 5380–5389, 2017.
  43. Discover cross-modality nuances for visible-infrared person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4330–4339, 2021.
  44. Attention-aware compositional network for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2119–2128, 2018.
  45. Learning with twin noisy labels for visible-infrared person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14308–14317, 2022.
  46. Hierarchical discriminative learning for visible thermal person re-identification. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 32, 2018.
  47. Bi-directional center-constrained top-ranking for visible thermal person re-identification. IEEE Transactions on Information Forensics and Security, 15:407–419, 2019.
  48. Channel augmented joint learning for visible-infrared recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 13567–13576, 2021.
  49. Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In Proceedings of European Conference on Computer Vision, pages 229–247. Springer, 2020.
  50. Deep learning for person re-identification: A survey and outlook. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(6):2872–2893, 2021.
  51. Visible-infrared person re-identification via homogeneous augmented tri-modal learning. IEEE Transactions on Information Forensics and Security, 16:728–739, 2020.
  52. Visible thermal person re-identification via dual-constrained top-ranking. In Proceedings of the International Joint Conference on Artificial Intelligence, volume 1, page 2, 2018.
  53. Unsupervised person re-identification by deep asymmetric metric embedding. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(4):956–973, 2018.
  54. An iterative locally auto-weighted least squares method for microarray missing value estimation. IEEE Transactions on Nanobioscience, 16(1):21–33, 2016.
  55. Fmcnet: Feature-level modality compensation for visible-infrared person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7349–7358, 2022.
  56. Attend to the difference: Cross-modality person re-identification via contrastive correlation. IEEE Transactions on Image Processing, 30:8861–8872, 2021.
  57. Mrcn: A novel modality restitution and compensation network for visible-infrared person re-identification. arXiv preprint arXiv:2303.14626, 2023.
  58. Rgb-ir cross-modality person reid based on teacher-student gan model. Pattern Recognition Letters, 150:155–161, 2021.
  59. Progressive attribute embedding for accurate cross-modality person re-id. In Proceedings of the 30th ACM International Conference on Multimedia, pages 4309–4317, 2022.
  60. Person re-identification: Past, present and future. arXiv preprint arXiv:1610.02984, 2016.
  61. Joint discriminative and generative learning for person re-identification. In proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2138–2147, 2019.
  62. Random erasing data augmentation. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 13001–13008, 2020.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Zeng YU (4 papers)
  2. Yunxiao Shi (20 papers)

Summary

We haven't generated a summary for this paper yet.