Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multi-spectral Vehicle Re-identification with Cross-directional Consistency Network and a High-quality Benchmark (2208.00632v2)

Published 1 Aug 2022 in cs.CV

Abstract: To tackle the challenge of vehicle re-identification (Re-ID) in complex lighting environments and diverse scenes, multi-spectral sources like visible and infrared information are taken into consideration due to their excellent complementary advantages. However, multi-spectral vehicle Re-ID suffers cross-modality discrepancy caused by heterogeneous properties of different modalities as well as a big challenge of the diverse appearance with different views in each identity. Meanwhile, diverse environmental interference leads to heavy sample distributional discrepancy in each modality. In this work, we propose a novel cross-directional consistency network to simultaneously overcome the discrepancies from both modality and sample aspects. In particular, we design a new cross-directional center loss to pull the modality centers of each identity close to mitigate cross-modality discrepancy, while the sample centers of each identity close to alleviate the sample discrepancy. Such strategy can generate discriminative multi-spectral feature representations for vehicle Re-ID. In addition, we design an adaptive layer normalization unit to dynamically adjust individual feature distribution to handle distributional discrepancy of intra-modality features for robust learning. To provide a comprehensive evaluation platform, we create a high-quality RGB-NIR-TIR multi-spectral vehicle Re-ID benchmark (MSVR310), including 310 different vehicles from a broad range of viewpoints, time spans and environmental complexities. Comprehensive experiments on both created and public datasets demonstrate the effectiveness of the proposed approach comparing to the state-of-the-art methods.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (75)
  1. Multi-feature, multi-modal, and multi-source social event detection: A comprehensive survey. Information Fusion 79, 279–308.
  2. Pedestrian re-identification algorithm based on visual attention-positive sample generation network deep learning model. Information Fusion 86-87, 136–145.
  3. Layer normalization. arXiv preprint arXiv:1607.06450 .
  4. Re-identification with rgb-d sensors, in: Proc. European Conference on Computer Vision Workshops.
  5. Deep meta metric learning, in: Proc. IEEE/CVF International Conference on Computer Vision.
  6. Shape-former: Bridging cnn and transformer via shapeconv for multimodal image matching. Information Fusion 91, 445–457.
  7. Vehicle re-identification with viewpoint-aware metric learning, in: Proc. IEEE/CVF International Conference on Computer Vision, pp. 8281–8290.
  8. Imagenet: A large-scale hierarchical image database, in: Proc. IEEE/CVF International Conference on Computer Vision.
  9. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 .
  10. A cross-modal distillation network for person re-identification in rgb-depth. arXiv preprint arXiv:1810.11641 .
  11. Learning coarse-to-fine structured feature embedding for vehicle re-identification, in: Proc. AAAI Conference on Artificial Intelligence.
  12. Part-regularized near-duplicate vehicle re-identification, in: Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3992–4000.
  13. Deep residual learning for image recognition, pp. 770–778.
  14. Transreid: Transformer-based object re-identification, in: Proc. IEEE/CVF International Conference on Computer Vision, pp. 15013–15022.
  15. In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737 .
  16. Squeeze-and-excitation networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 42, 2011–2023.
  17. Densely connected convolutional networks, in: Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708.
  18. Deep learning for visible-infrared cross-modality person re-identification: A comprehensive review. Information Fusion 91, 396–411.
  19. A strong baseline for vehicle re-identification, in: Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 4142–4149.
  20. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167 .
  21. A dual-path model with adaptive attention for vehicle re-identification, in: Proc. IEEE/CVF International Conference on Computer Vision, pp. 6131–6140.
  22. Adam: A method for stochastic optimization. CoRR abs/1412.6980.
  23. Learning collaborative sparse representation for grayscale-thermal tracking. IEEE Transactions on Image Processing 25, 5743–5756.
  24. Infrared-visible cross-modal person re-identification with an x modality, in: Proc. AAAI Conference on Artificial Intelligence, pp. 4610–4617.
  25. Attribute and state guided structural embedding network for vehicle re-identification. IEEE transactions on image processing 31, 5949–5962.
  26. Multi-spectral vehicle re-identification: A challenge., in: Proc. AAAI Conference on Artificial Intelligence, pp. 11345–11353.
  27. Class-aware modality mix and center-guided metric learning for visible-thermal person re-identification, in: Proc. ACM International Conference on Multimedia, pp. 889–897.
  28. Deep relative distance learning: Tell the difference between similar vehicles, in: Proc. IEEE Conference on Computer Vision and Pattern Recognition.
  29. Learning memory-augmented unidirectional metrics for cross-modality person re-identification, in: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 19344–19353. doi:10.1109/CVPR52688.2022.01876.
  30. Large-scale vehicle re-identification in urban surveillance videos, in: Proc. IEEE International Conference on Multimedia and Expo, pp. 1–6.
  31. Embedding adversarial learning for vehicle re-identification. IEEE Transactions on Image Processing 28, 3794–3807.
  32. Veri-wild: A large dataset and a new method for vehicle re-identification in the wild, in: Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3230–3238.
  33. Rgbt tracking via multi-adapter network with hierarchical divergence loss. IEEE Transactions on Image Processing 30, 5613–5625.
  34. Cross-modality person re-identification with shared-specific feature transfer, in: Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13376–13386.
  35. A strong baseline and batch normalization neck for deep person re-identification. IEEE Transactions on Multimedia 22, 2597–2609.
  36. Visualizing data using t-sne. Journal of Machine Learning Research 9, 2579–2605.
  37. Parsing-based view-aware embedding network for vehicle re-identification, in: Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7101–7110.
  38. Tri-modal person re-identification with rgb, depth and thermal features, in: Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 301–307.
  39. 3d reconstruction of freely moving persons for re-identification with a depth sensor, in: Proc. IEEE International Conference on Robotics and Automation, pp. 4512–4519.
  40. Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors (Basel, Switzerland) 17.
  41. Learning by aligning: Visible-infrared person re-identification using cross-modal correspondences, in: Proc. IEEE/CVF International Conference on Computer Vision, pp. 12046–12055.
  42. Mobilenetv2: Inverted residuals and linear bottlenecks, in: Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4510–4520.
  43. Learning deep neural networks for vehicle re-id with visual-spatio-temporal path proposals, in: Proc. IEEE International Conference on Computer Vision, pp. 1918–1927.
  44. Circle loss: A unified perspective of pair similarity optimization, in: Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6397–6406.
  45. Beyond part models: Person retrieval with refined part pooling, in: Proc. European Conference on Computer Vision.
  46. Rethinking the inception architecture for computer vision, in: Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826.
  47. Cityflow: A city-scale benchmark for multi-target multi-camera vehicle tracking and re-identification, in: Proc. IEEE/CVF Internaltional Conference on Computer Vision and Pattern Recognition, pp. 8789–8798.
  48. Multi-interactive dual-decoder for rgb-thermal salient object detection. IEEE Transactions on Image Processing 30, 5678–5691.
  49. M5l: Multi-modal multi-margin metric learning for rgbt tracking. IEEE Transactions on Image Processing 31, 85–98.
  50. Instance normalization: The missing ingredient for fast stylization. arXiv preprint arXiv:1607.08022 .
  51. Learning discriminative features with multiple granularities for person re-identification, in: Proc. ACM International Conference on Multimedia.
  52. Rgb-infrared cross-modality person re-identification via joint pixel and feature alignment, in: Proc. IEEE/CVF International Conference on Computer Vision, pp. 3622–3631.
  53. Orientation invariant feature embedding and spatial temporal regularization for vehicle re-identification, in: Proc. IEEE International Conference on Computer Vision, pp. 379–387.
  54. Flexible body partition-based adversarial learning for visible infrared person re-identification. IEEE Transactions on Neural Networks and Learning Systems 33, 4676–4687.
  55. Syncretic modality collaborative learning for visible infrared person re-identification, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 225–234.
  56. Rbdf: Reciprocal bidirectional framework for visible infrared person reidentification. IEEE Transactions on Cybernetics 52, 10988–10998.
  57. Dynamic clustering of multi-modal sensor networks in urban scenarios. Information Fusion 15, 130–140.
  58. A discriminative feature learning approach for deep face recognition, in: Proc. European Conference on Computer Vision.
  59. Robust depth-based person re-identification. IEEE Transactions on Image Processing 26, 2588–2603.
  60. Rgb-infrared cross-modality person re-identification, in: Proc. IEEE International Conference on Computer Vision, pp. 5390–5399.
  61. An end-to-end heterogeneous restraint network for rgb-d cross-modal person re-identification. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 18, 1–22.
  62. Discover cross-modality nuances for visible-infrared person re-identification, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4330–4339.
  63. Group normalization, in: Proc. European Conference on Computer Vision.
  64. Simulating content consistent vehicle datasets with attribute descent, in: Proc. European Conference on Computer Vision.
  65. Dynamic tri-level relation mining with attentive graph for visible infrared re-identification. IEEE Transactions on Information Forensics and Security 17, 386–398.
  66. Bi-directional center-constrained top-ranking for visible thermal person re-identification. IEEE Transactions on Information Forensics and Security 15, 407–419.
  67. Dynamic dual-attentive aggregation learning for visible-infrared person re-identification, in: Proc. European Conference on Computer Vision, Springer. pp. 229–247.
  68. Deep learning for person re-identification: A survey and outlook. IEEE Transactions on Pattern Analysis and Machine Intelligence PP.
  69. Visible thermal person re-identification via dual-constrained top-ranking, in: Proc. International Joint Conference on Artificial Intelligence.
  70. Towards a unified middle modality learning for visible-infrared person re-identification, in: Proceedings of the 29th ACM International Conference on Multimedia, Association for Computing Machinery, New York, NY, USA. p. 788–796. URL: https://doi.org/10.1145/3474085.3475250, doi:10.1145/3474085.3475250.
  71. Heterogeneous relational complement for vehicle re-identification, in: Proc. IEEE/CVF International Conference on Computer Vision, pp. 205–214.
  72. Robust multi-modality person re-identification, in: Proc. AAAI Conference on Artificial Intelligence, pp. 3529–3537.
  73. Scalable person re-identification: A benchmark, in: Proc. IEEE/CVF International Conference on Computer Vision, pp. 1116–1124.
  74. Omni-scale feature learning for person re-identification, in: Proc. IEEE/CVF International Conference on Computer Vision, pp. 3701–3711.
  75. Hetero-center loss for cross-modality person re-identification. Neurocomputing 386, 97–109.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Aihua Zheng (30 papers)
  2. Xianpeng Zhu (1 paper)
  3. Zhiqi Ma (5 papers)
  4. Chenglong Li (94 papers)
  5. Jin Tang (139 papers)
  6. Jixin Ma (4 papers)
Citations (13)