Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Enhancing Long-Term Person Re-Identification Using Global, Local Body Part, and Head Streams (2403.02892v1)

Published 5 Mar 2024 in cs.CV and cs.AI

Abstract: This work addresses the task of long-term person re-identification. Typically, person re-identification assumes that people do not change their clothes, which limits its applications to short-term scenarios. To overcome this limitation, we investigate long-term person re-identification, which considers both clothes-changing and clothes-consistent scenarios. In this paper, we propose a novel framework that effectively learns and utilizes both global and local information. The proposed framework consists of three streams: global, local body part, and head streams. The global and head streams encode identity-relevant information from an entire image and a cropped image of the head region, respectively. Both streams encode the most distinct, less distinct, and average features using the combinations of adversarial erasing, max pooling, and average pooling. The local body part stream extracts identity-related information for each body part, allowing it to be compared with the same body part from another image. Since body part annotations are not available in re-identification datasets, pseudo-labels are generated using clustering. These labels are then utilized to train a body part segmentation head in the local body part stream. The proposed framework is trained by backpropagating the weighted summation of the identity classification loss, the pair-based loss, and the pseudo body part segmentation loss. To demonstrate the effectiveness of the proposed method, we conducted experiments on three publicly available datasets (Celeb-reID, PRCC, and VC-Clothes). The experimental results demonstrate that the proposed method outperforms the previous state-of-the-art method.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (60)
  1. Cloth-changing person re-identification with self-attention, in: 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW), pp. 602–610. doi:10.1109/WACVW54805.2022.00066.
  2. Openpose: Realtime multi-person 2d pose estimation using part affinity fields. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 172–186. doi:10.1109/TPAMI.2019.2929257.
  3. Weed mapping in multispectral drone imagery using lightweight vision transformers. Neurocomputing 562, 126914. doi:https://doi.org/10.1016/j.neucom.2023.126914.
  4. Multi-level factorisation net for person re-identification, in: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2109–2118. doi:10.1109/CVPR.2018.00225.
  5. Learning 3d shape feature for texture-insensitive person re-identification, in: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 8142–8151. doi:10.1109/CVPR46437.2021.00805.
  6. Xception: Deep learning with depthwise separable convolutions, in: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1800–1807. doi:10.1109/CVPR.2017.195.
  7. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 .
  8. Disentangled representations for short-term and long-term person re-identification. IEEE Transactions on Pattern Analysis and Machine Intelligence 44, 8975–8991. doi:10.1109/TPAMI.2021.3122444.
  9. Horizontal pyramid matching for person re-identification, in: Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, AAAI Press. doi:10.1609/aaai.v33i01.33018295.
  10. Clothes-changing person re-identification with rgb modality only, in: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1050–1059. doi:10.1109/CVPR52688.2022.00113.
  11. Densepose: Dense human pose estimation in the wild, in: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7297–7306. doi:10.1109/CVPR.2018.00762.
  12. Spatial complementary and self-repair learning for occluded person re-identification. Neurocomputing 546, 126360. doi:https://doi.org/10.1016/j.neucom.2023.126360.
  13. Bag of tricks for image classification with convolutional neural networks, in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 558–567. doi:10.1109/CVPR.2019.00065.
  14. Lightweight multi-branch network for person re-identification, in: 2021 IEEE International Conference on Image Processing (ICIP), pp. 1129–1133. doi:10.1109/ICIP42928.2021.9506733.
  15. Fine-grained shape-appearance mutual learning for cloth-changing person re-identification, in: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10508–10517. doi:10.1109/CVPR46437.2021.01037.
  16. Iefm and ids: Enhancing 3d environment perception via information encoding in indoor point cloud semantic segmentation. Neurocomputing 563, 126944. doi:https://doi.org/10.1016/j.neucom.2023.126944.
  17. Celebrities-reid: A benchmark for clothes variation in long-term person re-identification, in: 2019 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. doi:10.1109/IJCNN.2019.8851957.
  18. Clothing status awareness for long-term person re-identification, in: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 11875–11884. doi:10.1109/ICCV48922.2021.01168.
  19. Beyond scalar neuron: Adopting vector-neuron capsules for long-term person re-identification. IEEE Transactions on Circuits and Systems for Video Technology 30, 3459–3471. doi:10.1109/TCSVT.2019.2948093.
  20. Cloth-changing person re-identification from a single image with gait prediction and regularization, in: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14258–14267. doi:10.1109/CVPR52688.2022.01388.
  21. Depth-adaptive deep neural network for semantic segmentation. IEEE Transactions on Multimedia 20, 2478–2490. doi:10.1109/TMM.2018.2798282.
  22. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems 25.
  23. Where to look: Multi-granularity occlusion aware for video person re-identification. Neurocomputing 536, 137–151. doi:https://doi.org/10.1016/j.neucom.2023.03.003.
  24. Deepreid: Deep filter pairing neural network for person re-identification, in: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 152–159. doi:10.1109/CVPR.2014.27.
  25. Harmonious attention network for person re-identification, in: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2285–2294. doi:10.1109/CVPR.2018.00243.
  26. Learning shape representations for person re-identification under clothing change, in: 2021 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 2431–2440. doi:10.1109/WACV48630.2021.00248.
  27. SGDR: stochastic gradient descent with restarts. CoRR abs/1608.03983. arXiv:1608.03983.
  28. On exploring pose estimation as an auxiliary learning task for visible–infrared person re-identification. Neurocomputing 556, 126652. doi:https://doi.org/10.1016/j.neucom.2023.126652.
  29. Incremental class discovery for semantic segmentation with rgbd sensing, in: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 972–981. doi:10.1109/ICCV.2019.00106.
  30. Person recognition in personal photo collections. IEEE Transactions on Pattern Analysis and Machine Intelligence 42, 203–220. doi:10.1109/TPAMI.2018.2877588.
  31. Multi-scale deep learning architectures for person re-identification, in: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 5409–5418. doi:10.1109/ICCV.2017.577.
  32. Long-term cloth-changing person re-identification, in: Ishikawa, H., Liu, C.L., Pajdla, T., Shi, J. (Eds.), Computer Vision – ACCV 2020, Springer International Publishing, Cham. pp. 71–88.
  33. Top-db-net: Top dropblock for activation enhancement in person re-identification, in: 2020 25th International Conference on Pattern Recognition (ICPR), pp. 2980–2987. doi:10.1109/ICPR48806.2021.9412017.
  34. Performance measures and a data set for multi-target, multi-camera tracking, in: Hua, G., Jégou, H. (Eds.), Computer Vision – ECCV 2016 Workshops, Springer International Publishing, Cham. pp. 17–35.
  35. Dynamic routing between capsules, in: Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R. (Eds.), Advances in Neural Information Processing Systems, Curran Associates, Inc.
  36. Iranet: Identity-relevance aware representation for cloth-changing person re-identification. Image and Vision Computing 117, 104335. doi:https://doi.org/10.1016/j.imavis.2021.104335.
  37. Two-stream convolutional networks for action recognition in videos, in: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N., Weinberger, K. (Eds.), Advances in Neural Information Processing Systems, Curran Associates, Inc.
  38. Mask-guided contrastive attention and two-stream metric co-learning for person re-identification. Neurocomputing 465, 561–573. doi:https://doi.org/10.1016/j.neucom.2021.09.038.
  39. Part-aligned bilinear representations for person re-identification, in: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (Eds.), Computer Vision – ECCV 2018, Springer International Publishing, Cham. pp. 418–437.
  40. Deep high-resolution representation learning for human pose estimation, in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5686–5696. doi:10.1109/CVPR.2019.00584.
  41. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline), in: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (Eds.), Computer Vision – ECCV 2018, Springer International Publishing, Cham. pp. 501–518.
  42. Going deeper with convolutions, in: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9. doi:10.1109/CVPR.2015.7298594.
  43. Pyramidbox: A context-assisted single shot face detector, in: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (Eds.), Computer Vision – ECCV 2018, Springer International Publishing, Cham. pp. 812–828.
  44. When person re-identification meets changing clothes, in: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 3620–3628. doi:10.1109/CVPRW50498.2020.00423.
  45. Learning discriminative features with multiple granularities for person re-identification, in: Proceedings of the 26th ACM International Conference on Multimedia, Association for Computing Machinery, New York, NY, USA. p. 274–282. doi:10.1145/3240508.3240552.
  46. Deep high-resolution representation learning for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 3349–3364. doi:10.1109/TPAMI.2020.2983686.
  47. Multi-similarity loss with general pair weighting for deep metric learning, in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5017–5025. doi:10.1109/CVPR.2019.00516.
  48. Adversarial feature disentanglement for long-term person re-identification, in: Zhou, Z.H. (Ed.), Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21, International Joint Conferences on Artificial Intelligence Organization. pp. 1201–1207. doi:10.24963/ijcai.2021/166. main Track.
  49. Person re-identification by contour sketch under moderate clothing change. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 2029–2046. doi:10.1109/TPAMI.2019.2960509.
  50. Sampling agnostic feature representation for long-term person re-identification. IEEE Transactions on Image Processing 31, 6412–6423. doi:10.1109/TIP.2022.3207024.
  51. Abnormal event detection for video surveillance using an enhanced two-stream fusion method. Neurocomputing 553, 126561. doi:https://doi.org/10.1016/j.neucom.2023.126561.
  52. Dual-granularity feature alignment for cross-modality person re-identification. Neurocomputing 511, 78–90. doi:https://doi.org/10.1016/j.neucom.2022.09.077.
  53. Cocas: A large-scale clothes changing person dataset for re-identification, in: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3397–3406. doi:10.1109/CVPR42600.2020.00346.
  54. Unauthorized access detection system to the equipments in a room based on the persons identification by face recognition. Engineering Applications of Artificial Intelligence 124, 106637. doi:https://doi.org/10.1016/j.engappai.2023.106637.
  55. Cross-modal attention fusion network for rgb-d semantic segmentation. Neurocomputing 548, 126389. doi:https://doi.org/10.1016/j.neucom.2023.126389.
  56. Scalable person re-identification: A benchmark, in: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1116–1124. doi:10.1109/ICCV.2015.133.
  57. Joint discriminative and generative learning for person re-identification, in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2133–2142. doi:10.1109/CVPR.2019.00224.
  58. Random erasing data augmentation. Proceedings of the AAAI Conference on Artificial Intelligence 34, 13001–13008. doi:10.1609/aaai.v34i07.7000.
  59. Omni-scale feature learning for person re-identification, in: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 3701–3711. doi:10.1109/ICCV.2019.00380.
  60. Identity-guided human semantic parsing for person re-identification, in: European Conference on Computer Vision, Springer. pp. 346–363.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Duy Tran Thanh (1 paper)
  2. Yeejin Lee (15 papers)
  3. Byeongkeun Kang (22 papers)
Citations (1)