Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
91 tokens/sec
GPT-4o
12 tokens/sec
Gemini 2.5 Pro Pro
o3 Pro
5 tokens/sec
GPT-4.1 Pro
15 tokens/sec
DeepSeek R1 via Azure Pro
33 tokens/sec
Gemini 2.5 Flash Deprecated
12 tokens/sec
2000 character limit reached

View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network (2403.14513v1)

Published 21 Mar 2024 in cs.CV

Abstract: Existing person re-identification methods have achieved remarkable advances in appearance-based identity association across homogeneous cameras, such as ground-ground matching. However, as a more practical scenario, aerial-ground person re-identification (AGPReID) among heterogeneous cameras has received minimal attention. To alleviate the disruption of discriminative identity representation by dramatic view discrepancy as the most significant challenge in AGPReID, the view-decoupled transformer (VDT) is proposed as a simple yet effective framework. Two major components are designed in VDT to decouple view-related and view-unrelated features, namely hierarchical subtractive separation and orthogonal loss, where the former separates these two features inside the VDT, and the latter constrains these two to be independent. In addition, we contribute a large-scale AGPReID dataset called CARGO, consisting of five/eight aerial/ground cameras, 5,000 identities, and 108,563 images. Experiments on two datasets show that VDT is a feasible and effective solution for AGPReID, surpassing the previous method on mAP/Rank1 by up to 5.0%/2.7% on CARGO and 3.7%/5.2% on AG-ReID, keeping the same magnitude of computational complexity. Our project is available at https://github.com/LinlyAC/VDT-AGPReID

Definition Search Book Streamline Icon: https://streamlinehq.com
References (47)
  1. Camera-driven representation learning for unsupervised domain adaptive person re-identification. In Int. Conf. Comput. Vis., pages 11453–11462, 2023.
  2. Weperson: Learning a generalized re-identification model from all-weather virtual data. In ACM Int. Conf. Multimedia, page 3115–3123, 2021.
  3. Uncertainty modeling with second-order transformer for group re-identification. In AAAI, volume 36, pages 3318–3325, 2022.
  4. Separable spatial-temporal residual graph for cloth-changing group re-identification. IEEE Trans. Pattern Anal. Mach. Intell., pages 1–16, 2024.
  5. Modeling 3d layout for group re-identification. In IEEE Conf. Comput. Vis. Pattern Recog., pages 7512–7520, 2022.
  6. Ac2as: Activation consistency coupled ann-snn framework for fast and memory-efficient snn training. Pattern Recognition, 2023.
  7. Region-based online selective examination for weakly supervised semantic segmentation. Information Fusion, page 102311, 2024.
  8. Exploring dual-task correlation for pose guided person image generation. In IEEE Conf. Comput. Vis. Pattern Recog., pages 7713–7722, June 2022.
  9. Salient part-aligned and keypoint disentangling transformer for person re-identification in aerial imagery. In Int. Conf. Multimedia and Expo, 2024.
  10. Pose guided person image generation via dual-task correlation and affinity learning. IEEE Trans. Vis. Comput. Graph., pages 1–18, 2023.
  11. Spike count maximization for neuromorphic vision recognition. In IJCAI, 2023.
  12. Formulating discrete probability flow through optimal transport. In Adv. Neural Inform. Process. Syst., 2023.
  13. Self-supervised image-specific prototype exploration for weakly supervised semantic segmentation. In IEEE Conf. Comput. Vis. Pattern Recog., pages 4288–4298, 2022.
  14. Dissecting person re-identification from the viewpoint of viewpoint. In IEEE Conf. Comput. Vis. Pattern Recog., 2019.
  15. Uncertainty modeling for group re-identification. Int. J. Comput. Vis., 2024.
  16. Simam: A simple, parameter-free attention module for convolutional neural networks. pages 11863–11874. PMLR, 2021.
  17. Scalable person re-identification: A benchmark. In Int. Conf. Comput. Vis., pages 1116–1124, 2015.
  18. Person re-identification in aerial imagery. IEEE Trans. Multimedia, 23:281–291, 2021.
  19. Learning modal-invariant angular metric by cyclic projection network for vis-nir person re-identification. IEEE Trans. Image Process., 30:8019–8033, 2021.
  20. Seeing like a human: Asynchronous learning with dynamic progressive refinement for person re-identification. IEEE Trans. Image Process., 31:352–365, 2022.
  21. Deep learning for person re-identification: A survey and outlook. IEEE Trans. Pattern Anal. Mach. Intell., 44(6):2872–2893, 2022.
  22. Aerial-ground person re-id. In Int. Conf. Multimedia and Expo, pages 2585–2590, 2023.
  23. Rotation exploration transformer for aerial person re-identification. In Int. Conf. Multimedia and Expo, 2024.
  24. Unrealperson: An adaptive pipeline towards costless person re-identification. In IEEE Conf. Comput. Vis. Pattern Recog., pages 11506–11515, 2021.
  25. Person transfer gan to bridge domain gap for person re-identification. In IEEE Conf. Comput. Vis. Pattern Recog., pages 79–88, 2018.
  26. Person re-identification using kernel-based metric learning methods. In Eur. Conf. Comput. Vis., pages 1–16, 2014.
  27. Person re-identification by local maximal occurrence representation and metric learning. In IEEE Conf. Comput. Vis. Pattern Recog., pages 2197–2206, 2015.
  28. Beyond part models: Person retrieval with refined part pooling (and A strong convolutional baseline). In Eur. Conf. Comput. Vis., pages 501–518, 2018.
  29. Transreid: Transformer-based object re-identification. In Int. Conf. Comput. Vis., pages 15013–15022, 2021.
  30. Uav-human: A large benchmark for human behavior understanding with unmanned aerial vehicles. In IEEE Conf. Comput. Vis. Pattern Recog., pages 16266–16275, 2021.
  31. Rotation invariant transformer for recognizing object in uavs. In ACM Int. Conf. Multimedia, page 2565–2574, 2022.
  32. Surpassing real-world source training data: Random 3d characters for generalizable person re-identification. In ACM Int. Conf. Multimedia, page 3422–3430, 2020.
  33. Cloning outfits from real-world images to 3d characters for generalizable person re-identification. In IEEE Conf. Comput. Vis. Pattern Recog., pages 4890–4899, 2022.
  34. An image is worth 16x16 words: Transformers for image recognition at scale. In Int. Conf. Learn. Represent., 2021.
  35. Swin transformer: Hierarchical vision transformer using shifted windows. In Int. Conf. Comput. Vis., pages 10012–10022, 2021.
  36. Fastreid: A pytorch toolbox for general instance re-identification. In ACM Int. Conf. Multimedia, page 9664–9667, 2023.
  37. Learning part-based convolutional features for person re-identification. IEEE Trans. Pattern Anal. Mach. Intell., 43:902–917, 2021.
  38. Bag of tricks and a strong baseline for deep person re-identification. In IEEE Conf. Comput. Vis. Pattern Recog. Worksh., 2019.
  39. Learning discriminative features with multiple granularities for person re-identification. In ACM Int. Conf. Multimedia, pages 274–282, 2018.
  40. Vehicle re-identification: an efficient baseline using triplet embedding. In International Joint Conference on Neural Networks, pages 1–9, 2019.
  41. A strong and efficient baseline for vehicle re-identification using deep triplet embedding. Journal of Artificial Intelligence and Soft Computing Research, 10(1):27–45, 2020.
  42. Learning generalisable omni-scale representations for person re-identification. IEEE Trans. Pattern Anal. Mach. Intell., 44(9):5056–5069, 2022.
  43. Makehuman: a review of the modelling framework. In Congress of the International Ergonomics Association, pages 224–232, 2018.
  44. Unity Technologies. Unity3D: Cross-platform 3D engine, 2021.
  45. Imagenet: A large-scale hierarchical image database. In IEEE Conf. Comput. Vis. Pattern Recog., pages 248–255, 2009.
  46. Léon Bottou. Stochastic gradient descent tricks. Neural Networks: Tricks of the Trade: Second Edition, pages 421–436, 2012.
  47. Pytorch: An imperative style, high-performance deep learning library. In Adv. Neural Inform. Process. Syst., pages 8024–8035, 2019.
Citations (6)

Summary

We haven't generated a summary for this paper yet.