Towards Improved Proxy-based Deep Metric Learning via Data-Augmented Domain Adaptation (2401.00617v1)
Abstract: Deep Metric Learning (DML) plays an important role in modern computer vision research: it learns a distance metric over a set of image representations. Recent DML techniques use proxies that interact with the corresponding image samples in the embedding space. However, existing proxy-based DML methods focus on learning individual proxy-to-sample distances, while the overall distribution of samples and proxies receives little attention. In this paper, we present a novel proxy-based DML framework that aligns the sample and proxy distributions to improve the efficiency of proxy-based DML losses. Specifically, we propose the Data-Augmented Domain Adaptation (DADA) method to close the domain gap between the group of samples and the group of proxies. To the best of our knowledge, we are the first to leverage domain adaptation to boost the performance of proxy-based DML. We show that our method can be easily plugged into existing proxy-based DML losses. Our experiments on popular benchmarks, including CUB-200-2011, CARS196, Stanford Online Products, and In-Shop Clothes Retrieval, show that our learning algorithm significantly improves existing proxy losses and achieves superior results compared to existing methods.
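To make the idea of "a proxy loss plus a distribution-alignment term" concrete, the following is a minimal NumPy sketch and not the DADA method itself. It assumes a softmax-style proxy loss (in the spirit of Proxy-NCA) and a hypothetical linear discriminator that tries to tell samples from proxies. The function names, the linear discriminator, the weight `lam`, and the sign convention of the combined objective are all illustrative assumptions. The abstract does not specify these details, and the data-augmentation component of DADA is not modeled here.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    # Project each row onto the unit sphere.
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)

def proxy_nca_loss(embeddings, labels, proxies):
    # Softmax over negative squared distances to all proxies;
    # each sample is pulled toward the proxy of its own class.
    x = l2_normalize(embeddings)
    p = l2_normalize(proxies)
    d = ((x[:, None, :] - p[None, :, :]) ** 2).sum(axis=2)  # shape (N, C)
    logits = -d
    logits = logits - logits.max(axis=1, keepdims=True)      # stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -log_prob[np.arange(len(labels)), labels].mean()

def domain_discriminator_loss(embeddings, proxies, w, b):
    # Binary cross-entropy of a linear discriminator (assumed form):
    # samples are labeled 1, proxies are labeled 0.
    feats = np.concatenate([l2_normalize(embeddings), l2_normalize(proxies)])
    targets = np.concatenate([np.ones(len(embeddings)), np.zeros(len(proxies))])
    z = feats @ w + b
    # Numerically stable BCE-with-logits.
    bce = np.maximum(z, 0) - z * targets + np.log1p(np.exp(-np.abs(z)))
    return bce.mean()

def combined_objective(embeddings, labels, proxies, w, b, lam=0.1):
    # Objective for the embeddings and proxies: minimize the proxy loss while
    # making the discriminator's task harder (adversarial sign, assumed).
    return (proxy_nca_loss(embeddings, labels, proxies)
            - lam * domain_discriminator_loss(embeddings, proxies, w, b))
```

In an adversarial setup of this kind, the discriminator parameters `w` and `b` would be updated to minimize `domain_discriminator_loss`, while the embedding network and proxies would be updated on `combined_objective`. A discriminator that cannot separate the two groups suggests the sample and proxy distributions overlap.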