Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning (2004.13458v4)

Published 28 Apr 2020 in cs.CV

Abstract: Visual Similarity plays an important role in many computer vision applications. Deep metric learning (DML) is a powerful framework for learning such similarities which not only generalize from training data to identically distributed test distributions, but in particular also translate to unknown test classes. However, its prevailing learning paradigm is class-discriminative supervised training, which typically results in representations specialized in separating training classes. For effective generalization, however, such an image representation needs to capture a diverse range of data characteristics. To this end, we propose and study multiple complementary learning tasks, targeting conceptually different data relationships by only resorting to the available training samples and labels of a standard DML setting. Through simultaneous optimization of our tasks we learn a single model to aggregate their training signals, resulting in strong generalization and state-of-the-art performance on multiple established DML benchmark datasets.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Timo Milbich (15 papers)
  2. Karsten Roth (36 papers)
  3. Homanga Bharadhwaj (36 papers)
  4. Samarth Sinha (22 papers)
  5. Yoshua Bengio (601 papers)
  6. Björn Ommer (72 papers)
  7. Joseph Paul Cohen (50 papers)
Citations (65)

Summary

We haven't generated a summary for this paper yet.