Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Compositional Sketch Search (2106.08009v1)

Published 15 Jun 2021 in cs.CV

Abstract: We present an algorithm for searching image collections using free-hand sketches that describe the appearance and relative positions of multiple objects. Sketch based image retrieval (SBIR) methods predominantly match queries containing a single, dominant object invariant to its position within an image. Our work exploits drawings as a concise and intuitive representation for specifying entire scene compositions. We train a convolutional neural network (CNN) to encode masked visual features from sketched objects, pooling these into a spatial descriptor encoding the spatial relationships and appearances of objects in the composition. Training the CNN backbone as a Siamese network under triplet loss yields a metric search embedding for measuring compositional similarity which may be efficiently leveraged for visual search by applying product quantization.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Alexander Black (11 papers)
  2. Tu Bui (21 papers)
  3. Long Mai (32 papers)
  4. Hailin Jin (53 papers)
  5. John Collomosse (52 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.