
Multi-Instance Visual-Semantic Embedding (1512.06963v1)

Published 22 Dec 2015 in cs.CV

Abstract: Visual-semantic embedding models have recently been proposed and shown to be effective for image classification and zero-shot learning, by mapping images into a continuous semantic label space. Although several approaches have been proposed for single-label embedding tasks, handling images with multiple labels (a more general setting) remains an open problem, mainly due to the complex underlying correspondence between an image and its labels. In this work, we present the Multi-Instance visual-semantic Embedding model (MIE) for embedding images associated with either single or multiple labels. Our model discovers semantically meaningful image subregions and maps them to their corresponding labels. We demonstrate the superiority of our method over the state of the art on two tasks: multi-label image annotation and zero-shot learning.
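
The idea of matching image subregions to labels in a shared semantic space can be illustrated with a toy example. This is only a sketch of the general principle, not the paper's model: the function names `cosine` and `rank_labels`, the three-dimensional vectors, and the rule of scoring each label by its best-matching region are all assumptions made for illustration. It assumes the region features have already been mapped into the label embedding space.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_labels(region_embeddings, label_embeddings):
    """Rank labels by their best-matching image subregion.

    Hypothetical scoring rule: a label's score is the highest cosine
    similarity between its embedding and any region embedding.
    """
    scores = {
        label: max(cosine(region, vec) for region in region_embeddings)
        for label, vec in label_embeddings.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Toy label embeddings (made-up values, not learned word vectors).
label_embeddings = {
    "dog": [1.0, 0.0, 0.0],
    "frisbee": [0.0, 1.0, 0.0],
    "car": [0.0, 0.0, 1.0],
}

# Two subregions of one image, already projected into the label space
# (also made-up values).
region_embeddings = [
    [0.9, 0.1, 0.0],
    [0.1, 0.8, 0.1],
]

ranking = rank_labels(region_embeddings, label_embeddings)
print(ranking)
```

Because each label is scored against every subregion, an image can rank several labels highly at once, one per matching region. A label never seen during training can be scored the same way as long as it has an embedding in the space, which is what makes this kind of model usable for zero-shot learning.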

Authors (5)
  1. Zhou Ren (17 papers)
  2. Hailin Jin (53 papers)
  3. Zhe Lin (164 papers)
  4. Chen Fang (157 papers)
  5. Alan Yuille (295 papers)
Citations (36)
