
Location-Free Scene Graph Generation (2303.10944v2)

Published 20 Mar 2023 in cs.CV

Abstract: Scene Graph Generation (SGG) is a visual understanding task, aiming to describe a scene as a graph of entities and their relationships with each other. Existing works rely on location labels in the form of bounding boxes or segmentation masks, increasing annotation costs and limiting dataset expansion. Recognizing that many applications do not require location data, we break this dependency and introduce location-free scene graph generation (LF-SGG). This new task aims at predicting instances of entities, as well as their relationships, without the explicit calculation of their spatial localization. To objectively evaluate the task, the predicted and ground truth scene graphs need to be compared. We solve this NP-hard problem through an efficient branching algorithm. Additionally, we design the first LF-SGG method, Pix2SG, using autoregressive sequence modeling. We demonstrate the effectiveness of our method on three scene graph generation datasets as well as two downstream tasks, image retrieval and visual question answering, and show that our approach is competitive with existing methods while not relying on location cues.
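
To illustrate why comparing two scene graphs without location labels is a matching problem, here is a minimal Python sketch. It is not the paper's branching algorithm, only a naive exhaustive search under assumptions of our own: each graph is a list of entity class names plus a list of `(subject_index, predicate, object_index)` triplets, and the score is the largest number of ground truth triplets recovered under any one-to-one, class-consistent assignment of predicted entities to ground truth entities. The function name `best_match` and the data layout are hypothetical.

```python
def best_match(pred_classes, pred_triplets, gt_classes, gt_triplets):
    """Max number of ground truth triplets matched by any injective,
    class-consistent mapping from predicted to ground truth entities.
    Naive exhaustive search (exponential time); illustration only."""
    gt_set = set(gt_triplets)
    best = 0

    def count_hits(mapping):
        # Translate predicted triplets into ground truth indices and
        # count the distinct ground truth triplets they reproduce.
        hits = set()
        for s, p, o in pred_triplets:
            if s in mapping and o in mapping:
                t = (mapping[s], p, mapping[o])
                if t in gt_set:
                    hits.add(t)
        return len(hits)

    def branch(i, mapping, used):
        nonlocal best
        if i == len(pred_classes):
            best = max(best, count_hits(mapping))
            return
        # Branch 1: leave predicted entity i unmatched.
        branch(i + 1, mapping, used)
        # Branch 2: try every unused ground truth entity of the same class.
        for j, cls in enumerate(gt_classes):
            if j not in used and cls == pred_classes[i]:
                mapping[i] = j
                used.add(j)
                branch(i + 1, mapping, used)
                used.remove(j)
                del mapping[i]

    branch(0, {}, set())
    return best


# Two people and a horse; the entity order differs between the graphs.
gt_classes = ["person", "horse", "person"]
gt_triplets = [(0, "riding", 1), (2, "next to", 1)]
pred_classes = ["person", "person", "horse"]
pred_triplets = [(1, "riding", 2), (0, "next to", 2)]

matched = best_match(pred_classes, pred_triplets, gt_classes, gt_triplets)
recall = matched / len(gt_triplets)
```

Because the two `person` instances are interchangeable without boxes, the search has to try both assignments before it can decide how many relationships were predicted correctly. That ambiguity between same-class instances is what makes the comparison expensive in general.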

Authors (7)
  1. Ege Özsoy (19 papers)
  2. Felix Holm (7 papers)
  3. Tobias Czempiel (20 papers)
  4. Nassir Navab (458 papers)
  5. Benjamin Busam (82 papers)
  6. Mahdi Saleh (18 papers)
  7. Chantal Pellegrini (15 papers)
Citations (3)