Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Triple Correlations-Guided Label Supplementation for Unbiased Video Scene Graph Generation (2307.16309v1)

Published 30 Jul 2023 in cs.CV

Abstract: Video-based scene graph generation (VidSGG) is an approach that aims to represent video content in a dynamic graph by identifying visual entities and their relationships. Due to the inherently biased distribution and missing annotations in the training data, current VidSGG methods have been found to perform poorly on less-represented predicates. In this paper, we propose an explicit solution to address this under-explored issue by supplementing missing predicates that should be appear in the ground-truth annotations. Dubbed Trico, our method seeks to supplement the missing predicates by exploring three complementary spatio-temporal correlations. Guided by these correlations, the missing labels can be effectively supplemented thus achieving an unbiased predicate predictions. We validate the effectiveness of Trico on the most widely used VidSGG datasets, i.e., VidVRD and VidOR. Extensive experiments demonstrate the state-of-the-art performance achieved by Trico, particularly on those tail predicates.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Wenqing Wang (22 papers)
  2. Kaifeng Gao (11 papers)
  3. Yawei Luo (40 papers)
  4. Tao Jiang (274 papers)
  5. Fei Gao (458 papers)
  6. Jian Shao (29 papers)
  7. Jianwen Sun (18 papers)
  8. Jun Xiao (134 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.