Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Knowledge-augmented Few-shot Visual Relation Detection (2303.05342v1)

Published 9 Mar 2023 in cs.CV and cs.AI

Abstract: Visual Relation Detection (VRD) aims to detect relationships between objects for image understanding. Most existing VRD methods rely on thousands of training samples of each relationship to achieve satisfactory performance. Some papers tackle this problem by few-shot learning with elaborately designed pipelines and pre-trained word vectors. However, the performance of existing few-shot VRD models is severely hampered by the poor generalization capability, as they struggle to handle the vast semantic diversity of visual relationships. Nonetheless, humans have the ability to learn new relationships with just few examples based on their knowledge. Inspired by this, we devise a knowledge-augmented, few-shot VRD framework leveraging both textual knowledge and visual relation knowledge to improve the generalization ability of few-shot VRD. The textual knowledge and visual relation knowledge are acquired from a pre-trained LLM and an automatically constructed visual relation knowledge graph, respectively. We extensively validate the effectiveness of our framework. Experiments conducted on three benchmarks from the commonly used Visual Genome dataset show that our performance surpasses existing state-of-the-art models with a large improvement.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (11)
  1. Tianyu Yu (20 papers)
  2. Yangning Li (49 papers)
  3. Jiaoyan Chen (85 papers)
  4. Yinghui Li (65 papers)
  5. Hai-Tao Zheng (94 papers)
  6. Xi Chen (1036 papers)
  7. Qingbin Liu (13 papers)
  8. Wenqiang Liu (18 papers)
  9. Dongxiao Huang (1 paper)
  10. Bei Wu (6 papers)
  11. Yexin Wang (16 papers)
Citations (4)

Summary

We haven't generated a summary for this paper yet.