
Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024 (2406.09201v3)

Published 13 Jun 2024 in cs.CV

Abstract: In this technical report, we present our findings from research conducted on the Vast Vocabulary Visual Detection (V3Det) dataset for the Supervised Vast Vocabulary Visual Detection task. Handling the complex categories and detection boxes is a key difficulty in this track, and the original supervised detector is not well suited to it. We have designed a series of improvements, including adjustments to the network structure, changes to the loss function, and the design of training strategies. Our model improves over the baseline and achieved excellent rankings on the leaderboard for both the Vast Vocabulary Object Detection (Supervised) track and the Open Vocabulary Object Detection (OVD) track of the V3Det Challenge 2024.

Authors (9)
  1. Peixi Wu (9 papers)
  2. Bosong Chai (5 papers)
  3. Xuan Nie (3 papers)
  4. Longquan Yan (3 papers)
  5. Zeyu Wang (137 papers)
  6. Qifan Zhou (3 papers)
  7. Boning Wang (2 papers)
  8. Yansong Peng (9 papers)
  9. Hebei Li (11 papers)
Citations (1)
