Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Efficient Scene Text Detection with Textual Attention Tower (2002.03741v1)

Published 30 Jan 2020 in cs.CV

Abstract: Scene text detection has received attention for years and achieved an impressive performance across various benchmarks. In this work, we propose an efficient and accurate approach to detect multioriented text in scene images. The proposed feature fusion mechanism allows us to use a shallower network to reduce the computational complexity. A self-attention mechanism is adopted to suppress false positive detections. Experiments on public benchmarks including ICDAR 2013, ICDAR 2015 and MSRA-TD500 show that our proposed approach can achieve better or comparable performances with fewer parameters and less computational cost.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Liang Zhang (357 papers)
  2. Yufei Liu (23 papers)
  3. Hang Xiao (31 papers)
  4. Lu Yang (82 papers)
  5. Guangming Zhu (17 papers)
  6. Syed Afaq Shah (3 papers)
  7. Mohammed Bennamoun (124 papers)
  8. Peiyi Shen (8 papers)
Citations (7)