Text Region Multiple Information Perception Network for Scene Text Detection (2401.10017v1)
Abstract: Segmentation-based scene text detection algorithms can handle arbitrary shape scene texts and have strong robustness and adaptability, so it has attracted wide attention. Existing segmentation-based scene text detection algorithms usually only segment the pixels in the center region of the text, while ignoring other information of the text region, such as edge information, distance information, etc., thus limiting the detection accuracy of the algorithm for scene text. This paper proposes a plug-and-play module called the Region Multiple Information Perception Module (RMIPM) to enhance the detection performance of segmentation-based algorithms. Specifically, we design an improved module that can perceive various types of information about scene text regions, such as text foreground classification maps, distance maps, direction maps, etc. Experiments on MSRA-TD500 and TotalText datasets show that our method achieves comparable performance with current state-of-the-art algorithms.
- “Ms-rocanet: Multi-scale residual orthogonal-channel attention network for scene text detection,” in ICASSP, 2022, pp. 2200–2204.
- “Hierarchical refined attention for scene text recognition,” in ICASSP, 2021, pp. 4175–4179.
- “Real-time scene text detection with differentiable binarization,” in AAAI, 2020, pp. 11474–11481.
- Y. Liu and L. Jin, “Deep matching prior network: Toward tighter multi-oriented text detection,” in CVPR, 2017, pp. 3454–3461.
- “Cmfn: Cross-modal fusion network for irregular scene text recognition,” in ICONIP, 2024, pp. 421–433.
- L. Zhang and H. Fan, “Visual object tracking: Progress, challenge, and future,” The Innovation, vol. 4, 2023.
- “Artificial intelligence: A powerful paradigm for scientific research,” The Innovation, vol. 2, no. 4, pp. 100179, 2021.
- “Real-time scene text detection with differentiable binarization and adaptive scale fusion,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 1, pp. 919–931, 2023.
- “Turning a clip model into a scene text detector,” in CVPR, 2023, pp. 6978–6988.
- “Single shot text detector with regional attention,” in ICCV, Oct 2017.
- “Deep direct regression for multi-oriented scene text detection,” in ICCV, 2017, pp. 745–753.
- “East: An efficient and accurate scene text detector,” in CVPR, July 2017.
- “Textboxes++: A single-shot oriented scene text detector,” IEEE Transactions on Image Processing, vol. 27, no. 8, pp. 3676–3690, 2018.
- “Feature pyramid networks for object detection,” in CVPR, 2017, pp. 936–944.
- Bala R. Vatti, “A generic solution to polygon clipping,” Commun. ACM, vol. 35, no. 7, pp. 56–63, jul 1992.
- “Textsnake: A flexible representation for detecting text of arbitrary shapes,” in ECCV, 2018, pp. 19–35.
- “Shape robust text detection with progressive scale expansion network,” 2018, pp. 9328–9337.
- “Character region awareness for text detection,” in CVPR, 2019, pp. 9357–9366.
- “Deep relational reasoning graph network for arbitrary shape text detection,” in CVPR, 2020.
- “Fourier contour embedding for arbitrary-shaped text detection,” in CVPR, 2021, pp. 3122–3130.
- “Progressive contour regression for arbitrary-shape scene text detection,” in CVPR, 2021, pp. 7389–7398.
- “Synthetic data for text localisation in natural images,” in CVPR, 2016.
- C. Ch’ng and C. Chan, “Total-text: A comprehensive dataset for scene text detection and recognition,” in ICDAR, 2017, vol. 01, pp. 935–942.
- “Detecting texts of arbitrary orientations in natural images,” in CVPR, 2012, pp. 1083–1090.
- “A unified framework for multioriented text detection and recognition,” IEEE Transactions on Image Processing, vol. 23, no. 11, pp. 4737–4749, 2014.
- “Learning shape-aware embedding for scene text detection,” in CVPR, 2019, pp. 4229–4238.
- “Most: A multi-oriented scene text detector with localization refinement,” in CVPR, 2021, pp. 8809–8818.
- “Fc2rn: A fully convolutional corner refinement network for accurate multi-oriented scene text detection,” in ICASSP, 2021, pp. 4350–4354.
- Jinzhi Zheng (3 papers)
- Libo Zhang (105 papers)
- Yanjun Wu (26 papers)
- Chen Zhao (249 papers)