
1st Solution Places for CVPR 2023 UG$^{2}$+ Challenge Track 2.1-Text Recognition through Atmospheric Turbulence (2306.08963v1)

Published 15 Jun 2023 in cs.CV

Abstract: In this technical report, we present the solution developed by our team VIELab-HUST for text recognition through atmospheric turbulence in Track 2.1 of the CVPR 2023 UG$^{2}$+ challenge. Our solution is an efficient multi-stage framework that restores a high-quality image from distorted frames. First, a sharpness-based frame selection algorithm picks the sharpest subset of the distorted frames. Next, each selected frame is aligned through optical-flow-based image registration to suppress geometric distortion. Then, a region-based image fusion method using the DT-CWT mitigates the blur caused by the turbulence. Finally, a learning-based artifact-removal method removes the artifacts in the fused image, generating a high-quality output. Our framework handles both the hot-air text dataset and the turbulence text dataset provided in the final testing phase, and it achieved 1st place in text recognition accuracy. Our code will be available at https://github.com/xsqhust/Turbulence_Removal.
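The first stage described in the abstract, sharpness-based frame selection, can be illustrated with a minimal sketch. The abstract does not say which sharpness measure the team used, so the variance of a discrete Laplacian below is an assumption, and the names `laplacian_variance`, `select_sharpest`, and `box_blur` are hypothetical helpers introduced only for this illustration. The synthetic frames stand in for real turbulence-distorted frames.

```python
import numpy as np


def laplacian_variance(frame: np.ndarray) -> float:
    """Sharpness score: variance of a 4-neighbour discrete Laplacian.

    This metric is an assumption; the report's actual measure may differ.
    """
    f = frame.astype(np.float64)
    # Laplacian on interior pixels: sum of the 4 neighbours minus 4x the centre.
    lap = (f[:-2, 1:-1] + f[2:, 1:-1] + f[1:-1, :-2] + f[1:-1, 2:]
           - 4.0 * f[1:-1, 1:-1])
    return float(lap.var())


def select_sharpest(frames, k):
    """Return the indices of the k sharpest frames, sharpest first."""
    scores = np.array([laplacian_variance(f) for f in frames])
    order = np.argsort(-scores, kind="stable")
    return order[:k].tolist()


def box_blur(frame: np.ndarray) -> np.ndarray:
    """3x3 mean filter with edge padding, used only to fabricate blurry frames."""
    p = np.pad(frame.astype(np.float64), 1, mode="edge")
    h, w = frame.shape
    return sum(p[i:i + h, j:j + w] for i in range(3) for j in range(3)) / 9.0


# Synthetic stand-ins for distorted frames: one sharp frame, two blurred copies.
rng = np.random.default_rng(0)
sharp = rng.random((32, 32))
blurred_once = box_blur(sharp)
blurred_twice = box_blur(blurred_once)

frames = [blurred_twice, sharp, blurred_once]
selected = select_sharpest(frames, k=2)
print(selected)  # indices of the two sharpest frames
```

In this toy setup the unblurred frame ranks first and the once-blurred frame second. In a pipeline like the one the abstract describes, the selected frames would then be passed on to registration and fusion.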

