Banerjee, Ayan and Shivakumara, Palaiahnakote and Pal, Soumyajit and Pal, Umapada and Liu, Cheng-Lin (2022) DCT-DWT-FFT based method for text detection in underwater images. In: 6th Asian Conference on Pattern Recognition, ACPR 2021, 9-12 November 2021, Virtual, Online.
Full text not available from this repository.Abstract
Text detection in underwater images is an open challenge because of the distortions caused by refraction, absorption of light, particles, and variations depending on depth, color, and nature of water. Unlike existing methods aimed at text detection in natural scene images, in this paper, we have proposed a novel method for text detection in underwater images through a new enhancement model. Based on observations that fine details of text in image share with high energy, spatial resolution, and brightness, we consider Discrete Cosine Transform (DCT), Discrete Wavelet Transform (DWT), and Fast Fourier Transform (FFT) for image enhancement to highlight the text features. The enhanced image is fed to a modified Character Region Awareness for Text Detection (CRAFT) model to detect text in underwater images. To explore enhancement methods, we evaluate six combinations of image enhancement techniques, namely, DCT-DWT-FFT, DCT-FFT-DWT, DWT-DCT-FFT, DWT-FFT-DCT, FFT-DCT-DWT, FFT-DWTDCT. Experimental results on our dataset of underwater images and benchmark datasets of natural scene text detection, namely, MSRA-TD500, ICDAR 2019 MLT, ICDAR 2019 ArT, Total-Text, CTW1500, and COCO Text show that the proposed method performs well for both underwater and natural scene images and outperforms the existing methods on all the datasets.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Funders: | National Natural Science Foundation of China [Grant no. 61721004], National Key Research and Development Program of China [Grant no. 2018AAA0100400], Indian Statistical Institute |
Uncontrolled Keywords: | Under water images; Text detection in underwater images; Image enhancement; Discrete cosine transform; Wavelet transform; Fourier transform; Modified Character Region Awareness for Text Detection (CRAFT) |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Computer Science & Information Technology > Department of Computer System & Technology |
Depositing User: | Ms. Juhaida Abd Rahim |
Date Deposited: | 25 Feb 2025 04:47 |
Last Modified: | 25 Feb 2025 04:47 |
URI: | http://eprints.um.edu.my/id/eprint/41004 |
Actions (login required)
![]() |
View Item |