Nandanwar, Lokesh and Shivakumara, Palaiahnakote and Ramachandra, Raghavendra and Lu, Tong and Pal, Umapada and Antonacopoulos, Apostolos and Lu, Yue (2022) A new deep wavefront based model for text localization in 3D video. IEEE Transactions on Circuits and Systems for Video Technology, 32 (6). pp. 3375-3389. ISSN 1051-8215, DOI https://doi.org/10.1109/TCSVT.2021.3110990.
Full text not available from this repository.Abstract
With the evolution of electronic devices, such as 3D cameras, addressing the challenges of text localization in 3D video (e.g., for indexing) is increasingly drawing the attention of the multimedia and video processing community. Existing methods focus on 2D video and their performance in the presence of the challenges in 3D video, such as shadow areas associated with text and irregularly sized and shaped text, degrades. This paper proposes the first approach that successfully addresses the challenges of 3D video in addition to those of 2D. It employs a number of innovations, among which, the first is the Generalized Gradient Vector Flow (GGVF) for dominant points detection. The second is the Wavefront concept for text candidate point detection from those dominant points. In addition, an Adaptive B-Spline Polygon Curve Network (ABS-Net) is proposed for accurate text localization in 3D videos by constructing tight fitting bounding polygons using text candidate points. Extensive experiments on custom (3D video) and standard datasets (2D video and scene text) show that the proposed method is practical and useful, and overall outperforms existing state-of-the-art methods.
Item Type: | Article |
---|---|
Funders: | National Natural Science Foundation of China (NSFC) (Grant No: 61672273), Ministry of Education, Malaysia (Grant No: FP104-2020), Technology Innovation Hub (TIH), Indian Statistical Institute (ISI), Kolkata |
Uncontrolled Keywords: | Three-dimensional displays; Location awareness; Feature extraction; Solid modeling; Image segmentation; Deep learning; Streaming media; Gradient vector flow; Wavefront; B-Spline curve fitting; Natural scene text detection; Text localization in 3D video |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Computer Science & Information Technology > Department of Computer System & Technology |
Depositing User: | Ms. Juhaida Abd Rahim |
Date Deposited: | 16 Oct 2023 04:02 |
Last Modified: | 16 Oct 2023 04:02 |
URI: | http://eprints.um.edu.my/id/eprint/42144 |
Actions (login required)
View Item |