Chenwei Cui
Conceptual Text Region Network: Cognition-Inspired Accurate Scene Text Detection
Cui, Chenwei; Lu, Liangfu; Tan, Zhiyuan; Hussain, Amir
Authors
Liangfu Lu
Dr Thomas Tan Z.Tan@napier.ac.uk
Associate Professor
Prof Amir Hussain A.Hussain@napier.ac.uk
Professor
Abstract
Segmentation-based methods are widely used for scene text detection due to their superiority in describing arbitrary-shaped text instances. However, two major problems still exist: (1) current label generation techniques are mostly empirical and lack theoretical support, discouraging elaborate label design; and (2) as a result, most methods rely heavily on text kernel segmentation which is unstable and requires deliberate tuning. To address these challenges, we propose a human cognition-inspired framework, termed, Conceptual Text Region Network (CTRNet). The framework utilizes Conceptual Text Regions (CTRs), which is a class of cognition-based tools inheriting good mathematical properties, allowing for sophisticated label design. Another component of CTRNet is an inference pipeline that, with the help of CTRs, completely omits the need for text kernel segmentation. Compared with previous segmentation-based methods, our approach is not only more interpretable but also more accurate. Experimental results show that CTRNet achieves state-of-the-art performance on benchmark CTW1500, Total-Text, MSRA-TD500, and ICDAR 2015 datasets, yielding performance gains of up to 2.0%. Notably, to the best of our knowledge, CTRNet is among the first detection models to achieve F-measures higher than 85.0% on all four of the benchmarks, demonstrating remarkable consistency and stability.
Citation
Cui, C., Lu, L., Tan, Z., & Hussain, A. (2021). Conceptual Text Region Network: Cognition-Inspired Accurate Scene Text Detection. Neurocomputing, 464, 252-264. https://doi.org/10.1016/j.neucom.2021.08.026
Journal Article Type | Article |
---|---|
Acceptance Date | Aug 9, 2021 |
Online Publication Date | Aug 11, 2021 |
Publication Date | 2021-11 |
Deposit Date | Aug 9, 2021 |
Publicly Available Date | Aug 12, 2022 |
Print ISSN | 0925-2312 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 464 |
Pages | 252-264 |
DOI | https://doi.org/10.1016/j.neucom.2021.08.026 |
Keywords | Scene text detection, Arbitrary-shaped text detection, Neural networks, Semantic segmentation |
Public URL | http://researchrepository.napier.ac.uk/Output/2792548 |
Files
Conceptual Text Region Network: Cognition-Inspired Accurate Scene Text Detection (accepted version)
(23.9 Mb)
PDF
Licence
http://creativecommons.org/licenses/by-nc-nd/4.0/
Copyright Statement
Accepted version licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.
You might also like
Detection of Ransomware
(2024)
Patent
Machine Un-learning: An Overview of Techniques, Applications, and Future Directions
(2023)
Journal Article
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search