Abstract
Visual attention models have been used to recognize the most prominent region in a natural scene. These regions are going to pull the human attention. The state-of-art models keep on under-predicting the significant image regions having text. These are specifically the regions with most noteworthy semantic significance in a natural scene and turn out to be useful for saliency-based applications like image classification and captioning. The text or character detection as a salient region in image remains a challenging research problem. Text contents within the scene convey vital information about the image. For example, signboard content conveys the important information for visually impaired person. In this paper, we have proposed a new model for salient text detection in a natural scene. In the proposed model, we integrate saliency model with the segmentation and text detection approach in a natural scene to generate the text saliency. The experimental outcomes in ROC curve and DET curves illustrate that the proposed model outperformed the state-of-art methods for detection of salient text content from a natural scene.
Similar content being viewed by others
References
Achanta R, Süsstrunk S (2010) ‘Saliency detection using maximum symmetric surround’, In Image processing (ICIP), 2010 17th IEEE international conference on 2653–2656
Achanta R, Hemami S, Estrada F, Susstrunk S, (2009) ‘Frequency-tuned salient region detection’, In Computer vision and pattern recognition, 2009, IEEE conference on 1597–1604
Bylinskii Z, Recasens A, Borji A, Oliva A, Torralba A, Durand F (2016) ‘Where should saliency models look next?’, In European Conference on Computer Vision 809–824
Chen H, Tsai SS, Schroth G, Chen DM, Grzeszczuk R, Girod B (2011) ‘Robust text detection in natural images with edge-enhanced maximally stable extremal regions’, In Image Processing (ICIP), 2011 18th IEEE International Conference on 2609–2612
Gao R, Uchida S, Shahab A, Shafait F, Frinken V (2014) Visual saliency models for text detection in real world. PLoS One 9(12):e114539
Gupta N, Jalal AS (2017). A comparison of visual attention models for the salient text content detection in natural scene. In Information and Communication Technology (CICT), 2017 Conference on (1–5). IEEE
Harel J, Koch C, Perona P (2007) ‘Graph-based visual saliency’, In Advances in neural information processing systems 545–552
Hou X, Zhang L (2007) ‘Saliency detection: A spectral residual approach’, In Computer Vision and Pattern Recognition, 2007, IEEE Conference on 1–8
Huang, W., Qiao, Y. and Tang, X., (2014) ‘Robust scene text detection with convolution neural network induced mser trees’, In European Conference on Computer Vision (497–511)
Itti L, Koch C, Niebur E (1998) A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Anal Mach Intell 20(11):1254–1259
Jian M, Lam KM, Dong J (2014) Facial-feature detection and localization based on a hierarchical scheme. Inf Sci 262:1–14
Jian M, Lam KM, Dong J, Shen L (2015) Visual-patch-attention-aware saliency detection. IEEE Trans Cybernet 45(8):1575–1586
Jian M, Qi Q, Dong J, Sun X, Sun Y, Lam KM (2017). Saliency detection using quaternionic distance based weber local descriptor and level priors. Multimedia Tools and Applications, 1-18
Jian M, Qi Q, Dong J, Yin Y, Lam KM (2018) Integrating QDWD with pattern distinctness and local contrast for underwater saliency detection. J Vis Commun Image Represent 53:31–41
Judd T, Ehinger K, Durand F, Torralba A (2009) ‘Learning to predict where humans look’, In Computer Vision, 2009 IEEE 12th international conference on 2106–2113
Malmer T (2010) Image segmentation using GrabCut. IEEE Trans Signal Process 5(1):1–7
Manke R, Jalal AS (2014) Poisson-distribution-based approach for salient region detection. Electron Lett 51(1):37–38
Matas J, Chum O, Urban M, Pajdla T (2004) Robust wide-baseline stereo from maximally stable extremal regions. Image Vis Comput 22(10):761–767
Nistér D, Stewénius H, (2008) ‘Linear time maximally stable extremal regions’, In European Conference on Computer Vision (pp. 183–196)
Rahtu E, Heikkilä J (2009) ‘A simple and efficient saliency detector for background subtraction’. In Computer Vision Workshops (ICCV Workshops), 2009 IEEE 12th International Conference on 1137–1144
Rother C, Kolmogorov V, Blake A (2004) Grabcut: interactive foreground extraction using iterated graph cuts. ACM Trans Graph (TOG) 23(3):309–314
Torralba A, Oliva A, Castelhano MS, Henderson JM (2006) Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search. Psychol Rev 113(4):766
Yin XC, Yin X, Huang K, Hao HW (2014) Robust text detection in natural scene images. IEEE Trans Pattern Anal Mach Intell 36(5):970–983
Zhang L, Gu Z, Li H (2013) ‘SDSP: A novel saliency detection method by combining simple priors’, In Image Processing (ICIP), 2013 20th IEEE International Conference on pp. 171–175
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Gupta, N., Jalal, A.S. A robust model for salient text detection in natural scene images using MSER feature detector and Grabcut. Multimed Tools Appl 78, 10821–10835 (2019). https://doi.org/10.1007/s11042-018-6613-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-6613-1