Log in

Text extraction and enhancement of binary images using cellular automata

  • Published:
International Journal of Automation and Computing Aims and scope Submit manuscript

Abstract

Text characters embedded in images represent a rich source of information for content-based indexing and retrieval applications. However, these text characters are difficult to be detected and recognized due to their various sizes, grayscale values, and complex backgrounds. Existing methods cannot handle well those texts with different contrast or embedded in a complex image background. In this paper, a set of sequential algorithms for text extraction and enhancement of image using cellular automata are proposed. The image enhancement includes gray level, contrast manipulation, edge detection, and filtering. First, it applies edge detection and uses a threshold to filter out for low-contrast text and simplify complex background of high-contrast text from binary image. The proposed algorithm is simple and easy to use and requires only a sample texture binary image as an input. It generates textures with perceived quality, better than those proposed by earlier published techniques. The performance of our method is demonstrated by presenting experimental results for a set of text based binary images. The quality of thresholding is assessed using the precision and recall analysis of the resultant text in the binary image.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Brazil)

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. M. Pietikainen, O. Okun. Edge-based Method for Text Detection from Complex Document Images. In Proceedings of the 6th International Conference on Document Analysis and Recognition, IEEE Press, Seattle, WA, USA, pp. 286–291, 2002.

    Google Scholar 

  2. K. Jung, K. In Kim, A. K. Jain. Text Information Extraction in Images and Video: A Survey. Pattern Recognition, vol. 37, no. 5, pp. 977–997, 2004.

    Article  Google Scholar 

  3. D. F. Dunn, N. E. Mathew. Extracting Color Halftones from Printed Documents Using Texture Analysis. Pattern Recognition, vol. 33, no. 3, pp. 445–463, 2000.

    Article  Google Scholar 

  4. T. Ojala, M. Pietikäinen, T. Mäenpää. Gray Scale and Rotation Invariant Texture Classification with Local Binary Patterns. In Proceedings of the 6th European Conference on Computer Vision, Lecture Notes in Computer Science, Springer-Verlag, London, UK, vol. 1842, pp. 404–420, 2000.

    Google Scholar 

  5. J. Gllavata, R. Ewerth, B. Freisleben. Finding Text in Images via Local Thresholding. In Proceedings of the 3rd IEEE International Symposium on Signal Processing and Information Technology, pp. 539–542, 2003.

  6. X. Liu, J. Samarabandu. Multiscale Edge-based Text Extraction from Complex Images. In Proceedings of IEEE International Conference on Multimedia and Expo, IEEE Press, pp. 1721–1724, 2006.

  7. K. C. Kim, H. R. Byun, Y. J. Song, Y. M. Choi, S. Y. Chi, K. K. Kim, Y. K. Chung. Scene Text Extraction in Natural Scene Images Using Hierarchical Feature Combining and Verification in Pattern Recognition. In Proceedings of the 17th International Conference on Pattern Recognition, IEEE Computer Society, Cambridge, UK, vol. 2, pp. 679–682, 2004.

    Chapter  Google Scholar 

  8. K. Wang, J. A. Kangas. Character Location in Scene Images from Digital Camera. Pattern Recognition, vol. 36, no. 10, pp. 2287–2299, 2003.

    Article  MATH  Google Scholar 

  9. G. Sahoo, T. Kumar. Theory of Computation: A New Approach of Computation into Cellular Automata. In Proceedings of the 2nd International Conference on Advanced Computing & Communication Technologies, India, pp. 608–613, 2007.

  10. Y. Zhong, H. Zhang, A. K. Jain. Automatic Caption Localization in Compressed Video. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 4, pp. 385–392, 2000.

    Article  Google Scholar 

  11. M. I. C. Murguia. Document Segmentation Using Texture Variance and Low Resolution Images. In Proceedings of IEEE Southwest Symposium on Image Analysis and Interpretation, IEEE Press, Arizona, USA, vol. 3, pp. 164–167. 1998.

    Google Scholar 

  12. Q. Ye, W. Gao, W. Wang. A New Texture-insensitive Edge Detection Method. In Proceedings of the Joint Conference of the 4th International Conference on Information, Communications and Signal Processing and the 4th Pacific Rim Conference on Multimedia, IEEE Press, Singapore, vol 2, pp. 768–772, 2003.

    Google Scholar 

  13. I. E. Sobel. Camera Models and Machine Perception, Ph.D. dissertation, Stanford University, Stanford, California, USA, 1970.

    Google Scholar 

  14. L. G. Roberts. Machine Perception of Three-dimensional Solids: Optical and Electro-optical Information Processing, MIT Press Cambridge, Massachusetts, USA, pp. 159–197, 1965.

    Google Scholar 

  15. A. Popovici, D. Popovici. Cellular Automata in Image Processing. In Proceedings of the 15th International Symposium on the Mathematical Theory of Networks and Systems, Romania, pp. 34–44, 2000.

  16. T. Mäenpää. The Local Binary Pattern Approach to Texture Analysis: Extensions and Applications, Ph. D. dissertation, University of Oulo, Finland, 2003.

    Google Scholar 

  17. A. R.Weeks. Fundamentals of Electronic Image Processing, PHI Publishing, Inc., pp. 388–391, 1999.

  18. J. Park, T. N. Dinh, G. Lee. Binarization of Text Region Based on Fuzzy Clustering and Histogram Distribution in Signboards. In Proceedings of World Academy Science, Engineering and Technology, vol. 33, pp. 85–90, 2008.

    Google Scholar 

  19. Q. Ye, J. Jiao, J. Huang, H. Yu. Text Detection and Restoration in Natural Scene Images. Journal of Visual Communication and Image Representation, vol. 18, no. 6, pp. 504–513, 2007.

    Article  Google Scholar 

  20. J. **, X. S. Hua, X. R. Chen, W. Y. Liu, H. J. Zhang. A Video Text Detection and Recognition System. In Proceedings of IEEE International Conference on Multimedia and Expo, IEEE Press, pp. 1080–1083, 2001.

  21. G. Leedham, C. Yan, K. Tarkru, J. H. N. Tan, L. Mian. Comparison of Some Thresholding Algorithms for Text/Background Segmentation in Difficult Document Images. In Proceedings of the 7th International Conference on Document Analysis and Recognition, IEEE Press, Edinburgh, Scotland, pp. 859–864, 2003.

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tapas Kumar.

Additional information

G. Sahoo received his P. G. degree from Utkal University, India, in 1980 and Ph.D. degree in the area of computational mathematics from Indian Institute of Technology, Kharagpur in 1987. He is currently a professor and heading the Department of Computer Science and Engineering in Birla Institute of Technology, India.

His research interests include theoretical computer science, parallel and distributed computing, image processing, and pattern recognition.

Tapas Kumar graduated from Amravati University, India in 1998. He received the master degree in computer science from Guru Jambeshwar University of Secience and Technology, India, and Ph.D. degree in the area of cellular automata application in image processing from Birla Institute of Technology Mesra, India. He is currently working as a lecturer (selection grade) at Lingaya’s Institute of Management and Technology (LIMAT), Fardiabad, India.

His research interests include image processing, especially the application of cellular automata as a computing tool.

B. L. Raina received his P. G. degree from Jammu and Kashmir University in 1962 and Ph.D. degree in the area of computational mathematics from Indian Institute of Technology, New Delhi, India in 1973. He is currently working as a professor in the Department of Information Technology at Lingaya’s Institute of Management and Technology (LIMAT), Fardabad, India.

His research interests include applied numerical techniques, graph theory, image processing, and cryptography.

C. M. Bhatia received his B.Eng. (Hons) degree in electrical engineering from Jabalpur University, India in 1965. He received the P. G. degree from Indian Institute of Technology (IIT) Roorkee, in applied electronics and servomechanism in 1968, and the Ph.D. from IIT Delhi in 1973. He is currently a professor and dean (P.G. studies) at Lingaya’s University, India.

His research interests include power electronics and variable speed drives, flexible AC transmission system (FACTS), microprocessor and microcomputer based system digital filters, and image processing.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sahoo, G., Kumar, T., Raina, B.L. et al. Text extraction and enhancement of binary images using cellular automata. Int. J. Autom. Comput. 6, 254–260 (2009). https://doi.org/10.1007/s11633-009-0254-9

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11633-009-0254-9

Keywords

Navigation