Writer Identification Using Inexpensive Signal Processing Techniques

  • Conference paper
  • First Online:
Innovations in Computing Sciences and Software Engineering

Abstract

We propose to use novel and classical audio and text signal-processing and otherwise techniques for “inexpensive” fast writer identification tasks of scanned hand-written documents “visually”. The “inexpensive” refers to the efficiency of the identification process in terms of CPU cycles while preserving decent accuracy for preliminary identification. This is a comparative study of multiple algorithm combinations in a pattern recognition pipeline implemented in Java around an open-source Modular Audio Recognition Framework (MARF) that can do a lot more beyond audio. We present our preliminary experimental findings in such an identification task. We simulate “visual” identification by “looking” at the hand-written document as a whole rather than trying to extract fine-grained features out of it prior classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 160.49
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 213.99
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info
Hardcover Book
EUR 213.99
Price includes VAT (Germany)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. S. A. Mokhov and M. Debbabi, “File type analysis using signal processing techniques and machine learning vs. file unix utility for forensic analysis,” in Proceedings of the IT Incident Management and IT Forensics (IMF’08), O. Goebel, S. Frings, D. Guenther, J. Nedon, and D. Schadt, Eds. Mannheim, Germany: GI, Sep. 2008, pp. 73–85, LNI140.

    Google Scholar 

  2. I. F. Darwin, J. Gilmore, G. Collyer, R. McMahon, G. Harris, C. Zoulas, C. Lowth, E. Fischer, and Various Contributors, “file – determine file type, BSD General Commands Manual, file(1),” BSD, Jan. 1973–2007, man file(1).

    Google Scholar 

  3. ——, “file - determine file type,” [online], Mar. 1973-2008, ftp://ftp.astron.com/pub/file/, last viewed April 2008.

  4. S. Mokhov, I. Clement, S. Sinclair, and D. Nicolacopoulos, “Modular Audio Recognition Framework,” Department of Computer Science and Software Engineering, Concordia University, Montreal, Canada, 2002– 2003, project report, http://marf.sf.net, last viewed April 2008.

    Google Scholar 

  5. The MARF Research and Development Group, “The Modular Audio Recognition Framework and its Applications,” [online], 2002-2009, http://marf.sf.net, last viewed October 2009.

  6. S. A. Mokhov, “Experimental results and statistics in the implementation of the modular audio recognition framework’s API for text-independent speaker identification,” in Proceedings of the 6th International Conference on Computing, Communications and Control Technologies (CCCT’08), C. D. Zinn, H.-W. Chu, M. Savoie, J. Ferrer, and A. Munitic, Eds., vol. II. Orlando, Florida, USA: IIIS, Jun. 2008, pp. 267–272.

    Google Scholar 

  7. ——, “Introducing MARF: a modular audio recognition framework and its applications for scientific and software engineering research,” in Advances in Computer and Information Sciences and Engineering. University of Bridgeport, U.S.A.: Springer Netherlands, Dec. 2007, pp. 473–478, proceedings of CISSE/SCSS’07.

    Google Scholar 

  8. ——, “Choosing best algorithm combinations for speech processing tasks in machine learning using MARF,” in Proceedings of the 21st Canadian AI’08, S. Bergler, Ed. Windsor, Ontario, Canada: Springer-Verlag, Berlin Heidelberg, May 2008, pp. 216–221, LNAI 5032.

    Google Scholar 

  9. ——, “Study of best algorithm combinations for speech processing tasks in machine learning using median vs. mean clusters in MARF,” in Proceedings of C3S2E’08, B. C. Desai, Ed. Montreal, Quebec, Canada: ACM, May 2008, pp. 29–43, ISBN 978-1-60558-101-9.

    Google Scholar 

  10. ——, “On design and implementation of distributed modular audio recognition framework: Requirements and specification design document,” [online], Aug. 2006, project report, http://arxiv.org/abs/0905. 2459, last viewed May 2009.

  11. S. A. Mokhov, L. W. Huynh, and J. Li, “Managing distributed MARF with SNMP,” Concordia Institute for Information Systems Engineering, Concordia University, Montreal, Canada, Apr. 2007, project Report. Hosted at http://marf.sf.net, last viewed April 2008.

    Google Scholar 

  12. S. A. Mokhov, “Towards security hardening of scientific distributed demand-driven and pipelined computing systems,” in Proceedings of the 7th International Symposium on Parallel and Distributed Computing (ISPDC’08). Krakow, Poland: IEEE Computer Society, Jul. 2008, pp. 375–382.

    Google Scholar 

  13. S. A. Mokhov, L. W. Huynh, J. Li, and F. Rassai, “A privacy framework within the java data security framework (JDSF): Design refinement, implementation, and statistics,” in Proceedings of the 12th World Multi-Conference on Systemics, Cybernetics and Informatics (WM-SCI’08), N. Callaos, W. Lesso, C. D. Zinn, J. Baralt, J. Boukachour, C. White, T. Marwala, and F. V. Nelwamondo, Eds., vol. V. Orlando, Florida, USA: IIIS, Jun. 2008, pp. 131–136.

    Google Scholar 

  14. S. A. Mokhov, L. Wang, and J. Li, “Simple dynamic key management in SQL randomization,” 2009, unpublished.

    Google Scholar 

  15. E. Gamma and K. Beck, “JUnit,” Object Mentor, Inc., 2001-2004, http: //junit.org/.

    Google Scholar 

  16. S. M. Bernsee, The DFT “`a pied”: Mastering The Fourier Transform in One Day. DSPdimension.com, 1999-2005, http://www.dspdimension.com/data/html/dftapied.html.

  17. H. Abdi, “Distance,” in Encyclopedia of Measurement and Statistics, N. Salkind, Ed., Thousand Oaks (CA): Sage, 2007.

    Google Scholar 

  18. P. Mahalanobis, “On the generalised distance in statistics.” Proceedings of the National Institute of Science of India 12 (1936) 49-55, 1936, http://en.wikipedia.org/wiki/Mahalanobis_distance.

    Google Scholar 

  19. D. Mackenzie, P. Eggert, and R. Stallman, “Comparing and merging files,” [online], 2002, http://www.gnu.org/software/diffutils/manual/ps/diff.ps.gz.

  20. R. W. Hamming, “Error Detecting and Error Correcting Codes.” Bell System Technical Journal 26(2):147-160, 1950, http://en.wikipedia.org/wiki/Hamming_distance.

    MathSciNet  Google Scholar 

  21. E. Garcia, “Cosine similarity and term weight tutorial,” 2006, http://www.miislita.com/information-retrieval-tutorial/cosine-similarity-tutorial.html.

  22. A. Kishore, “Similarity measure: Cosine similarity or euclidean distance or both,” Feb. 2007, http://semanticvoid.com/blog/2007/02/23/similarity-measure-cosine-similarity-or-euclidean-distance-or-both/.

  23. M. Khalif´e, “Examining orthogonal concepts-based micro-classifiers and their correlations with noun-phrase coreference chains,” Master’s thesis, Concordia University, Montr´eal, Canada, 2004.

    Google Scholar 

  24. G. K. Zipf, The Psychobiology of Language. Houghton-Mifflin, New York, NY, 1935, http://en.wikipedia.org/wiki/Zipf%27s_law.

  25. S. Haridas, “Generation of 2-D digital filters with variable magnitude characteristics starting from a particular type of 2-variable continued fraction expansion,” Master’s thesis, Concordia University, Montr´eal, Canada, Jul. 2006.

    Google Scholar 

Download references

Acknowledgments

This work is partially funded by NSERC, FQRSC, and Graduate School and the Faculty of Engineering and Computer Science, Concordia University, Montreal, Canada.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Serguei A. Mokhov .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media B.V.

About this paper

Cite this paper

Mokhov, S.A., Song, M., Suen, C.Y. (2010). Writer Identification Using Inexpensive Signal Processing Techniques. In: Sobh, T., Elleithy, K. (eds) Innovations in Computing Sciences and Software Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-9112-3_74

Download citation

  • DOI: https://doi.org/10.1007/978-90-481-9112-3_74

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-90-481-9111-6

  • Online ISBN: 978-90-481-9112-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation