Writer Identification Using Inexpensive Signal Processing Techniques

Mokhov, Serguei A.; Song, Miao; Suen, Ching Y.

doi:10.1007/978-90-481-9112-3_74

Abstract

We propose to use novel and classical audio and text signal-processing techniques, among others, for “inexpensive”, fast writer identification of scanned handwritten documents, performed “visually”. Here “inexpensive” refers to the efficiency of the identification process in terms of CPU cycles while preserving decent accuracy for preliminary identification. This is a comparative study of multiple algorithm combinations in a pattern recognition pipeline implemented in Java around the open-source Modular Audio Recognition Framework (MARF), which is applicable well beyond audio. We present our preliminary experimental findings for this identification task. We simulate “visual” identification by “looking” at the handwritten document as a whole rather than trying to extract fine-grained features from it prior to classification.
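To make the whole-document idea concrete, the following is a minimal sketch in Java of one possible pipeline of this kind: the raw bytes of a scanned file are read as a one-dimensional signal, an averaged magnitude spectrum serves as the feature vector, and the closest trained writer is chosen by cosine similarity. It does not use MARF's API. The class name, method names, window size, and the choice of a naive DFT with cosine similarity are all assumptions made for illustration, not the combinations evaluated in the chapter.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical illustration only; this is not MARF's API. */
class WholeDocumentWriterId {

    /** Interpret raw file bytes as a 1-D signal scaled to [-1, 1). */
    static double[] toSignal(byte[] raw) {
        double[] signal = new double[raw.length];
        for (int i = 0; i < raw.length; i++) {
            signal[i] = raw[i] / 128.0; // Java bytes are signed: -128..127
        }
        return signal;
    }

    /** Magnitude spectrum (naive DFT) averaged over non-overlapping windows. */
    static double[] features(double[] signal, int window) {
        int bins = window / 2;
        double[] acc = new double[bins];
        int count = 0;
        for (int start = 0; start + window <= signal.length; start += window) {
            for (int k = 0; k < bins; k++) {
                double re = 0.0, im = 0.0;
                for (int n = 0; n < window; n++) {
                    double angle = 2.0 * Math.PI * k * n / window;
                    re += signal[start + n] * Math.cos(angle);
                    im -= signal[start + n] * Math.sin(angle);
                }
                acc[k] += Math.hypot(re, im);
            }
            count++;
        }
        if (count > 0) {
            for (int k = 0; k < bins; k++) {
                acc[k] /= count;
            }
        }
        return acc;
    }

    /** Cosine similarity of two equal-length vectors; 0 if either is all zeros. */
    static double cosineSimilarity(double[] a, double[] b) {
        double dot = 0.0, na = 0.0, nb = 0.0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            na += a[i] * a[i];
            nb += b[i] * b[i];
        }
        if (na == 0.0 || nb == 0.0) {
            return 0.0;
        }
        return dot / Math.sqrt(na * nb);
    }

    /** Return the label of the trained vector most similar to the sample. */
    static String classify(double[] sample, Map<String, double[]> trained) {
        String best = null;
        double bestSim = Double.NEGATIVE_INFINITY;
        for (Map.Entry<String, double[]> e : trained.entrySet()) {
            double sim = cosineSimilarity(sample, e.getValue());
            if (sim > bestSim) {
                bestSim = sim;
                best = e.getKey();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Synthetic stand-ins for two writers' scanned documents (assumed data).
        byte[] slow = new byte[256];
        byte[] fast = new byte[256];
        for (int i = 0; i < 256; i++) {
            slow[i] = (byte) (100 * Math.sin(2 * Math.PI * i / 64));
            fast[i] = (byte) (100 * Math.sin(2 * Math.PI * i / 4));
        }
        Map<String, double[]> trained = new LinkedHashMap<>();
        trained.put("writerA", features(toSignal(slow), 64));
        trained.put("writerB", features(toSignal(fast), 64));
        System.out.println(classify(features(toSignal(fast), 64), trained));
    }
}
```

The naive DFT costs O(window²) per window and is kept only for readability; a real low-cost pipeline would more likely use an FFT, and other distance measures could replace cosine similarity in the classification step.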

Acknowledgments

This work is partially funded by NSERC, FQRSC, and the Graduate School and the Faculty of Engineering and Computer Science, Concordia University, Montreal, Canada.

Author information

Authors and Affiliations

Computer Science and Software Engineering, Concordia University, Montreal, Canada
Serguei A. Mokhov
Graduate School, Concordia University, Montreal, Canada
Miao Song
Centre for Pattern Recognition and Machine Intelligence, Concordia University, Montreal, Canada
Ching Y. Suen

Corresponding author

Correspondence to Serguei A. Mokhov.

Editor information

Editors and Affiliations

School of Engineering, University of Bridgeport, University Avenue 221, Bridgeport, 06604, USA
Tarek Sobh
School of Engineering, University of Bridgeport, University Avenue 221, Bridgeport, 06604, USA
Khaled Elleithy

About this paper

Cite this paper

Mokhov, S.A., Song, M., Suen, C.Y. (2010). Writer Identification Using Inexpensive Signal Processing Techniques. In: Sobh, T., Elleithy, K. (eds) Innovations in Computing Sciences and Software Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-9112-3_74

DOI: https://doi.org/10.1007/978-90-481-9112-3_74
Published: 20 May 2010
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-9111-6
Online ISBN: 978-90-481-9112-3
eBook Packages: Computer Science, Computer Science (R0)
