Abstract
We propose to use novel and classical audio and text signal-processing and otherwise techniques for “inexpensive” fast writer identification tasks of scanned hand-written documents “visually”. The “inexpensive” refers to the efficiency of the identification process in terms of CPU cycles while preserving decent accuracy for preliminary identification. This is a comparative study of multiple algorithm combinations in a pattern recognition pipeline implemented in Java around an open-source Modular Audio Recognition Framework (MARF) that can do a lot more beyond audio. We present our preliminary experimental findings in such an identification task. We simulate “visual” identification by “looking” at the hand-written document as a whole rather than trying to extract fine-grained features out of it prior classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
S. A. Mokhov and M. Debbabi, “File type analysis using signal processing techniques and machine learning vs. file unix utility for forensic analysis,” in Proceedings of the IT Incident Management and IT Forensics (IMF’08), O. Goebel, S. Frings, D. Guenther, J. Nedon, and D. Schadt, Eds. Mannheim, Germany: GI, Sep. 2008, pp. 73–85, LNI140.
I. F. Darwin, J. Gilmore, G. Collyer, R. McMahon, G. Harris, C. Zoulas, C. Lowth, E. Fischer, and Various Contributors, “file – determine file type, BSD General Commands Manual, file(1),” BSD, Jan. 1973–2007, man file(1).
——, “file - determine file type,” [online], Mar. 1973-2008, ftp://ftp.astron.com/pub/file/, last viewed April 2008.
S. Mokhov, I. Clement, S. Sinclair, and D. Nicolacopoulos, “Modular Audio Recognition Framework,” Department of Computer Science and Software Engineering, Concordia University, Montreal, Canada, 2002– 2003, project report, http://marf.sf.net, last viewed April 2008.
The MARF Research and Development Group, “The Modular Audio Recognition Framework and its Applications,” [online], 2002-2009, http://marf.sf.net, last viewed October 2009.
S. A. Mokhov, “Experimental results and statistics in the implementation of the modular audio recognition framework’s API for text-independent speaker identification,” in Proceedings of the 6th International Conference on Computing, Communications and Control Technologies (CCCT’08), C. D. Zinn, H.-W. Chu, M. Savoie, J. Ferrer, and A. Munitic, Eds., vol. II. Orlando, Florida, USA: IIIS, Jun. 2008, pp. 267–272.
——, “Introducing MARF: a modular audio recognition framework and its applications for scientific and software engineering research,” in Advances in Computer and Information Sciences and Engineering. University of Bridgeport, U.S.A.: Springer Netherlands, Dec. 2007, pp. 473–478, proceedings of CISSE/SCSS’07.
——, “Choosing best algorithm combinations for speech processing tasks in machine learning using MARF,” in Proceedings of the 21st Canadian AI’08, S. Bergler, Ed. Windsor, Ontario, Canada: Springer-Verlag, Berlin Heidelberg, May 2008, pp. 216–221, LNAI 5032.
——, “Study of best algorithm combinations for speech processing tasks in machine learning using median vs. mean clusters in MARF,” in Proceedings of C3S2E’08, B. C. Desai, Ed. Montreal, Quebec, Canada: ACM, May 2008, pp. 29–43, ISBN 978-1-60558-101-9.
——, “On design and implementation of distributed modular audio recognition framework: Requirements and specification design document,” [online], Aug. 2006, project report, http://arxiv.org/abs/0905. 2459, last viewed May 2009.
S. A. Mokhov, L. W. Huynh, and J. Li, “Managing distributed MARF with SNMP,” Concordia Institute for Information Systems Engineering, Concordia University, Montreal, Canada, Apr. 2007, project Report. Hosted at http://marf.sf.net, last viewed April 2008.
S. A. Mokhov, “Towards security hardening of scientific distributed demand-driven and pipelined computing systems,” in Proceedings of the 7th International Symposium on Parallel and Distributed Computing (ISPDC’08). Krakow, Poland: IEEE Computer Society, Jul. 2008, pp. 375–382.
S. A. Mokhov, L. W. Huynh, J. Li, and F. Rassai, “A privacy framework within the java data security framework (JDSF): Design refinement, implementation, and statistics,” in Proceedings of the 12th World Multi-Conference on Systemics, Cybernetics and Informatics (WM-SCI’08), N. Callaos, W. Lesso, C. D. Zinn, J. Baralt, J. Boukachour, C. White, T. Marwala, and F. V. Nelwamondo, Eds., vol. V. Orlando, Florida, USA: IIIS, Jun. 2008, pp. 131–136.
S. A. Mokhov, L. Wang, and J. Li, “Simple dynamic key management in SQL randomization,” 2009, unpublished.
E. Gamma and K. Beck, “JUnit,” Object Mentor, Inc., 2001-2004, http: //junit.org/.
S. M. Bernsee, The DFT “`a pied”: Mastering The Fourier Transform in One Day. DSPdimension.com, 1999-2005, http://www.dspdimension.com/data/html/dftapied.html.
H. Abdi, “Distance,” in Encyclopedia of Measurement and Statistics, N. Salkind, Ed., Thousand Oaks (CA): Sage, 2007.
P. Mahalanobis, “On the generalised distance in statistics.” Proceedings of the National Institute of Science of India 12 (1936) 49-55, 1936, http://en.wikipedia.org/wiki/Mahalanobis_distance.
D. Mackenzie, P. Eggert, and R. Stallman, “Comparing and merging files,” [online], 2002, http://www.gnu.org/software/diffutils/manual/ps/diff.ps.gz.
R. W. Hamming, “Error Detecting and Error Correcting Codes.” Bell System Technical Journal 26(2):147-160, 1950, http://en.wikipedia.org/wiki/Hamming_distance.
E. Garcia, “Cosine similarity and term weight tutorial,” 2006, http://www.miislita.com/information-retrieval-tutorial/cosine-similarity-tutorial.html.
A. Kishore, “Similarity measure: Cosine similarity or euclidean distance or both,” Feb. 2007, http://semanticvoid.com/blog/2007/02/23/similarity-measure-cosine-similarity-or-euclidean-distance-or-both/.
M. Khalif´e, “Examining orthogonal concepts-based micro-classifiers and their correlations with noun-phrase coreference chains,” Master’s thesis, Concordia University, Montr´eal, Canada, 2004.
G. K. Zipf, The Psychobiology of Language. Houghton-Mifflin, New York, NY, 1935, http://en.wikipedia.org/wiki/Zipf%27s_law.
S. Haridas, “Generation of 2-D digital filters with variable magnitude characteristics starting from a particular type of 2-variable continued fraction expansion,” Master’s thesis, Concordia University, Montr´eal, Canada, Jul. 2006.
Acknowledgments
This work is partially funded by NSERC, FQRSC, and Graduate School and the Faculty of Engineering and Computer Science, Concordia University, Montreal, Canada.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media B.V.
About this paper
Cite this paper
Mokhov, S.A., Song, M., Suen, C.Y. (2010). Writer Identification Using Inexpensive Signal Processing Techniques. In: Sobh, T., Elleithy, K. (eds) Innovations in Computing Sciences and Software Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-9112-3_74
Download citation
DOI: https://doi.org/10.1007/978-90-481-9112-3_74
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-9111-6
Online ISBN: 978-90-481-9112-3
eBook Packages: Computer ScienceComputer Science (R0)