Abstract
Text categorization is widely used in applications such as spam filtering, identification of document genre, authorship attribution, and automated essay grading. The rapid growth in the amount of text data gives rise to the urgent need for fast text classification algorithms. In this paper, we propose a GPU based SVM solver for large scale text datasets. Using Platt’s Sequential Minimal Optimization algorithm, we achieve a speedup of 5–40 times over LibSVM running on a high-end traditional processor. Prediction time based on the paralleled string kernel computing scheme shows 5–90 times faster performance than the CPU based implementation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press, Cambridge (2004)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)
Cristianini, N., Shawe-Taylor, J.: An introduction to Support Vector Machines. Cambridge University Press, Cambridge (2000)
Oei, C., Friedland, G., Janin, A.: Parallel Training of a Multi-Layer Perceptron on a GPU. ICSI Technical Report (2009)
Andrecut, M.: Parallel GPU Implementation of Iterative PCA Algorithms. Journal of Computational Biology 16(11), 1593–1599 (2009)
Hall, J.D., Hart, J.C.: GPU Acceleration of Iterative Clustering (2004)
Lodhi, H., Saunders, C., Shawe-Taylor, J., Cristianini, N., Watkins, C.: Text classification using string kernels. Journal of Machine Learning Research (2), 419–444 (2002)
Saigo, H., Vert, J., Ueda, N., Akutsu, T.: Protein homology detection using string alignment kernels. Oxford University Press, Oxford (2004)
Keerthi, S.S., Shevade, S.K., Bhattacharyya, C., Murthy, K.R.K.: Improvements to Platt’s SMO Algorithm for SVM Classifier Design. Neural Comput. 13, 637–649 (2001)
Moravanszky, A.: Linear algebra on the GPU. In: Engel, W.F. (ed.) Shader X 2. Wordware Publishing, Texas (2003)
Manocha, D.: Interactive geometric & scientific computations using graphics hardware. In: SIGGRAPH, Tutorial Course#11 (2003)
Catanzaro, B., Sundaram, N., Keutzer, K.: Fast support vector machine training and classification on graphics processors. In: ICML 2008: Proceedings of the 25th International Conference on Machine Learning, pp. 104–111. ACM, New York (2008)
Carpenter, A.: CUSVM: A cuda implementation of support vector classification and regression (2009)
Platt, J.C.: Fast training of support vector machines using sequential minimal optimization. In: Advances in Kernel Methods: Support Vector Learning, pp. 185–208. MIT Press, Cambridge (1999)
http://www.cs.umb.edu/~smimarog/textmining/datasets/index.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shi, Y., Ban, T., Guo, S., Xu, Q., Kadobayashi, Y. (2010). Fast Implementation of String-Kernel-Based Support Vector Classifiers by GPU Computing. In: Wong, K.W., Mendis, B.S.U., Bouzerdoum, A. (eds) Neural Information Processing. Models and Applications. ICONIP 2010. Lecture Notes in Computer Science, vol 6444. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17534-3_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-17534-3_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17533-6
Online ISBN: 978-3-642-17534-3
eBook Packages: Computer ScienceComputer Science (R0)