Fast Implementation of String-Kernel-Based Support Vector Classifiers by GPU Computing

  • Conference paper
Neural Information Processing. Models and Applications (ICONIP 2010)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6444))

Included in the following conference series:

  • 2689 Accesses

Abstract

Text categorization is widely used in applications such as spam filtering, identification of document genre, authorship attribution, and automated essay grading. The rapid growth in the amount of text data gives rise to the urgent need for fast text classification algorithms. In this paper, we propose a GPU based SVM solver for large scale text datasets. Using Platt’s Sequential Minimal Optimization algorithm, we achieve a speedup of 5–40 times over LibSVM running on a high-end traditional processor. Prediction time based on the paralleled string kernel computing scheme shows 5–90 times faster performance than the CPU based implementation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 85.59
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 106.99
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press, Cambridge (2004)

    Book  MATH  Google Scholar 

  2. Vapnik, V.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)

    Book  MATH  Google Scholar 

  3. Cristianini, N., Shawe-Taylor, J.: An introduction to Support Vector Machines. Cambridge University Press, Cambridge (2000)

    MATH  Google Scholar 

  4. Oei, C., Friedland, G., Janin, A.: Parallel Training of a Multi-Layer Perceptron on a GPU. ICSI Technical Report (2009)

    Google Scholar 

  5. Andrecut, M.: Parallel GPU Implementation of Iterative PCA Algorithms. Journal of Computational Biology 16(11), 1593–1599 (2009)

    Article  MathSciNet  Google Scholar 

  6. Hall, J.D., Hart, J.C.: GPU Acceleration of Iterative Clustering (2004)

    Google Scholar 

  7. Lodhi, H., Saunders, C., Shawe-Taylor, J., Cristianini, N., Watkins, C.: Text classification using string kernels. Journal of Machine Learning Research (2), 419–444 (2002)

    Google Scholar 

  8. Saigo, H., Vert, J., Ueda, N., Akutsu, T.: Protein homology detection using string alignment kernels. Oxford University Press, Oxford (2004)

    Google Scholar 

  9. Keerthi, S.S., Shevade, S.K., Bhattacharyya, C., Murthy, K.R.K.: Improvements to Platt’s SMO Algorithm for SVM Classifier Design. Neural Comput. 13, 637–649 (2001)

    Article  MATH  Google Scholar 

  10. Moravanszky, A.: Linear algebra on the GPU. In: Engel, W.F. (ed.) Shader X 2. Wordware Publishing, Texas (2003)

    Google Scholar 

  11. Manocha, D.: Interactive geometric & scientific computations using graphics hardware. In: SIGGRAPH, Tutorial Course#11 (2003)

    Google Scholar 

  12. Catanzaro, B., Sundaram, N., Keutzer, K.: Fast support vector machine training and classification on graphics processors. In: ICML 2008: Proceedings of the 25th International Conference on Machine Learning, pp. 104–111. ACM, New York (2008)

    Chapter  Google Scholar 

  13. Carpenter, A.: CUSVM: A cuda implementation of support vector classification and regression (2009)

    Google Scholar 

  14. Platt, J.C.: Fast training of support vector machines using sequential minimal optimization. In: Advances in Kernel Methods: Support Vector Learning, pp. 185–208. MIT Press, Cambridge (1999)

    Google Scholar 

  15. http://www.cs.umb.edu/~smimarog/textmining/datasets/index.html

  16. http://spamassassin.apache.org/publiccorpus/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shi, Y., Ban, T., Guo, S., Xu, Q., Kadobayashi, Y. (2010). Fast Implementation of String-Kernel-Based Support Vector Classifiers by GPU Computing. In: Wong, K.W., Mendis, B.S.U., Bouzerdoum, A. (eds) Neural Information Processing. Models and Applications. ICONIP 2010. Lecture Notes in Computer Science, vol 6444. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17534-3_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17534-3_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17533-6

  • Online ISBN: 978-3-642-17534-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation