Automatic Clustering of Research Articles Using Domain Ontology and Fuzzy Logic

  • Conference paper
  • First Online:
Advances in Web-Based Learning – ICWL 2013 Workshops (ICWL 2013)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8390))

Included in the following conference series:

  • 1089 Accesses

Abstract

Today, one of the key tasks for Web based learning is to retrieve research articles of specific domains. To accomplish this task, data mining techniques and semantic web technologies can be used to retrieve user relevant documents. In this work we extract author supplied keywords from a collection of computer science articles, which has a strong influence on the topic of the article when compared with other words. For these keywords term weight is computed using Fuzzy Logic which uses three criteria namely map** with concept in domain ontology, keyword frequency in the title and keyword frequency in abstract. Using domain ontology, keywords of each document with their term weights are represented hierarchically as XML documents and they are clustered. The experimental results show that the proposed technique yields better precision and recall rates when compared with some of the existing approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 42.79
Price includes VAT (Germany)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 53.49
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Ceravolo, P., Nocerino, M.C., Viviani, M.: Knowledge extraction from semi-structured data based on fuzzy techniques. In: Negoita, M.G., Howlett, R.J., Jain, L.C. (eds.) KES 2004. LNCS (LNAI), vol. 3215, pp. 328–334. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  2. Dalamagas, T., Cheng, T., Winkel, K.J., Sellis, T.: A methodology for clustering XML documents by structure. Inf. Syst. 31(3), 187–228 (2006)

    Article  Google Scholar 

  3. Damiani, E., Nocerino, M.C., Viviani, M.: Knowledge extraction from an XML data flow: building a taxonomy based on clustering technique. In: Proceedings of EUROFUSE 2004: Eighth Meeting EURO Working Group on Fuzzy Sets, pp. 133–142 (2004)

    Google Scholar 

  4. Dogac, A., Laleci, G.B., Kabak, Y., Cingi, I.: Exploiting web service semantics: taxonomies vs. ontologies. IEEE Data Eng. Bull. 25(4), 10–14 (2002)

    Google Scholar 

  5. Fekete, J.D., Grinstein, G., Plaisant, C.: IEEE InfoVis 2004 Contest. In: The History of InfoVis (2004)

    Google Scholar 

  6. Ghosh, P.M., Mitra, P.: Combining content and structure similarity for XML document. In: Proceedings of the International Conference on Pattern Recognition, pp. 1–4 (2008)

    Google Scholar 

  7. Gil-García, R., Badia-Contelles, J.M., Pons-Porrata, A.: A general framework for agglomerative hierarchical clustering algorithms. In: Proceedings of 18th International Conference on Pattern Recognition ICPR, vol. 2, pp. 569–572 (2006)

    Google Scholar 

  8. Guerrini, G., Mesiti, M., Sanz, I.: An overview of similarity measures for clustering XML documents. In: Web Data Management Practices: Emerging Techniques and Technologies, pp. 56–78 (2007)

    Google Scholar 

  9. Jeong, B., Lee, D., Cho, H., Lee, J.: A novel method for measuring semantic similarity for XML matching. Expert Syst. Appl. 34(3), 1651–1658 (2008)

    Article  Google Scholar 

  10. Leung, H.P., Chung, F.L., Chan, S.C.: A new sequential mining approach to XML document similarity computation. In: Proceedings of the 7th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Seoul, Korea, pp. 569–569 (2003)

    Google Scholar 

  11. Nagypál, G.: Improving information retrieval effectiveness by using domain knowledge stored in ontologies. In: Meersman, R., Tari, Z. (eds.) OTM-WS 2005. LNCS, vol. 3762, pp. 780–789. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  12. Nierman, A., Jagadish, H.V.: Evaluating structural similarity in XML documents. In: Proceedings of 5th International Workshop on the Web and Databases (WebDB 2002), Madison, Wisconsin, USA, pp. 61–66 (2002)

    Google Scholar 

  13. Pan, H., Tan, X., Han, A., Yin, G.: A domain knowledge based approach for medical image retrieval. Int. J. Inf. Eng. Electron. Bus. 3, 16–22 (2011)

    Article  Google Scholar 

  14. Paralic, J., Kostial, I.: Ontology-based information retrieval. In: 14th International Conference on Information and Intelligent Systems, Varazdin, Croatia, pp. 23–28 (2003)

    Google Scholar 

  15. Paukkeri, M.-S.: Learning a taxonomy from a set of text documents. In: Applied Soft Computing, pp. 1138–1148 (2011)

    Google Scholar 

  16. Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic Indexing. Commun. ACM 18(11), 613–620 (1975)

    Article  MATH  Google Scholar 

  17. Sølvberg, I., Nordbø, I., Aamodt, A.: Knowledge-based information retrieval. Future Gener. Comput. Syst. 7, 379–390 (1992)

    Article  Google Scholar 

  18. Tekli, J., Chbeir, R., Yétongnon, K.: A hybrid approach for XML similarity. In: van Leeuwen, J., Italiano, G.F., van der Hoek, W., Meinel, C., Sack, H., Plášil, F. (eds.) SOFSEM 2007. LNCS, vol. 4362, pp. 783–795. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  19. Tekli, J., Chbeir, R., Yetongnon, K.: An overview on XML similarity: background, current trends and future directions. Comput. Sci. Rev. 3(3), 151–173 (2009)

    Article  MATH  Google Scholar 

  20. Tran, T., Nayak, R., Bruza, P.: Combining structure and content similarities for XML document clustering. In: Proceedings of the 7th Australasian Data Mining Conference (AusDM), vol. 87, pp. 219–226 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ramanathan Periakaruppan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Periakaruppan, R., Nadarajan, R. (2015). Automatic Clustering of Research Articles Using Domain Ontology and Fuzzy Logic. In: Chiu, D., et al. Advances in Web-Based Learning – ICWL 2013 Workshops. ICWL 2013. Lecture Notes in Computer Science(), vol 8390. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-46315-4_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-46315-4_5

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-46314-7

  • Online ISBN: 978-3-662-46315-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation