Abstract
The IndoWordNet is a multilingual WordNet for eighteen Indian languages. Punjabi language WordNet is a part of the IndoWordNet. The efforts made for the development of Punjabi language WordNet are undoubtedly appreciable, but various anomalies exist in this language resource. These anomalies were detected in Punjabi language WordNet, while using it for text processing. This paper presents the results of the evaluation, discusses the possible causes behind identified issues, and recommends necessary modifications. In this evaluation, the anomalies were identified by analyzing the IndoWordNet text files manually with the help of various word processors. To verify the results, these anomalies were also tested with online versions of the IndoWordNet available at www.cfilt.iitb.ac.in/indowordnet/ and www.tdil-dc.in/indowordnet/. Although, these web applications behave differently for similar inputs, but the existence of anomalies was verified by these online versions. The IndoWordNet is a structured language resource, but these anomalies violate the rules of its structure. It was tested for the purpose of improving its data accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Change history
10 November 2021
The original version of the book was inadvertently published with incorrect Gurmukhi (Panjabi) words in chapter “Anomalies in Punjabi Language WordNet: An IndoWordNet Evaluation". The correction chapter and the book have been updated with the change.
References
Luk, S.K., Knight, K.: Building a largescale knowledge base for machine translation. AAAI 94, 773–778 (1994)
Reddy, M., Manish Sinha, P.B.: An approach towards construction and application of multilingual indo-wordnet. In: 3rd Global Wordnet Conference (GWC 06), Jeju Island, Korea (2006)
Verdejo, F., Chugur, I., Julio Gonzalo, J.C.: Indexing with wordnet synsets can improve text retrieval. ar**v preprint cmplg (1998)
Harabagiu, S., Pasca, M.: The informative role of wordnet in open-domain question answering. In: Proceedings of NAACL-01 Workshop on WordNet and Other Lexical Resources, pp. 138–143 (2001)
Fellbaum, C.: WordNet.: Wiley Online Library (1998)
George, A., Fellbaum, R., Gross, C., Miller, D., Miller, K.J.: Introduction to wordnet: An on-line lexical database. Int. J. Lexicography 3(4), 235–244 (1990)
Vossen, P.: EuroWordNet: a multilingual database for information retrieval. In: DELOS, Zurich, pp. 5–7 (1997)
Bhattacharyya, P.: Indowordnet. In: Lexical Resources Engineering Conference 2010 (LREC 2010), Malta, May 2010
Chakrabarti, D., Pande, P., Dipak Narayan, P.B.: An experience in building the indo wordnet-a wordnet for hindi. In: First International Conference on Global WordNet, Mysore, India (2002)
Sharma, R.K., Ashish Narang, P.K.: Development of Punjabi WordNet. CSI Trans. ICT 1(4), 349–354 (2013)
Patel, K., Diptesh Kanojia, P.B.: India language Wordnets and their linkages with princeton WordNet. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018), Miyazaki, Japan (2018)
Bhattacharyya, P., Pawar J.D., Sekhar Dash, N. (Eds.): The WordNet in Indian Languages. Springer (2017)
Bhatia, T.K.: Major regional languages. In: Kachru, Y., Sridhar, S.N., Kachru, B.B. (Eds.) Languages in South Asia. Cambridge University Press, pp. 121–131 (2008)
Abbas Malik, M.G.: Punjabi machine transliteration. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL, Sydney, pp. 1137–1144 (2006)
Sharma, R.K., Preet, S., Bhatia, P., Kaur, R.: Punjabi WordNet Relations and Categorization of Synsets. In: 3rd national workshop on IndoWordNet under the aegis of the 8th international conference on natural language processing (ICON 2010), Kharagpur, India (2010)
Princeton WordNet. [Online]. https://wordnet.princeton.edu/documentation/wninput5wn
National Workshop on WordNet Creation. [Online]. www.cfilt.iitb.ac.in/coimbatore-amrita-university-wordnet-workshop.pdf
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Singh, H. (2022). Anomalies in Punjabi Language WordNet: An IndoWordNet Evaluation. In: Bianchini, M., Piuri, V., Das, S., Shaw, R.N. (eds) Advanced Computing and Intelligent Technologies. Lecture Notes in Networks and Systems, vol 218. Springer, Singapore. https://doi.org/10.1007/978-981-16-2164-2_44
Download citation
DOI: https://doi.org/10.1007/978-981-16-2164-2_44
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-2163-5
Online ISBN: 978-981-16-2164-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)