Abstract
Today, we live in an information, communication, and technology era. Information and data have become a major part of our life. Studies have shown that people are spending over 8 h on electronic gadgets that are a world full of information in themselves. But this available information is scattered all over the internet and if a user wants an accurate solution for his quibble, the user needs to read a lot of articles and blogs which is tedious and time consuming. Also, this information is generally available only in English language making it difficult to understand for the non-English speakers. Kee** this in mind, we have developed a multilingual chatbot that will give a response to any query in a fraction of seconds. The response will be given in the same language the query was searched or we can also select the language from the pool. Natural Language Processing (NLP) is used for language-related processing. We have used TF-IDF for document retrieval, BM-25 model for passage extraction and spaCy model for answer extraction. Finally, we have developed the proposed system as an android application as well as a website to make it device independent.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Mondal, A., Dey, M., Das, D., Nagpal, S., Garda, K.: Chatbot: an automated conversation system for the educational domain. In: 2018 International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP), pp. 1–5 (2018). https://doi.org/10.1109/iSAI-NLP.2018.8692927
Adamopoulou, E., Moussiades, L.: An overview of chatbot technology. In: Maglogiannis, I., Iliadis, L., Pimenidis, E. (eds.) Artificial Intelligence Applications and Innovations, AIAI 2020. IFIP Advances in Information and Communication Technology, vol. 584. Springer, Cham (2020)
Kadariya, D., Venkataramanan, R., Yip, H.Y., Kalra, M., Thirunarayanan, K., Sheth, A.: kBot: knowledge-enabled personalized chatbot for asthma self-management. In: 2019 IEEE International Conference on Smart Computing (SMARTCOMP), pp. 138–143 (2019). https://doi.org/10.1109/SMARTCOMP.2019.00043
Bafna, P., Pramod, D., Vaidya, A.: Document clustering: TF-IDF approach. In: 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT), pp. 61–66 (2016). https://doi.org/10.1109/ICEEOT.2016.7754750
Liu, Q., Wang, J., Zhang, D., Yang, Y., Wang, N.: Text features extraction based on TF-IDF associating semantic. In: 2018 IEEE 4th International Conference on Computer and Communications (ICCC), pp. 2338–2343 (2018). https://doi.org/10.1109/CompComm.2018.8780663
Dobesova, Z.: Programming language Python for data processing. In: 2011 International Conference on Electrical and Control Engineering, pp. 4866–4869 (2011). https://doi.org/10.1109/ICECENG.2011.60574
Li, B., Drozd, A., Guo, Y., et al.: Scaling Word2Vec on big corpus. Data Sci. Eng. 4, 157–175 (2019)
Singh, K.N., Dorendro, A., Devi, H.M., Mahanta, A.K.: Analysis of changing trends in textual data representation. In: Santosh, K.C., Gawali, B. (eds.) Recent Trends in Image Processing and Pattern Recognition, RTIP2R 2020. Communications in Computer and Information Science, vol. 1380. Springer, Singapore (2021)
Nguyen, D.T.: Interactive document retrieval system based-on natural language query processing. In: 2009 International Conference on Machine Learning and Cybernetics, pp. 2233–2237 (2009). https://doi.org/10.1109/ICMLC.2009.5212236
Sugimoto, M., Hori, K., Ohsuga, S.: A document retrieval system for assisting creative research. In: Proceedings of 3rd International Conference on Document Analysis and Recognition, vol. 1, pp. 167–170 (1995). https://doi.org/10.1109/ICDAR.1995.598968
Veeravalli, S.G., Varma, V.: Passage retrieval using answer type profiles in question answering. In: 23rd Pacific Asia Conference on Language, Information and Computation, pp. 559–568 (2014)
Sari, S., Adriani, M.: Learning to rank for determining relevant document in Indonesian-English cross language information retrieval using BM25. In: 2014 International Conference on Advanced Computer Science and Information System, pp. 309–314 (2014). https://doi.org/10.1109/ICACSIS.2014.7065896
Neumann, M., King, D., Beltagy, I., Ammar, W.: ScispaCy: fast and robust models for biomedical natural language processing, ar**v:1902.07669v3 [cs.CL], 9 October 2019
Challita, S., Zalila, F., Gourdin, C., Merle, P.: A precise model for google cloud platform. In: 2018 IEEE International Conference on Cloud Engineering (IC2E), pp. 177–183 (2018). https://doi.org/10.1109/IC2E.2018.00041
Sindhu, D.V., Sagar, B.M.: Study on machine translation approaches for Indian languages and their challenges. In: 2016 International Conference on Electrical, Electronics, Communication, Computer and Optimization Techniques (ICEECCOT), pp. 262–267 (2016). https://doi.org/10.1109/ICEECCOT.2016.7955227
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Kadam, K., Godbole, S., Joijode, D., Karoshi, S., Jadhav, P., Shilaskar, S. (2022). Multilingual Information Retrieval Chatbot. In: Gunjan, V.K., Zurada, J.M. (eds) Modern Approaches in Machine Learning & Cognitive Science: A Walkthrough. Studies in Computational Intelligence, vol 1027. Springer, Cham. https://doi.org/10.1007/978-3-030-96634-8_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-96634-8_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-96633-1
Online ISBN: 978-3-030-96634-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)