Search Results
-
Analysing terminology translation errors in statistical and neural machine translation
Terminology translation plays a critical role in domain-specific machine translation (MT). Phrase-based statistical MT (PB-SMT) has been the dominant...
-
Hate Speech Detection on Code-Mixed Dataset Using a Fusion of Custom and Pre-trained Models with Profanity Vector Augmentation
With the increase in user-generated content on social media networks, hate speech and offensive language content are also increasing. From the...
-
A Novel Hybrid Translator for Gujarati to Interlingual English MTS for Personage Idioms
Gujarat is one of the states in the western part of India, and the Gujarati language is the official language of Gujarat. The Gujarati language is...
-
Telugu-English Abusive Comment Detection Using XLMRoBERTa and mBERT
The proliferation of social media platforms has enabled users to express their thoughts and opinions freely, but it has also given rise to the...
-
An Intelligent Mobile System for Monitoring Relapse of Depression
Depression is a common psychological disorder with high relapse rate in modern society. Due to weak self-perception and fear of public bias, most...
-
KDC: New Dataset for Kannada Document Categorization
In recent years, multilingual texts are blooming in the digital world. Analysis of these multilingual texts for various computational intelligence...
-
Design of a Morphological Generator for an English to Indian Languages in a Declension Rule-Based Machine Translation System
Morphology is a branch of linguistics that deals with the internal structure of words in a natural language. Any word in a natural language is...
-
Development of Multi-lingual Models for Detecting Hope Speech Texts from Social Media Comments
Comments on social media can be written in any number of languages, and many of them may also be written in languages with few resources. Hope Speech...
-
Impact of Transformers on Multilingual Fake News Detection for Tamil and Malayalam
Due to the availability of the technology stack for implementing state of the art neural networks, fake news or fake information classification...
-
FA-Net: fused attention-based network for Hindi English code-mixed offensive text classification
Widespread usage of social media platforms like Twitter, Facebook, and YouTube allows sharing of opinions and suggestions across countries. On the...
-
An approach to automatic classification of hate speech in sports domain on social media
Hate Speech encompasses different forms of trolling, bullying, harassment, and threats directed against specific individuals or groups. This...
-
Analytical developments for the Homer Multitext: palaeography, orthography, morphology, prosody, semantics
We describe ongoing development for The Homer Multitext focusing on the interlocking challenges of automated analysis of diplomatic manuscript...
-
Addressing data sparsity for neural machine translation between morphologically rich languages
Translating between morphologically rich languages is still challenging for current machine translation systems. In this paper, we experiment with...
-
Resources and components for gujarati NLP systems: a survey
Natural Language Processing (NLP) represents the task of automatic handling of natural human language by machines. There is a large spectrum of...
-
A survey of hate speech detection in Indian languages
With the enormous increase in accessibility of high-speed internet, the number of social media users is increasing rapidly. Due to a lack of proper...
-
Forensic Analysis of Text and Messages in Smartphones by a Unification Rosetta Stone Procedure
In this paper we introduce an innovative application of translation techniques applied to the problem of forensics analysis of smartphones. This...
-
Chatbot in Arabic language using seq to seq model
A conversational agent (chatbot) is a software that can communicate with humans using natural language. Conversation modeling is an extremely...
-
Bridging the Gap: Towards Linguistic Resource Development for the Low-Resource Lambani Language
Language technology development is crucial for many downstream applications such as machine translation and language understanding. The lack of...
-
Opinion Classification on Code-mixed Tamil Language
User Sentiment Analysis (SA) is an interesting application of Natural Language Processing (NLP) to analyze the opinions of an individual. The user's...
-
A survey of historical document image datasets
This paper presents a systematic literature review of image datasets for document image analysis, focusing on historical documents, such as...