-
Chapter and Conference Paper
Creating a Persian-English Comparable Corpus
Multilingual corpora are valuable resources for cross-language information retrieval and are available in many language pairs. However the Persian language does not have rich multilingual resources due to some...
-
Chapter and Conference Paper
Exploiting Multiple Translation Resources for English-Persian Cross Language Information Retrieval
One of the most important issues in Cross Language Information Retrieval (CLIR) which affects the performance of CLIR systems is how to exploit available translation resources. This issue can be more challengi...
-
Chapter and Conference Paper
SS4MCT: A Statistical Stemmer for Morphologically Complex Texts
There have been multiple attempts to resolve various inflection matching problems in information retrieval. Stemming is a common approach to this end. Among many techniques for stemming, statistical stemming h...
-
Chapter and Conference Paper
Dimension Projection Among Languages Based on Pseudo-Relevant Documents for Query Translation
Using top-ranked documents retrieved in response to a query of a user has been shown to be an effective approach to improve the quality of query translation in dictionary-based cross-language information retri...
-
Chapter and Conference Paper
Persianp: A Persian Text Processing Toolbox
This paper describes Persianp Toolbox, an integrated Persian text processing system and easily used in other software applications. The toolbox which provides fundamental Persian text processing steps include...
-
Chapter and Conference Paper
Multiple System Combination for PersoArabic-Latin Transliteration
In this paper, we model a PersoArabic to Latin transliteration system as grapheme-to-phoneme (G2P) and word lattice methods combined with statistical machine translation (SMT). Persian is an Indo-Iranian branc...
-
Chapter and Conference Paper
Algorithms and Corpora for Persian Plagiarism Detection
The task of plagiarism detection is to find passages of text-reuse in a suspicious document. This task is of increasing relevance, since scholars around the world take advantage of the fact that information ab...
-
Chapter and Conference Paper
LICD: A Language-Independent Approach for Aspect Category Detection
Aspect-based sentiment analysis (ABSA) deals with processing and summarizing customer reviews and has been a topic of interest in recent years. Given a set of predefined categories, Aspect Category Detection (...
-
Article
Persian offensive language detection
With the proliferation of social networks and their impact on human life, one of the rising problems in this environment is the rise in verbal and written insults and hatred. As one of the significant platform...