-
Chapter and Conference Paper
TEP: Tehran English-Persian Parallel Corpus
Parallel corpora are one of the key resources in natural language processing. In spite of their importance in many multi-lingual applications, no large-scale English-Persian corpus has been made available so f...
-
Chapter and Conference Paper
An Unsupervised Approach for Linking Automatically Extracted and Manually Crafted LTAGs
Though the lack of semantic representation of automatically extracted LTAGs is an obstacle in using these formalism, due to the advent of some powerful statistical parsers that were trained on them, these gram...
-
Chapter and Conference Paper
Unsupervised Identification of Persian Compound Verbs
One of the main tasks related to multiword expressions (MWEs) is compound verb identification. There have been so many works on unsupervised identification of multiword verbs in many languages, but there has n...
-
Chapter and Conference Paper
Applying Sentiment and Social Network Analysis in User Modeling
The idea of applying a conjunction of sentiment and social network analysis to improve the performance of applications has recently attracted attention of researchers. In widely used online shop** websites, ...
-
Chapter and Conference Paper
ELEXR: Automatic Evaluation of Machine Translation Using Lexical Relationships
This paper proposes ELEXR, a novel metric to evaluate machine translation (MT). In our proposed method, we extract lexical co-occurrence relationships of a given reference translation (Ref) and its correspondi...
-
Chapter and Conference Paper
Exploiting Multiple Translation Resources for English-Persian Cross Language Information Retrieval
One of the most important issues in Cross Language Information Retrieval (CLIR) which affects the performance of CLIR systems is how to exploit available translation resources. This issue can be more challengi...
-
Chapter and Conference Paper
Modeling Persian Verb Morphology to Improve English-Persian Machine Translation
Morphological analysis is an essential process in translating from a morphologically poor language such as English into a morphologically rich language such as Persian. In this paper, first we analyze the outp...
-
Chapter and Conference Paper
SS4MCT: A Statistical Stemmer for Morphologically Complex Texts
There have been multiple attempts to resolve various inflection matching problems in information retrieval. Stemming is a common approach to this end. Among many techniques for stemming, statistical stemming h...
-
Chapter and Conference Paper
Persianp: A Persian Text Processing Toolbox
This paper describes Persianp Toolbox, an integrated Persian text processing system and easily used in other software applications. The toolbox which provides fundamental Persian text processing steps include...
-
Chapter and Conference Paper
Multiple System Combination for PersoArabic-Latin Transliteration
In this paper, we model a PersoArabic to Latin transliteration system as grapheme-to-phoneme (G2P) and word lattice methods combined with statistical machine translation (SMT). Persian is an Indo-Iranian branc...
-
Chapter and Conference Paper
Algorithms and Corpora for Persian Plagiarism Detection
The task of plagiarism detection is to find passages of text-reuse in a suspicious document. This task is of increasing relevance, since scholars around the world take advantage of the fact that information ab...
-
Article
RACER: accurate and efficient classification based on rule aggregation approach
Rule-based classification is one of the most important topics in the field of data mining due to its wide applications. This article presents a novel rule-based classifier called RACER (Rule Aggregating ClassifiE...
-
Chapter and Conference Paper
Pruned Graph Neural Network for Short Story Ordering
Text coherence is a fundamental problem in natural language generation and understanding. Organizing sentences into an order that maximizes coherence is known as sentence ordering. This paper is proposing a ne...
-
Article
Persian offensive language detection
With the proliferation of social networks and their impact on human life, one of the rising problems in this environment is the rise in verbal and written insults and hatred. As one of the significant platform...