-
Chapter and Conference Paper
Creating a Persian-English Comparable Corpus
Multilingual corpora are valuable resources for cross-language information retrieval and are available in many language pairs. However, the Persian language does not have rich multilingual resources due to some...
-
Chapter and Conference Paper
TEP: Tehran English-Persian Parallel Corpus
Parallel corpora are one of the key resources in natural language processing. In spite of their importance in many multi-lingual applications, no large-scale English-Persian corpus has been made available so f...
-
Chapter and Conference Paper
An Unsupervised Approach for Linking Automatically Extracted and Manually Crafted LTAGs
Though the lack of semantic representation of automatically extracted LTAGs is an obstacle in using this formalism, due to the advent of some powerful statistical parsers that were trained on them, these gram...
-
Chapter and Conference Paper
Unsupervised Identification of Persian Compound Verbs
One of the main tasks related to multiword expressions (MWEs) is compound verb identification. There have been so many works on unsupervised identification of multiword verbs in many languages, but there has n...
-
Chapter and Conference Paper
Applying Sentiment and Social Network Analysis in User Modeling
The idea of applying a conjunction of sentiment and social network analysis to improve the performance of applications has recently attracted the attention of researchers. In widely used online shopping websites, ...
-
Chapter and Conference Paper
Dimension Projection Among Languages Based on Pseudo-Relevant Documents for Query Translation
Using top-ranked documents retrieved in response to a query of a user has been shown to be an effective approach to improve the quality of query translation in dictionary-based cross-language information retri...
-
Chapter and Conference Paper
Persianp: A Persian Text Processing Toolbox
This paper describes Persianp Toolbox, an integrated Persian text processing system that is easily used in other software applications. The toolbox, which provides fundamental Persian text processing steps, include...
-
Chapter and Conference Paper
Multiple System Combination for PersoArabic-Latin Transliteration
In this paper, we model a PersoArabic to Latin transliteration system as grapheme-to-phoneme (G2P) and word lattice methods combined with statistical machine translation (SMT). Persian is an Indo-Iranian branc...
-
Chapter and Conference Paper
LICD: A Language-Independent Approach for Aspect Category Detection
Aspect-based sentiment analysis (ABSA) deals with processing and summarizing customer reviews and has been a topic of interest in recent years. Given a set of predefined categories, Aspect Category Detection (...
-
Article
RACER: accurate and efficient classification based on rule aggregation approach
Rule-based classification is one of the most important topics in the field of data mining due to its wide applications. This article presents a novel rule-based classifier called RACER (Rule Aggregating ClassifiE...
-
Article
Solving submodular text processing problems using influence graphs
Submodular functions appear in a considerable number of important natural language processing problems such as text summarization and dataset selection. Current graph-based approaches to solving such problems ...
-
Article
MLPR: Efficient influence maximization in linear threshold propagation model using linear programming
Influence maximization is an important research topic in social networks that has different applications such as analyzing spread of rumors, interest, adoption of innovations, and feed ranking. The goal is to ...