Information Retrieval Technology
Asia Information Retrieval Symposium, AIRS 2004, Beijing, China, October 18-20, 2004. Revised Selected Papers
Chapter and Conference Paper
Semi-structured data are typically represented in the form of labeled directed graphs. They are self-describing and schemaless. The lack of a schema renders query processing over semi-structured data expensive...
Chapter and Conference Paper
As the World Wide Web has grown dramatically in recent years, there is increasing interest in semi-structured data on the web. Semi-structured data are usually represented in graph format; many graph schemas have ...
Book and Conference Proceedings
Asia Information Retrieval Symposium, AIRS 2004, Beijing, China, October 18-20, 2004. Revised Selected Papers
Chapter and Conference Paper
We propose the Critical Sentence Vector Model (CSVM), a novel model to measure text similarity. The CSVM accounts for the structural and semantic information of the document. Compared to existing methods based on...
Chapter and Conference Paper
Automatic transliteration of foreign names is basically regarded as a diminutive clone of the machine translation (MT) problem. It thus follows IBM’s conventional MT models under the source-channel framework. ...
Chapter and Conference Paper
A structural join evaluates the structural relationship (parent-child or ancestor-descendant) between XML elements. It serves as an important computation unit in XML pattern matching, such as twig joins. There exists...
Chapter and Conference Paper
Document clustering techniques mostly depend on models that impose explicit and/or implicit a priori assumptions as to the number, size, and disjunction characteristics of clusters, and/or the probability distributi...
Chapter and Conference Paper
Opinion mining systems suffer a great loss when unknown opinion targets constantly appear in newly composed reviews. Previous opinion target extraction methods typically consider human-compiled opinion targets...
Chapter and Conference Paper
Ranking for multilingual information retrieval (MLIR) is the task of ranking documents in different languages solely by their relevance to the query, regardless of the query’s language. Existing approaches are foc...
Chapter and Conference Paper
Chinese sentences in news articles are usually very long, which poses obstacles for subsequent opinion mining steps. Sentence compression is the task of producing a brief summary at the sentence level. Conve...
Chapter and Conference Paper
As more and more people are willing to publish their attitudes and feelings in blogs, how to efficiently summarize and extract public opinion in the blogosphere has become a major concern for both ...
Chapter and Conference Paper
Adaptation techniques based on importance weighting have been shown to be effective for RankSVM and RankNet, viz., each training instance is assigned a target weight denoting its importance to the target domain and incorpor...
Chapter and Conference Paper
Topic-specific opinion summarization (TOS) plays an important role in helping users digest online opinions. It aims to extract a summary of opinion expressions specified by a query, i.e. topic-specific o...
Article
Recently, blogs have emerged as the major platform for people to express their feelings and sentiments in the age of Web 2.0. The common emotions, which reflect people’s collective and overall sentiments, are ...
Chapter and Conference Paper
Microblogs have become a popular platform for people to share their ideas, information and opinions. In addition to textual content data, social relations and user behaviors in microblogs provide us with additional li...
Chapter and Conference Paper
One of the most important properties of social networking sites is their reachability: there is no physical location constraint. In addition, all social networking sites allow us to search for people with common interests, ...
Chapter and Conference Paper
Microblogging websites have moved to the center of information production and diffusion, where people can get useful information from other users’ microblog posts. In the era of Big Data, we are overwhelm...
Chapter and Conference Paper
Recurrent Neural Networks (RNNs) have become increasingly popular for the task of language understanding. In this task, a semantic tagger is deployed to associate a semantic label with each word in an input sequ...
Chapter and Conference Paper
User influence analysis in social media has attracted tremendous interest from both sociology and social data mining, and has recently become a hot topic. However, most approaches ignore the temporal chara...
Chapter and Conference Paper
Automatic rumor detection for events on online social media has attracted considerable attention in recent years. Usually, the events on social media are divided into several time segments, and for each segmen...