![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
-
Chapter and Conference Paper
Approximate Graph Schema Extraction for Semi-structured Data
Semi-structured data are typically represented in the form of labeled directed graphs. They are self-describing and schemaless. The lack of a schema renders query processing over semi-structured data expensive...
-
Chapter and Conference Paper
A New Conceptual Graph Generated Algorithm for Semi-structured Databases
As the World Wide Web grows dramatically in recent years, there is increasing interest in semi-structured data on the web. Semi-structured data are usually represented in graph format, many graph schemas have ...
-
Chapter and Conference Paper
Phoneme-Based Transliteration of Foreign Names for OOV Problem
A proper noun dictionary is never complete rendering name translation from English to Chinese ineffective. One way to solve this problem is not to rely on a dictionary alone but to adopt automatic translation ...
-
Chapter and Conference Paper
Improving Text Similarity Measurement by Critical Sentence Vector Model
We propose the Critical Sentence Vector Model (CSVM), a novel model to measure text similarity. The CSVM accounts for the structural and semantic information of the document. Compared to existing methods based on...
-
Chapter and Conference Paper
Accelerating XML Structural Join by Partitioning
Structural join is the core part of XML queries and has a significant impact on the performance of XML queries, several classical structural join algorithms have been proposed such as Stack-tree join and XR-Tree ...
-
Chapter and Conference Paper
Improving Transliteration with Precise Alignment of Phoneme Chunks and Using Contextual Features
Automatic transliteration of foreign names is basically regarded as a diminutive clone of the machine translation (MT) problem. It thus follows IBM’s conventional MT models under the source-channel framework. ...
-
Chapter and Conference Paper
Information Flow Analysis with Chinese Text
This article investigates the effectiveness of an information inference mechanism on Chinese text. The information inference derives implicit associations via computation of information flow on a high dimensio...
-
Chapter and Conference Paper
A Preliminary Work on Classifying Time Granularities of Temporal Questions
Temporal question classification assigns time granularities to temporal questions ac-cording to their anticipated answers. It is very important for answer extraction and verification in the literature of tempo...
-
Chapter and Conference Paper
Binarization Approaches to Email Categorization
Email categorization becomes very popular today in personal information management. However, most n-way classification methods suffer from feature unevenness problem, namely, features learned from training sam...
-
Chapter and Conference Paper
Building Document Graphs for Multiple News Articles Summarization: An Event-Based Approach
Since most of news articles report several events and these events are referred in many related documents, we propose an event-based approach to visualize documents as graph on different conceptual granulariti...
-
Chapter and Conference Paper
Fast Structural Join with a Location Function
A structural join evaluates structural relationship (parent-child or ancestor-descendant) between xml elements. It serves as an important computation unit in xml pattern matching, such as twig joins. There exists...
-
Chapter and Conference Paper
Clique Percolation Method for Finding Naturally Cohesive and Overlap** Document Clusters
Techniques for find document clusters mostly depend on models that impose strong explicit and/or implicit priori assumptions. As a consequence, the clustering effects tend to be unnatural and stray away from t...
-
Chapter and Conference Paper
Natural Document Clustering by Clique Percolation in Random Graphs
Document clustering techniques mostly depend on models that impose explicit and/or implicit priori assumptions as to the number, size, disjunction characteristics of clusters, and/or the probability distributi...
-
Chapter and Conference Paper
An Improved Method for Finding Bilingual Collocation Correspondences from Monolingual Corpora
Bilingual collocation correspondence is helpful to machine translation and second language learning. Existing techniques for identifying Chinese-English collocation correspondence suffer from two major problem...
-
Chapter and Conference Paper
Event-Based Summarization Using Time Features
We investigate whether time features help to improve event-based summarization. In this paper, events are defined as event terms and the associated event elements. While event terms represent the actions thems...
-
Chapter and Conference Paper
Learning MultiLinguistic Knowledge for Opinion Analysis
Most existing opinion analysis techniques used word-level sentiment knowledge but lack the learning capacity on the behaviors of context-dependent opinion words. Meanwhile, the use of collocation-level sentime...
-
Chapter and Conference Paper
Opinion Target Network and Bootstrap** Method for Chinese Opinion Target Extraction
Opinion mining systems suffer a great loss when unknown opinion targets constantly appear in newly composed reviews. Previous opinion target extraction methods typically consider human-compiled opinion targets...
-
Chapter and Conference Paper
Joint Ranking for Multilingual Web Search
Ranking for multilingual information retrieval (MLIR) is a task to rank documents of different languages solely based on their relevancy to the query regardless of query’s language. Existing approaches are foc...
-
Chapter and Conference Paper
A Chinese Sentence Compression Method for Opinion Mining
The Chinese sentences in news articles are usually very long, which set up obstacles for further opinion mining steps. Sentence compression is the task of producing a brief summary at the sentence level. Conve...
-
Chapter and Conference Paper
Summarizing and Extracting Online Public Opinion from Blog Search Results
As more and more people are willing to publish their attitudes and feelings in blogs, how to provide an efficient way to summarize and extract public opinion in blogosphere has become a major concern for both ...