![Loading...](https://link.springer.com/static/c4a417b97a76cc2980e3c25e2271af3129e08bbe/images/pdf-preview/spacer.gif)
-
Article
An n-gram-based approach for detecting approximately duplicate database records
Detecting and eliminating duplicate records is one of the major tasks for improving data quality. The task, however, is not as trivial as it seems since various errors, such as character insertion, deletion, t...
-
Chapter and Conference Paper
Enhancing XML Data Processing in Relational System with Indices
Using relational database to query XML documents is becoming a common and viable practice. To meet the request of retrieving XML data on Web, which are usually semi-structured or unstructured, the current data...
-
Chapter and Conference Paper
Optimizing Classifiers by Genetic Algorithm
The paper focuses on methods of optimizing a single classifier and combining multiple classifiers by genetic algorithms (GAs). The method uses both the strategies of stacking and GAs to enhance the predictive ...