Skip to main content

and
  1. No Access

    Article

    An n-gram-based approach for detecting approximately duplicate database records

    Detecting and eliminating duplicate records is one of the major tasks for improving data quality. The task, however, is not as trivial as it seems since various errors, such as character insertion, deletion, t...

    Zeng** Tian, Hongjun Lu, Wenyun Ji in International Journal on Digital Libraries (2002)

  2. No Access

    Chapter and Conference Paper

    Enhancing XML Data Processing in Relational System with Indices

    Using relational database to query XML documents is becoming a common and viable practice. To meet the request of retrieving XML data on Web, which are usually semi-structured or unstructured, the current data...

    Yuqi Liang, Aoying Zhou, Shihui Zheng in Advances in Web-Age Information Management (2001)

  3. No Access

    Chapter and Conference Paper

    Optimizing Classifiers by Genetic Algorithm

    The paper focuses on methods of optimizing a single classifier and combining multiple classifiers by genetic algorithms (GAs). The method uses both the strategies of stacking and GAs to enhance the predictive ...

    Wenyun Ji, Liang Zhang, Wen ** in Web-Age Information Management (2000)