  1. No Access

    Article

    An n-gram-based approach for detecting approximately duplicate database records

    Detecting and eliminating duplicate records is one of the major tasks for improving data quality. The task, however, is not as trivial as it seems since various errors, such as character insertion, deletion, t...

    Zeng** Tian, Hongjun Lu, Wenyun Ji in International Journal on Digital Libraries (2002)

  2. No Access

    Chapter and Conference Paper

    Optimizing Classifiers by Genetic Algorithm

    The paper focuses on methods of optimizing a single classifier and combining multiple classifiers by genetic algorithms (GAs). The method uses both the strategies of stacking and GAs to enhance the predictive ...

    Wenyun Ji, Liang Zhang, Wen ** in Web-Age Information Management (2000)