Skip to main content

and
  1. No Access

    Article

    An n-gram-based approach for detecting approximately duplicate database records

    Detecting and eliminating duplicate records is one of the major tasks for improving data quality. The task, however, is not as trivial as it seems since various errors, such as character insertion, deletion, t...

    Zeng** Tian, Hongjun Lu, Wenyun Ji in International Journal on Digital Libraries (2002)