Browsing by Keywords "Near-Duplicate Detection"
Now showing items 1-1 of 1
-
CoDet : a new algorithm for containment and near duplicate detection in text corpora
(Bilkent University, 2012)In this thesis, we investigate containment detection, which is a generalized version of the well known near-duplicate detection problem concerning whether a document is a subset of another document. In text-based ...