Updating large itemsets with early pruning

buir.supervisorArkun, Erol
dc.contributor.authorAyan, Necip Fazıl
dc.date.accessioned2016-01-08T20:16:25Z
dc.date.available2016-01-08T20:16:25Z
dc.date.copyright1999-07
dc.date.issued1999-07
dc.description Cataloged from PDF version of article. en_US
dc.descriptionThesis (Master's): Department of Computer Engineering and Information Science and the Institute of Engineering and Science of Bilkent University, 1999.en_US
dc.descriptionIncludes bibliographical references (leaves 66-75).en_US
dc.description.abstractWith the computerization of many business and government transactions, huge amounts of data have been stored in computers. The e.xisting database systems do not provide the users with the necessary tools and functionalities to capture all stored information easily. Therefore, automatic knowledge discovery techniques have been developed to capture and use the voluminous information hidden in large databases. Discovery of association rules is an important class of data mining, which is the process of extracting interesting and frequent patterns from the data. Association rules aim to capture the co-occurrences of items, and have wide applicability in many areas. Discovering association rules is based on the computation of large itemsets (set of items that occur frequently in the database) efficiently, and is a computationally expensive operation in large databases. Thus, maintenance of them in large dynamic databases is an important issue. In this thesis, we propose an efficient algorithm, to update large itemsets by considering the set of previously discovered itemsets. The main idea is to prune an itemset as soon as it is understood to be small in the updated database, and to keep the set of candidate large itemsets as small as possible. The proposed algorithm outperforms the existing update algorithms in terms of the number of scans over the databases, and the number of candidate large itemsets generated and counted. Moreover, it can be applied to other data mining tasks that are based on large itemset framework easily.
dc.description.provenanceMade available in DSpace on 2016-01-08T20:16:25Z (GMT). No. of bitstreams: 1 1.pdf: 78510 bytes, checksum: d85492f20c2362aa2bcf4aad49380397 (MD5)en
dc.description.statementofresponsibilityNecip Fazıl Ayanen_US
dc.format.extentxi, 75 leaves ; 30 cm.en_US
dc.identifier.urihttp://hdl.handle.net/11693/18117
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectData mining
dc.subjectAssociation rules
dc.subjectLarge itemsets
dc.subjectUpdate of large itemsets
dc.subjectEarly pruning
dc.titleUpdating large itemsets with early pruningen_US
dc.title.alternativeErken eliminasyon ile yoğun nesne kümelerinin güncellenmesi
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
B049042.pdf
Size:
2.45 MB
Format:
Adobe Portable Document Format
Description:
Full printable version