On-line new event detection and clustering using the concepts of the cover coefficient-based clustering methodology

buir.advisorCan, Fazlı
dc.contributor.authorVural, Ahmet
dc.date.accessioned2016-07-01T10:56:30Z
dc.date.available2016-07-01T10:56:30Z
dc.date.issued2002
dc.descriptionCataloged from PDF version of article.en_US
dc.description.abstractIn this study, we use the concepts of the cover coefficient-based clustering methodology (C3 M) for on-line new event detection and event clustering. The main idea of the study is to use the seed selection process of the C3 M algorithm for the purpose of detecting new events. Since C3 M works in a retrospective manner, we modify the algorithm to work in an on-line environment. Furthermore, in order to prevent producing oversized event clusters, and to give equal chance to all documents to be the seed of a new event, we employ the window size concept. Since we desire to control the number of seed documents, we introduce a threshold concept to the event clustering algorithm. We also use the threshold concept, with a little modification, in the on-line event detection. In the experiments we use TDT1 corpus, which is also used in the original topic detection and tracking study. In event clustering and event detection, we use both binary and weighted versions of TDT1 corpus. With the binary implementation, we obtain better results. When we compare our on-line event detection results to the results of UMASS approach, we obtain better performance in terms of false alarm rates.en_US
dc.description.provenanceMade available in DSpace on 2016-07-01T10:56:30Z (GMT). No. of bitstreams: 1 0002229.pdf: 843371 bytes, checksum: cd26e8cd70589295f2c31754e7d93f02 (MD5) Previous issue date: 2002en
dc.description.statementofresponsibilityVural, Ahmeten_US
dc.format.extentxiii, 68 leaves, illustrations, 30 cmen_US
dc.identifier.itemidBILKUTUPB067651
dc.identifier.urihttp://hdl.handle.net/11693/29247
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectClusteringen_US
dc.subjecton-line event clusteringen_US
dc.subjecton-line event detectionen_US
dc.subject.lccQA278 .V87 2002en_US
dc.subject.lcshCluster analysis Data processing.en_US
dc.titleOn-line new event detection and clustering using the concepts of the cover coefficient-based clustering methodologyen_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0002229.pdf
Size:
823.6 KB
Format:
Adobe Portable Document Format
Description:
Full printable version