New event detection using chronological term ranking

buir.advisorCan, Fazlı
dc.contributor.authorBağlıoğlu, Özgür
dc.date.accessioned2016-01-08T19:52:37Z
dc.date.available2016-01-08T19:52:37Z
dc.date.issued2009
dc.descriptionAnkara : The Department of Computer Engineering and the Institute of Engineering and Science of Bilkent University, 2009.en_US
dc.descriptionThesis (Master's) -- Bilkent University, 2009.en_US
dc.descriptionIncludes bibliographical references leaves 57-63en_US
dc.description.abstractNews web pages are an important resource for news consumers since the Internet provides the most up-to-date information. However, the abundance of this information is overwhelming. In order to solve this problem, news articles should be organized in various ways. For example, new event detection (NED) and tracking studies aim to solve this problem by categorizing news stories according to events. Generally, important issues are presented at the beginning of news articles. Based on this observation, we modify the term weighting component of the Okapi similarity measure in several different ways and use them in NED. We perform numerous experiments in Turkish using the BilCol2005 test collection that contains 209,305 documents from the entire year of 2005 and involves several events in which eighty of them are annotated by humans. In this study, we developed various chronological term ranking (CTR) functions using term positions with several parameters. Our experimental results show that CTR in combination with Okapi improves the effectiveness of a baseline system with a desirable performance up to 13%. We demonstrate that NED using CTR has a robust performance in different versions of TDT collection generated by N-pass detection evaluation. The tests indicate that the improvements are statistically significant.en_US
dc.description.provenanceMade available in DSpace on 2016-01-08T19:52:37Z (GMT). No. of bitstreams: 1 0006330.pdf: 831048 bytes, checksum: 026a087a7384f5149b679cb69fe38b6f (MD5)en
dc.description.statementofresponsibilityBağlıoğlu, Özgüren_US
dc.format.extentxii, 74 leaves, graphicsen_US
dc.identifier.itemidBILKUTUPB116266
dc.identifier.urihttp://hdl.handle.net/11693/16498
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectChronological term ranking (CTR)en_US
dc.subjectFirst story detection (FSD)en_US
dc.subjectNew event detection (NED)en_US
dc.subjectPerformance evaluationen_US
dc.subjectTDTen_US
dc.subjectTurkish News Test Collection (BilCol2005)en_US
dc.subject.lccZ699 .B34 2009en_US
dc.subject.lcshInformation storage and retrieval systems.en_US
dc.subject.lcshInformation retrieval.en_US
dc.subject.lcshText processing (Computer science)en_US
dc.titleNew event detection using chronological term rankingen_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0006330.pdf
Size:
811.57 KB
Format:
Adobe Portable Document Format