dc.contributor.advisor | Can, Fazlı | |
dc.contributor.author | Bağlıoğlu, Özgür | |
dc.date.accessioned | 2016-01-08T19:52:37Z | |
dc.date.available | 2016-01-08T19:52:37Z | |
dc.date.issued | 2009 | |
dc.identifier.uri | http://hdl.handle.net/11693/16498 | |
dc.description | Ankara : The Department of Computer Engineering and the Institute of Engineering and Science of Bilkent University, 2009. | en_US |
dc.description | Thesis (Master's) -- Bilkent University, 2009. | en_US |
dc.description | Includes bibliographical references leaves 57-63 | en_US |
dc.description.abstract | News web pages are an important resource for news consumers since the Internet
provides the most up-to-date information. However, the abundance of this information
is overwhelming. In order to solve this problem, news articles should be organized in
various ways. For example, new event detection (NED) and tracking studies aim to
solve this problem by categorizing news stories according to events. Generally,
important issues are presented at the beginning of news articles. Based on this
observation, we modify the term weighting component of the Okapi similarity measure
in several different ways and use them in NED. We perform numerous experiments in
Turkish using the BilCol2005 test collection that contains 209,305 documents from the
entire year of 2005 and involves several events in which eighty of them are annotated by
humans. In this study, we developed various chronological term ranking (CTR)
functions using term positions with several parameters. Our experimental results show
that CTR in combination with Okapi improves the effectiveness of a baseline system
with a desirable performance up to 13%. We demonstrate that NED using CTR has a
robust performance in different versions of TDT collection generated by N-pass
detection evaluation. The tests indicate that the improvements are statistically
significant. | en_US |
dc.description.statementofresponsibility | Bağlıoğlu, Özgür | en_US |
dc.format.extent | xii, 74 leaves, graphics | en_US |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Chronological term ranking (CTR) | en_US |
dc.subject | First story detection (FSD) | en_US |
dc.subject | New event detection (NED) | en_US |
dc.subject | Performance evaluation | en_US |
dc.subject | TDT | en_US |
dc.subject | Turkish News Test Collection (BilCol2005) | en_US |
dc.subject.lcc | Z699 .B34 2009 | en_US |
dc.subject.lcsh | Information storage and retrieval systems. | en_US |
dc.subject.lcsh | Information retrieval. | en_US |
dc.subject.lcsh | Text processing (Computer science) | en_US |
dc.title | New event detection using chronological term ranking | en_US |
dc.type | Thesis | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.publisher | Bilkent University | en_US |
dc.description.degree | M.S. | en_US |
dc.identifier.itemid | BILKUTUPB116266 | |