Information retrieval on Turkish texts

dc.citation.epage421en_US
dc.citation.issueNumber3en_US
dc.citation.spage407en_US
dc.citation.volumeNumber59en_US
dc.contributor.authorCan, F.en_US
dc.contributor.authorKocberber, S.en_US
dc.contributor.authorBalcik, E.en_US
dc.contributor.authorKaynak, C.en_US
dc.contributor.authorOcalan, H. C.en_US
dc.contributor.authorVursavas, O. M.en_US
dc.date.accessioned2016-02-08T10:10:21Z
dc.date.available2016-02-08T10:10:21Z
dc.date.issued2008-02en_US
dc.departmentDepartment of Computer Engineeringen_US
dc.description.abstractIn this study, we investigate information retrieval (IR) on Turkish texts using a large-scale test collection that contains 408,305 documents and 72 ad hoc queries. We examine the effects of several stemming options and query-document matching functions on retrieval performance. We show that a simple word truncation approach, a word truncation approach that uses language-dependent corpus statistics, and an elaborate lemmatizer-based stemmer provide similar retrieval effectiveness in Turkish IR. We investigate the effects of a range of search conditions on the retrieval performance; these include scalability issues, query and document length effects, and the use of stop-word list in indexing. © 2007 Wiley Periodicals, Inc.en_US
dc.description.provenanceMade available in DSpace on 2016-02-08T10:10:21Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2008en
dc.identifier.doi10.1002/asi.20750en_US
dc.identifier.issn2330-1635en_US
dc.identifier.urihttp://hdl.handle.net/11693/23211en_US
dc.language.isoEnglishen_US
dc.publisherJohn Wiley & Sons, Inc.en_US
dc.relation.isversionofhttp://dx.doi.org/10.1002/asi.20750en_US
dc.source.titleAssociation for Information Science and Technology. Journalen_US
dc.subjectAd hoc networksen_US
dc.subjectQuery processingen_US
dc.subjectScalabilityen_US
dc.subjectStatisticsen_US
dc.subjectText processingen_US
dc.subjectStop-word listen_US
dc.subjectTest collectionen_US
dc.subjectWord truncationen_US
dc.subjectInformation retrievalen_US
dc.titleInformation retrieval on Turkish textsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Information retrieval on turkish texts.pdf
Size:
820.41 KB
Format:
Adobe Portable Document Format
Description:
Full printable version