Information retrieval on Turkish texts
dc.citation.epage | 421 | en_US |
dc.citation.issueNumber | 3 | en_US |
dc.citation.spage | 407 | en_US |
dc.citation.volumeNumber | 59 | en_US |
dc.contributor.author | Can, F. | en_US |
dc.contributor.author | Kocberber, S. | en_US |
dc.contributor.author | Balcik, E. | en_US |
dc.contributor.author | Kaynak, C. | en_US |
dc.contributor.author | Ocalan, H. C. | en_US |
dc.contributor.author | Vursavas, O. M. | en_US |
dc.date.accessioned | 2016-02-08T10:10:21Z | |
dc.date.available | 2016-02-08T10:10:21Z | |
dc.date.issued | 2008-02 | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.description.abstract | In this study, we investigate information retrieval (IR) on Turkish texts using a large-scale test collection that contains 408,305 documents and 72 ad hoc queries. We examine the effects of several stemming options and query-document matching functions on retrieval performance. We show that a simple word truncation approach, a word truncation approach that uses language-dependent corpus statistics, and an elaborate lemmatizer-based stemmer provide similar retrieval effectiveness in Turkish IR. We investigate the effects of a range of search conditions on the retrieval performance; these include scalability issues, query and document length effects, and the use of stop-word list in indexing. © 2007 Wiley Periodicals, Inc. | en_US |
dc.description.provenance | Made available in DSpace on 2016-02-08T10:10:21Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2008 | en |
dc.identifier.doi | 10.1002/asi.20750 | en_US |
dc.identifier.issn | 2330-1635 | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/23211 | en_US |
dc.language.iso | English | en_US |
dc.publisher | John Wiley & Sons, Inc. | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1002/asi.20750 | en_US |
dc.source.title | Association for Information Science and Technology. Journal | en_US |
dc.subject | Ad hoc networks | en_US |
dc.subject | Query processing | en_US |
dc.subject | Scalability | en_US |
dc.subject | Statistics | en_US |
dc.subject | Text processing | en_US |
dc.subject | Stop-word list | en_US |
dc.subject | Test collection | en_US |
dc.subject | Word truncation | en_US |
dc.subject | Information retrieval | en_US |
dc.title | Information retrieval on Turkish texts | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Information retrieval on turkish texts.pdf
- Size:
- 820.41 KB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version