Automatic categorization of Ottoman poems
Date
2014
Source Title
Glottotheory: international journal of theoretical linguistics
Print ISSN
1337-7892
Electronic ISSN
2196-6907
Publisher
De Gruyter Akademie Forschung
Volume
4
Issue
2
Pages
1 - 15
Language
English
Type
Article
Abstract
Authorship attribution and identification of the time period of literary works are fundamental problems
in quantitative analysis of languages. We investigate two fundamentally different machine learning text
categorization methods, Support Vector Machines (SVM) and Naïve Bayes (NB), and several style
markers in the categorization of Ottoman poems according to their poets and time periods. We use the
collected works (divans) of ten different Ottoman poets: two poets from each of the five different
hundred-year periods ranging from the 15th to the 19th century. Our experimental evaluation and statistical
assessments show that it is possible to obtain highly accurate and reliable classifications and to
distinguish the methods and style markers in terms of their effectiveness.
Keywords
Ottoman poems
Authorship attribution
Automatic text categorization
Historical literary texts
Ottoman