Text mining analysis of translation, social communication and literary writing for Turkish

buir.advisorCan, Fazlı
dc.contributor.authorÇalışkan, Sevil
dc.date.accessioned2021-01-12T11:24:51Z
dc.date.available2021-01-12T11:24:51Z
dc.date.copyright2020-12
dc.date.issued2020-12
dc.date.submitted2021-01-11
dc.descriptionCataloged from PDF version of article.en_US
dc.descriptionIncludes bibliographical references (leaves 93-103).en_US
dc.description.abstractText mining is an important research area considering the increase in text generation and the need for analysis. Text mining in Turkish is still not a wellinvested research area, compared to the other languages. In this thesis, we analyze different types of Turkish text from different points of views, having an overall review on text mining in Turkish at the end. First, we analyze the translation quality of a Turkish novel, My Names is Red novel, to English, French, and Spanish with the features generated for each chapter. With the proposed method, translation loyalties to the original text can be quantified without any parallel comparisons. Then, we analyze the Turkish spoken texts of 98 people in different age groups in terms of gender and age attributes of the speakers. We also analyze the difference between written and spoken texts in Turkish. Results show that it is possible to predict the attributes of the speaker from the spoken text and written and spoken texts are significantly different in terms of stylometric measures. Later on, we make an assessment on cross-lingual transferring performances of multilingual networks from English to Turkish. We see that transferring is possible; however zero-shot cross-lingual transferring still has its way to be competitive with monolingual networks for Turkish. Lastly, we conduct a time-based stylometric analysis of Ahmet Hamdi Tanpınar’s works. We see that Ahmet Hamdi Tanpınar shows some differences compared to his contemporaries.en_US
dc.description.statementofresponsibilityby Sevil Çalışkanen_US
dc.embargo.release2021-07-11
dc.format.extentxvi, 103 leaves : charts ; 30 cm.en_US
dc.identifier.itemidB125495
dc.identifier.urihttp://hdl.handle.net/11693/54882
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectText miningen_US
dc.subjectStylometric analysisen_US
dc.subjectSpoken text analysisen_US
dc.subjectDiscourse analysisen_US
dc.subjectCross-lingual learningen_US
dc.subjectTransfer learningen_US
dc.subjectMulti-lingual dataen_US
dc.titleText mining analysis of translation, social communication and literary writing for Turkishen_US
dc.title.alternativeMetin madenciliği ile Türkçede çeviri, sosyal iletişim ve edebi yazı analizien_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
10374084.pdf
Size:
7.95 MB
Format:
Adobe Portable Document Format
Description:
Full printable version

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: