Language change quantification using time-separated parallel translations

Date
2007
Authors
Altintas, K.
Can F.
Patton J.M.
Editor(s)
Advisor
Supervisor
Co-Advisor
Co-Supervisor
Instructor
Source Title
Literary and Linguistic Computing
Print ISSN
0268-1145
Electronic ISSN
Publisher
Volume
22
Issue
4
Pages
375 - 393
Language
English
Journal Title
Journal ISSN
Volume Title
Series
Abstract

We introduce a systematic approach to language change quantification by studying unconsciously used language features in time-separated parallel translations. For this purpose, we use objective style markers such as vocabulary richness and lengths of words, word stems and suffixes, and employ statistical methods to measure their changes over time. In this study, we focus on the change in Turkish in the second half of the twentieth century. To obtain word stems, we first introduce various stemming techniques and show that they are highly effective. Our statistical analyses show that over time, for both text and lexicon, the length of Turkish words has become significantly longer, and word stems have become significantly shorter. We also show that suffix lengths have become significantly longer for types and the vocabulary richness based on word stems has shrunk significantly. These observations indicate that in contemporary Turkish one would use more suffixes to compensate for the fewer stems to preserve the expressive power of the language at the same level. Our approach can be adapted for quantifying the change in other languages. © The Author 2007. Published by Oxford University Press on behalf of ALLC and ACH. All rights reserved.

Course
Other identifiers
Book Title
Keywords
Citation
Published Version (Please cite this version)