Semantic similarity between Turkish and European languages using word embeddings
Author
Şenel, Lütfü Kerem
Yücesoy, V.
Koç, A.
Çukur, Tolga
Date
2017Source Title
Proceedings of the IEEE 25th Signal Processing and Communications Applications Conference, SIU 2017
Publisher
IEEE
Language
Turkish
Type
Conference PaperItem Usage Stats
182
views
views
152
downloads
downloads
Abstract
Representation of words coming from vocabulary of a language as real vectors in a high dimensional space is called as word embeddings. Word embeddings are proven to be successful in modelling semantic relations between words and numerous natural language processing applications. Although developed mainly for English, word embeddings perform well for many other languages. In this study, semantic similarity between Turkish (two different corpora) and five basic European languages (English, German, French, Spanish, Italian) is calculated using word embeddings over a fixed vocabulary, obtained results are verified using statistical testing. Also, the effect of using different corpora, and additional preprocess steps on the performance of word embeddings on similarity and analogy test sets prepared for Turkish is studied.
Keywords
Natural language processingSemantic similarity between languages
Word embeddings
Linguistics
Modeling languages
Semantics
European languages
High dimensional spaces
Semantic relations
Permalink
http://hdl.handle.net/11693/37594Published Version (Please cite this version)
http://dx.doi.org/10.1109/SIU.2017.7960365Collections
Related items
Showing items related by title, author, creator and subject.
-
The systems biology graphical notation
Le Novère, N.; Hucka, M.; Mi, H.; Moodie, S.; Schreiber, F.; Sorokin, A.; Demir, Emek; Wegner, K.; Aladjem, M. I.; Wimalaratne, S. M.; Bergman, F. T.; Gauges, R.; Ghazal, P.; Kawaji, H.; Li, L.; Matsuoka, Y.; Villéger, A.; Boyd, S. E.; Calzone, L.; Courtot, M.; Doğrusöz, Uğur; Freeman, T. C.; Funahashi, A.; Ghosh, S.; Jouraku, A.; Kim, S.; Kolpakov, F.; Luna, A.; Sahle, S.; Schmidt, E.; Watterson, S.; Wu, G.; Goryanin, I.; Kell, D. B.; Sander, C.; Sauro, H.; Snoep, J. L.; Kohn, K.; Kitano, H. (Nature Publishing Group, 2009-08)Circuit diagrams and Unified Modeling Language diagrams are just two examples of standard visual languages that help accelerate work by promoting regularity, removing ambiguity and enabling software tool support for ... -
Domain specific language for deployment of parallel applications on parallel computing platforms
Arkın, E.; Tekinerdoğan, Bedir (Association for Computing Machinery, 2014-08)To increase the computing performance the current trend is towards applying parallel computing in which parallel tasks are executed on multiple nodes. The deployment of tasks on the computing platform usually impacts the ... -
Architecture framework for mapping parallel algorithms to parallel computing platforms
Tekinerdogan, Bedir; Arkin, E. (CEUR-WS, 2013)Mapping parallel algorithms to parallel computing platforms requires several activities such as the analysis of the parallel algorithm, the definition of the logical configuration of the platform, and the mapping of the ...