Semantic similarity between Turkish and European languages using word embeddings

Şenel, Lütfü Kerem; Yücesoy, V.; Koç, A.; Çukur, Tolga

Date

2017

Abstract

Word embeddings represent the words in a language's vocabulary as real-valued vectors in a high-dimensional space. They have proven successful at modeling semantic relations between words and in numerous natural language processing applications. Although developed mainly for English, word embeddings also perform well for many other languages. In this study, the semantic similarity between Turkish (using two different corpora) and five major European languages (English, German, French, Spanish, and Italian) is calculated using word embeddings over a fixed vocabulary, and the results are verified using statistical testing. The study also examines how the choice of corpus and additional preprocessing steps affect the performance of word embeddings on similarity and analogy test sets prepared for Turkish.
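
The abstract does not state how the similarity between two languages is computed. As an illustration only, the sketch below assumes one common approach: for a fixed vocabulary of aligned words, build each language's matrix of pairwise cosine similarities and correlate the two matrices. The function names, the toy random vectors, and the use of Pearson correlation are all assumptions made for this example, not the authors' method.

```python
import numpy as np


def cosine_similarity_matrix(vectors):
    """Return the pairwise cosine similarity matrix for row vectors."""
    norms = np.linalg.norm(vectors, axis=1, keepdims=True)
    unit = vectors / norms
    return unit @ unit.T


def language_similarity(emb_a, emb_b):
    """Correlate the pairwise-similarity structure of two embedding spaces.

    Assumption: row i of emb_a and row i of emb_b are embeddings of the
    same concept in the fixed vocabulary (e.g. a word and its translation).
    The two spaces may have different dimensionalities.
    """
    sim_a = cosine_similarity_matrix(emb_a)
    sim_b = cosine_similarity_matrix(emb_b)
    # Use each unordered word pair once and skip the trivial diagonal.
    upper = np.triu_indices(sim_a.shape[0], k=1)
    return float(np.corrcoef(sim_a[upper], sim_b[upper])[0, 1])


# Toy data standing in for real embeddings (hypothetical, 6 words x 4 dims).
rng = np.random.default_rng(0)
turkish = rng.normal(size=(6, 4))

# An orthogonal rotation preserves every cosine similarity, so a rotated
# copy should score as (numerically) perfectly similar.
rotation, _ = np.linalg.qr(rng.normal(size=(4, 4)))
english = turkish @ rotation

score = language_similarity(turkish, english)
print(f"similarity score: {score:.6f}")
```

With real data, each matrix would hold the vectors of the fixed vocabulary's words in one language, and the resulting score could then be checked for significance, for example against scores obtained after shuffling the word alignment.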

Source Title

Proceedings of the IEEE 25th Signal Processing and Communications Applications Conference, SIU 2017

Publisher

IEEE

Keywords

Natural language processing, Semantic similarity between languages, Word embeddings, Linguistics, Modeling languages, Semantics, European languages, High dimensional spaces, Semantic relations

Permalink

http://hdl.handle.net/11693/37594

Published Version (Please cite this version)

http://dx.doi.org/10.1109/SIU.2017.7960365

Collections

Scholarly Publications - Electrical and Electronics Engineering
Scholarly Publications - BAM
Scholarly Publications - UMRAM

Language

Turkish

Type

Conference Paper
