Semantic similarity between Turkish and European languages using word embeddings
buir.contributor.author | Şenel, Lütfü Kerem | |
buir.contributor.author | Çukur, Tolga | |
dc.contributor.author | Şenel, Lütfü Kerem | en_US |
dc.contributor.author | Yücesoy, V. | en_US |
dc.contributor.author | Koç, A. | en_US |
dc.contributor.author | Çukur, Tolga | en_US |
dc.coverage.spatial | Antalya, Turkey | en_US |
dc.date.accessioned | 2018-04-12T11:45:01Z | |
dc.date.available | 2018-04-12T11:45:01Z | |
dc.date.issued | 2017 | en_US |
dc.department | Department of Electrical and Electronics Engineering | en_US |
dc.department | National Magnetic Resonance Research Center (UMRAM) | en_US |
dc.department | Interdisciplinary Program in Neuroscience (NEUROSCIENCE) | en_US |
dc.department | Aysel Sabuncu Brain Research Center (BAM) | en_US |
dc.description | Date of Conference: 15-18 May 2017 | en_US |
dc.description | Conference Name: IEEE 25th Signal Processing and Communications Applications Conference, SIU 2017 | en_US |
dc.description.abstract | Representation of words coming from vocabulary of a language as real vectors in a high dimensional space is called as word embeddings. Word embeddings are proven to be successful in modelling semantic relations between words and numerous natural language processing applications. Although developed mainly for English, word embeddings perform well for many other languages. In this study, semantic similarity between Turkish (two different corpora) and five basic European languages (English, German, French, Spanish, Italian) is calculated using word embeddings over a fixed vocabulary, obtained results are verified using statistical testing. Also, the effect of using different corpora, and additional preprocess steps on the performance of word embeddings on similarity and analogy test sets prepared for Turkish is studied. | en_US |
dc.description.provenance | Made available in DSpace on 2018-04-12T11:45:01Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 179475 bytes, checksum: ea0bedeb05ac9ccfb983c327e155f0c2 (MD5) Previous issue date: 2017 | en |
dc.identifier.doi | 10.1109/SIU.2017.7960365 | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/37594 | |
dc.language.iso | Turkish | en_US |
dc.publisher | IEEE | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1109/SIU.2017.7960365 | en_US |
dc.source.title | Proceedings of the IEEE 25th Signal Processing and Communications Applications Conference, SIU 2017 | en_US |
dc.subject | Natural language processing | en_US |
dc.subject | Semantic similarity between languages | en_US |
dc.subject | Word embeddings | en_US |
dc.subject | Linguistics | en_US |
dc.subject | Modeling languages | en_US |
dc.subject | Semantics | en_US |
dc.subject | European languages | en_US |
dc.subject | High dimensional spaces | en_US |
dc.subject | Semantic relations | en_US |
dc.title | Semantic similarity between Turkish and European languages using word embeddings | en_US |
dc.title.alternative | Türkçe ile Avrupa dilleri arasındaki anlamsal benzerliğin kelime temsilleri ile gösterimi | en_US |
dc.type | Conference Paper | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Semantic similarity between Turkish and European languages using word embeddings.pdf
- Size:
- 109.24 KB
- Format:
- Adobe Portable Document Format
- Description:
- Full Printable Version