Semantic change detection with Gaussian word embeddings

Yüksel, Arda; Uğurlu, Berke; Koç, Aykut

Files

Semantic_change_detection_with_gaussian_word_embeddings.pdf (1.77 MB)

Date

2021-10-20

Authors

Yüksel, Arda

Uğurlu, Berke

Koç, Aykut

Abstract

The diachronic study of language evolution is important in natural language processing (NLP). Recent years have witnessed a surge of computational approaches for the detection and characterization of lexical semantic change (LSC), driven by the availability of diachronic corpora and advances in word representation techniques. We propose a Gaussian word embedding (w2g)-based method and present a comprehensive study of LSC detection. W2g is a probabilistic, distribution-based word embedding model that represents words as Gaussian mixture models, using covariance information along with the existing mean (word vector). We extensively study several aspects of w2g-based LSC detection under the SemEval-2020 Task 1 evaluation framework as well as on the Google N-gram corpus. In Sub-task 1 (LSC binary classification) of SemEval-2020 Task 1, we report the highest overall ranking as well as the highest ranks for two (German and Swedish) of the four languages (English, Swedish, German and Latin). We also report the highest Spearman correlation in Sub-task 2 (LSC ranking) for Swedish. Our overall rankings in the LSC classification and ranking sub-tasks are 1st and 7th, respectively. We also present a qualitative analysis.
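
To make the idea of comparing distribution-based embeddings across time more concrete, the following is a minimal sketch only. It assumes each word is a single Gaussian with a diagonal covariance per time period (a simplification of the mixture models mentioned above) and scores change with a symmetric KL (Jeffreys) divergence. The abstract does not state which measure the authors use, so the function names, the toy vectors, and the choice of divergence are all illustrative assumptions, not the paper's method.

```python
import math

def kl_diag_gaussians(mu0, var0, mu1, var1):
    """KL(N0 || N1) for two Gaussians with diagonal covariances.

    Each argument is a list of floats; var0 and var1 hold per-dimension variances.
    """
    return 0.5 * sum(
        v0 / v1 + (m1 - m0) ** 2 / v1 - 1.0 + math.log(v1 / v0)
        for m0, v0, m1, v1 in zip(mu0, var0, mu1, var1)
    )

def jeffreys_divergence(mu0, var0, mu1, var1):
    """Symmetric KL divergence, used here as an assumed change score."""
    return (kl_diag_gaussians(mu0, var0, mu1, var1)
            + kl_diag_gaussians(mu1, var1, mu0, var0))

def rank_by_change(emb_t1, emb_t2):
    """Rank words present in both periods by divergence, largest first.

    emb_t1 and emb_t2 map a word to a (mean, variance) pair of lists.
    """
    shared = emb_t1.keys() & emb_t2.keys()
    scores = {w: jeffreys_divergence(*emb_t1[w], *emb_t2[w]) for w in shared}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

# Hypothetical toy embeddings (made-up numbers, not results from the paper).
emb_t1 = {"plane": ([0.0, 0.0], [1.0, 1.0]), "tree": ([1.0, 1.0], [1.0, 1.0])}
emb_t2 = {"plane": ([2.0, 0.0], [1.0, 1.0]), "tree": ([1.0, 1.0], [1.0, 1.0])}

ranking = rank_by_change(emb_t1, emb_t2)
print(ranking)
```

In this toy setup the mean of "plane" shifts between periods while "tree" stays fixed, so "plane" ranks first. A ranking of this kind corresponds to the LSC ranking sub-task; thresholding the scores would be one assumed way to obtain a binary changed/unchanged decision.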

Source Title

IEEE/ACM Transactions on Audio, Speech, and Language Processing

Publisher

IEEE

Keywords

Diachronic embeddings, Semantic change computation, Semantic change detection, Lexical semantic change, Diachronic NLP, Word embeddings, Word2gauss

Permalink

http://hdl.handle.net/11693/76841

Published Version (Please cite this version)

https://doi.org/10.1109/TASLP.2021.3120645

Collections

Scholarly Publications - Electrical and Electronics Engineering
Scholarly Publications - UMRAM

Language

English

Type

Article
