Named-entity recognition in Turkish legal texts

Çetindağ, Can; Yazıcıoğlu, Berkay; Koç, Aykut

Named-entity recognition in Turkish legal texts

Files

Named-entity_recognition_in_Turkish_legal_texts.pdf (1015.14 KB)

Date

2022-07-11

Authors

Çetindağ, Can

Yazıcıoğlu, Berkay

Koç, Aykut

BUIR Usage Stats

34
views

670
downloads

Citation Stats

Attention Stats

Abstract

Natural language processing (NLP) technologies and applications in legal text processing are gaining momentum. Being one of the most prominent tasks in NLP, named-entity recognition (NER) can substantiate a great convenience for NLP in law due to the variety of named entities in the legal domain and their accentuated importance in legal documents. However, domain-specific NER models in the legal domain are not well studied. We present a NER model for Turkish legal texts with a custom-made corpus as well as several NER architectures based on conditional random fields and bidirectional long-short-term memories (BiLSTMs) to address the task. We also study several combinations of different word embeddings consisting of GloVe, Morph2Vec, and neural network-based character feature extraction techniques either with BiLSTM or convolutional neural networks. We report 92.27% F1 score with a hybrid word representation of GloVe and Morph2Vec with character-level features extracted with BiLSTM. Being an agglutinative language, the morphological structure of Turkish is also considered. To the best of our knowledge, our work is the first legal domain-specific NER study in Turkish and also the first study for an agglutinative language in the legal domain. Thus, our work can also have implications beyond the Turkish language.

Source Title

Natural Language Engineering

Publisher

Cambridge University Press

Keywords

NLP in law, NER, Turkish NER, Computational law, Named-entity recognition

Permalink

http://hdl.handle.net/11693/111393

Published Version (Please cite this version)

http://dx.doi.org/10.1017/S1351324922000304

Collections

Scholarly Publications - UMRAM
Scholarly Publications - Electrical and Electronics Engineering

Language

English

Type

Article

Full item page

Named-entity recognition in Turkish legal texts

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Attention Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Named-entity recognition in Turkish legal texts

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Attention Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type