Automatic rule learning exploiting morphological features for named entity recognition in Turkish

Tatar, S.; Cicekli, I.

Files

Automatic rule learning exploiting morphological features for named entity recognition in Turkish.pdf (981.72 KB)

Date

2011

Authors

Tatar, S.

Cicekli, I.

Abstract

Named entity recognition (NER) is one of the basic tasks in automatic extraction of information from natural language texts. In this paper, we describe an automatic rule learning method that exploits different features of the input text to identify the named entities in natural language texts. Moreover, we explore the use of morphological features for extracting named entities from Turkish texts. We believe that the developed system can also be used for other agglutinative languages. The paper also provides a comprehensive overview of the field by reviewing the NER research literature. We conducted our experiments on the TurkIE dataset, a corpus of articles collected from different Turkish newspapers. Our method achieved an average F-score of 91.08% on the dataset. The results of the comparative experiments demonstrate that the developed technique is successfully applicable to the task of automatic NER and that exploiting morphological features can significantly improve NER for Turkish, an agglutinative language. © The Author(s) 2011.

Source Title

Journal of Information Science

Permalink

http://hdl.handle.net/11693/21980

Published Version (Please cite this version)

http://dx.doi.org/10.1177/0165551511398573

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article
