• About
  • Policies
  • What is open access
  • Library
  • Contact
Advanced search
      View Item 
      •   BUIR Home
      • Scholarly Publications
      • National Magnetic Resonance Research Center (UMRAM)
      • View Item
      •   BUIR Home
      • Scholarly Publications
      • National Magnetic Resonance Research Center (UMRAM)
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Named-entity recognition in Turkish legal texts

      Thumbnail
      View / Download
      1015.1 Kb
      Author(s)
      Çetindağ, Can
      Yazıcıoğlu, Berkay
      Koç, Aykut
      Date
      2022-07-11
      Source Title
      Natural Language Engineering
      Print ISSN
      1351-3249
      Electronic ISSN
      1469-8110
      Publisher
      Cambridge University Press
      Pages
      1 - 28
      Language
      English
      Type
      Article
      Item Usage Stats
      12
      views
      15
      downloads
      Abstract
      Natural language processing (NLP) technologies and applications in legal text processing are gaining momentum. Being one of the most prominent tasks in NLP, named-entity recognition (NER) can substantiate a great convenience for NLP in law due to the variety of named entities in the legal domain and their accentuated importance in legal documents. However, domain-specific NER models in the legal domain are not well studied. We present a NER model for Turkish legal texts with a custom-made corpus as well as several NER architectures based on conditional random fields and bidirectional long-short-term memories (BiLSTMs) to address the task. We also study several combinations of different word embeddings consisting of GloVe, Morph2Vec, and neural network-based character feature extraction techniques either with BiLSTM or convolutional neural networks. We report 92.27% F1 score with a hybrid word representation of GloVe and Morph2Vec with character-level features extracted with BiLSTM. Being an agglutinative language, the morphological structure of Turkish is also considered. To the best of our knowledge, our work is the first legal domain-specific NER study in Turkish and also the first study for an agglutinative language in the legal domain. Thus, our work can also have implications beyond the Turkish language.
      Keywords
      NLP in law
      NER
      Turkish NER
      Computational law
      Named-entity recognition
      Permalink
      http://hdl.handle.net/11693/111393
      Published Version (Please cite this version)
      http://dx.doi.org/10.1017/S1351324922000304
      Collections
      • Department of Electrical and Electronics Engineering 4011
      • National Magnetic Resonance Research Center (UMRAM) 301
      Show full item record

      Browse

      All of BUIRCommunities & CollectionsTitlesAuthorsAdvisorsBy Issue DateKeywordsTypeDepartmentsCoursesThis CollectionTitlesAuthorsAdvisorsBy Issue DateKeywordsTypeDepartmentsCourses

      My Account

      Login

      Statistics

      View Usage StatisticsView Google Analytics Statistics

      Bilkent University

      If you have trouble accessing this page and need to request an alternate format, contact the site administrator. Phone: (312) 290 2976
      © Bilkent University - Library IT

      Contact Us | Send Feedback | Off-Campus Access | Admin | Privacy