Measuring and mitigating gender bias in legal contextualized language models

Bozdağ, Mustafa; Sevim, Nurullah; Koç, Aykut

Files

Measuring_and_Mitigating_Gender_Bias_in_Legal_Contextualized_Language_Models.pdf (1.68 MB)

Date

2024-02-13

Authors

Bozdağ, Mustafa

Sevim, Nurullah

Koç, Aykut

Abstract

Transformer-based contextualized language models constitute the state of the art in several natural language processing (NLP) tasks and applications. Despite their utility, contextualized models can contain human-like social biases, as their training corpora generally consist of human-generated text. Evaluating and removing social biases in NLP models has been a major research endeavor. In parallel, the use of NLP in the legal domain, known as legal NLP or computational law, has also been growing. Eliminating unwanted bias in legal NLP is crucial, since the law has a profound effect on people's lives. In this work, we focus on the gender bias encoded in BERT-based models. We propose a new template-based bias measurement method with a new bias evaluation corpus using crime words from the FBI database. This method quantifies the gender bias present in BERT-based models for legal applications. Furthermore, we propose a new fine-tuning-based debiasing method using the European Court of Human Rights (ECtHR) corpus to debias legal pre-trained models. We test the debiased models’ language understanding performance on the LexGLUE benchmark to confirm that the underlying semantic vector space is not perturbed during the debiasing process. Finally, we propose a bias penalty for the performance scores to emphasize the effect of gender bias on model performance.
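
To make the idea of template-based bias measurement concrete, the following is a minimal sketch and not the authors' method. It assumes a masked-language-model scoring function that returns the probability of a token filling the `[MASK]` slot. The template, the two crime words, the pronoun pair, the toy probability table, and the log-ratio score are all hypothetical choices made for illustration; the abstract does not specify the paper's templates, word list, or metric.

```python
import math
from typing import Callable, Iterable

# Hypothetical template and crime words, chosen only for illustration.
TEMPLATE = "[MASK] was charged with {crime}."
CRIME_WORDS = ["burglary", "fraud"]

# A scoring function maps (masked sentence, candidate token) to a probability.
FillProb = Callable[[str, str], float]


def gender_log_ratio(fill_prob: FillProb, template: str, crime: str,
                     male: str = "he", female: str = "she") -> float:
    """Log-ratio of male vs. female pronoun probabilities for one crime word.

    Positive values mean the male pronoun is preferred, negative values
    mean the female pronoun is preferred, and 0.0 means no preference.
    """
    sentence = template.format(crime=crime)
    return math.log(fill_prob(sentence, male) / fill_prob(sentence, female))


def mean_bias(fill_prob: FillProb, template: str,
              crimes: Iterable[str]) -> float:
    """Average the per-word log-ratios over all crime words."""
    scores = [gender_log_ratio(fill_prob, template, c) for c in crimes]
    return sum(scores) / len(scores)


# Toy stand-in for a real masked language model (assumed values).
TOY_PROBS = {
    ("burglary", "he"): 0.6, ("burglary", "she"): 0.2,
    ("fraud", "he"): 0.3, ("fraud", "she"): 0.3,
}


def toy_fill_prob(sentence: str, token: str) -> float:
    """Look up a made-up probability for the token in the masked slot."""
    for (crime, candidate), prob in TOY_PROBS.items():
        if crime in sentence and candidate == token:
            return prob
    raise KeyError((sentence, token))


if __name__ == "__main__":
    print(mean_bias(toy_fill_prob, TEMPLATE, CRIME_WORDS))
```

In practice, `toy_fill_prob` would be replaced by a function that queries a BERT-based model for the masked-token distribution; the stub keeps the sketch runnable without downloading a model.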

Source Title

ACM Journals

Publisher

Association for Computing Machinery

Keywords

Legal NLP, Gender bias, Contextualized models, BERT, LegalBERT

Permalink

https://hdl.handle.net/11693/116698

Published Version (Please cite this version)

https://doi.org/

Rights

https://creativecommons.org/licenses/by/4.0/

Collections

Scholarly Publications - Electrical and Electronics Engineering

Language

English

Type

Article
