Cleaning ground truth data in software task assignment


Date

2022-05-25

Source Title

Information and Software Technology

Print ISSN

0950-5849

Electronic ISSN

1873-6025

Publisher

Elsevier BV

Volume

149

Pages

106956-1 to 106956-14

Language

English

Abstract

Context: In the context of collaborative software development, there are many application areas of task assignment, such as assigning a developer to fix a bug or assigning a code reviewer to a pull request. Most task assignment techniques in the literature build and evaluate their models based on datasets collected from real projects. The techniques invariably presume that these datasets reliably represent the “ground truth”. In a project dataset used to build an automated task assignment system, the recommended assignee for the task is usually assumed to be the best assignee for that task. However, in practice, the task assignee may not be the best possible task assignee, or even a sufficiently qualified one.

Objective: We aim to clean up the ground truth by removing the samples that are potentially problematic or suspect, with the assumption that removing such samples would reduce any systematic labeling bias in the dataset and lead to performance improvements.

Method: We devised a debiasing method to detect potentially problematic samples in task assignment datasets. We then evaluated the method’s impact on the performance of seven task assignment techniques by comparing the Mean Reciprocal Rank (MRR) scores before and after debiasing. We used two different task assignment applications for this purpose: Code Reviewer Recommendation (CRR) and Bug Assignment (BA).

Results: In the CRR application, we achieved an average MRR improvement of 18.17% for the three learning-based techniques tested on two datasets. No significant improvements were observed for the two optimization-based techniques tested on the same datasets. In the BA application, we achieved a similar average MRR improvement of 18.40% for the two learning-based techniques tested on four different datasets.

Conclusion: Debiasing the ground truth data by removing suspect samples can help improve the performance of learning-based techniques in software task assignment applications.
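
As a minimal illustration of the evaluation metric named in the abstract, the sketch below computes Mean Reciprocal Rank (MRR) over ranked assignee recommendations so that scores can be compared before and after suspect samples are removed. It is not the paper's implementation; the function name, example recommendation lists, and ground-truth assignees are illustrative assumptions.

```python
# Minimal sketch (not the paper's code): Mean Reciprocal Rank (MRR) for a
# task-assignment recommender, evaluated on a raw dataset and on a version
# with a suspect sample removed. All names and data are illustrative.

from typing import Sequence


def mean_reciprocal_rank(recommendations: Sequence[Sequence[str]],
                         ground_truth: Sequence[str]) -> float:
    """Average of 1/rank of the true assignee in each ranked list.

    Tasks whose true assignee never appears in the list contribute 0.
    """
    total = 0.0
    for ranked, truth in zip(recommendations, ground_truth):
        if truth in ranked:
            total += 1.0 / (ranked.index(truth) + 1)  # ranks are 1-based
    return total / len(ground_truth) if ground_truth else 0.0


# Hypothetical example: the same recommender scored on the raw dataset and
# on a "debiased" dataset where the third (suspect) sample is dropped.
raw_recs = [["alice", "bob"], ["carol", "dave"], ["erin", "frank"]]
raw_truth = ["bob", "dave", "grace"]   # "grace" is never recommended -> 0
print("MRR before debiasing:", mean_reciprocal_rank(raw_recs, raw_truth))

clean_recs = raw_recs[:2]              # suspect third sample removed
clean_truth = raw_truth[:2]
print("MRR after debiasing: ", mean_reciprocal_rank(clean_recs, clean_truth))
```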
