Cleaning ground truth data in software task assignment

buir.contributor.author: Tüzün, Eray
buir.contributor.author: Moran, Cansu
buir.contributor.orcid: Tüzün, Eray|0000-0002-5550-7816
buir.contributor.orcid: Moran, Cansu|0000-0003-2101-1449
dc.citation.epage: 106956-14
dc.citation.spage: 106956-1
dc.citation.volumeNumber: 149
dc.contributor.author: Tecimer, K. A.
dc.contributor.author: Tüzün, Eray
dc.contributor.author: Moran, Cansu
dc.contributor.author: Erdogmus, H.
dc.date.accessioned: 2023-02-17T10:49:51Z
dc.date.available: 2023-02-17T10:49:51Z
dc.date.issued: 2022-05-25
dc.department: Department of Computer Engineering
dc.description.abstract:
Context: In the context of collaborative software development, there are many application areas of task assignment such as assigning a developer to fix a bug, or assigning a code reviewer to a pull request. Most task assignment techniques in the literature build and evaluate their models based on datasets collected from real projects. The techniques invariably presume that these datasets reliably represent the “ground truth”. In a project dataset used to build an automated task assignment system, the recommended assignee for the task is usually assumed to be the best assignee for that task. However, in practice, the task assignee may not be the best possible task assignee, or even a sufficiently qualified one.
Objective: We aim to clean up the ground truth by removing the samples that are potentially problematic or suspect, with the assumption that removing such samples would reduce any systematic labeling bias in the dataset and lead to performance improvements.
Method: We devised a debiasing method to detect potentially problematic samples in task assignment datasets. We then evaluated the method’s impact on the performance of seven task assignment techniques by comparing the Mean Reciprocal Rank (MRR) scores before and after debiasing. We used two different task assignment applications for this purpose: Code Reviewer Recommendation (CRR) and Bug Assignment (BA).
Results: In the CRR application, we achieved an average MRR improvement of 18.17% for the three learning-based techniques tested on two datasets. No significant improvements were observed for the two optimization-based techniques tested on the same datasets. In the BA application, we achieved a similar average MRR improvement of 18.40% for the two learning-based techniques tested on four different datasets.
Conclusion: Debiasing the ground truth data by removing suspect samples can help improve the performance of learning-based techniques in software task assignment applications.
dc.description.provenance: Submitted by Ezgi Uğurlu (ezgi.ugurlu@bilkent.edu.tr) on 2023-02-17T10:49:51Z. No. of bitstreams: 1. Cleaning_ground_truth_data_in_software_task_assignment.pdf: 1926016 bytes, checksum: 9ab239180de7b8d17e6dcfa89767dfce (MD5)
dc.description.provenance: Made available in DSpace on 2023-02-17T10:49:51Z (GMT). No. of bitstreams: 1. Cleaning_ground_truth_data_in_software_task_assignment.pdf: 1926016 bytes, checksum: 9ab239180de7b8d17e6dcfa89767dfce (MD5). Previous issue date: 2022-05-25
dc.embargo.release: 2024-05-25
dc.identifier.doi: 10.1016/j.infsof.2022.106956
dc.identifier.eissn: 1873-6025
dc.identifier.issn: 0950-5849
dc.identifier.uri: http://hdl.handle.net/11693/111501
dc.language.iso: English
dc.publisher: Elsevier BV
dc.relation.isversionof: https://doi.org/10.1016/j.infsof.2022.106956
dc.source.title: Information and Software Technology
dc.subject: Task assignment
dc.subject: Code reviewer recommendation
dc.subject: Bug assignment
dc.subject: Ground truth
dc.subject: Labeling bias elimination
dc.subject: Systematic labeling bias
dc.subject: Data cleaning
dc.title: Cleaning ground truth data in software task assignment
dc.type: Article
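
The abstract above evaluates each task assignment technique by its Mean Reciprocal Rank (MRR) before and after debiasing. The following is a minimal Python sketch of how such a comparison can be computed, assuming each task contributes the 1-based rank at which the actual assignee appears in the recommendation list; the function and the sample ranks are illustrative assumptions, not the authors' implementation.

    def mean_reciprocal_rank(ranks):
        """MRR over tasks: `ranks` holds, per task, the 1-based rank of the
        actual assignee in the recommendation list (0 if not recommended)."""
        return sum(1.0 / r for r in ranks if r > 0) / len(ranks)

    # Hypothetical ranks of the true assignee before and after removing
    # suspect samples from the training data.
    ranks_before = [1, 3, 2, 5, 0]
    ranks_after = [1, 2, 1, 3, 2]
    print(mean_reciprocal_rank(ranks_before))  # ~0.41
    print(mean_reciprocal_rank(ranks_after))   # ~0.67

In this sketch, a rank of 0 marks a task whose actual assignee never appears in the recommendations, contributing nothing to the sum; a higher MRR after debiasing indicates that the true assignees are ranked closer to the top of the recommendation lists.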

Files

Original bundle
Name: Cleaning_ground_truth_data_in_software_task_assignment.pdf
Size: 1.84 MB
Format: Adobe Portable Document Format

License bundle
Name: license.txt
Size: 1.69 KB
Format: Item-specific license agreed upon to submission