Assessing the quality of GitHub Copilot’s code generation

buir.contributor.author: Yetiştiren, Burak
buir.contributor.author: Özsoy, Işık
buir.contributor.author: Tüzün, Eray
buir.contributor.orcid: Tüzün, Eray | 0000-0002-5550-7816
dc.citation.epage: 71 (en_US)
dc.citation.spage: 62 (en_US)
dc.contributor.author: Yetiştiren, Burak
dc.contributor.author: Özsoy, Işık
dc.contributor.author: Tüzün, Eray
dc.coverage.spatial: Singapore, Singapore (en_US)
dc.date.accessioned: 2023-02-21T08:17:43Z
dc.date.available: 2023-02-21T08:17:43Z
dc.date.issued: 2022-11-09
dc.department: Department of Computer Engineering (en_US)
dc.description: Conference Name: PROMISE 2022: Proceedings of the 18th International Conference on Predictive Models and Data Analytics in Software Engineering (en_US)
dc.description: Conference Date: 17 November 2022 (en_US)
dc.description.abstract: The introduction of GitHub’s new code generation tool, GitHub Copilot, seems to be the first well-established instance of an AI pair programmer. GitHub Copilot has access to a large number of open-source projects, enabling it to utilize more extensive code in various programming languages than other code generation tools. Although the initial and informal assessments are promising, a systematic evaluation is needed to explore the limits and benefits of GitHub Copilot. The main objective of this study is to assess the quality of the code generated by GitHub Copilot. We also aim to evaluate the impact of the quality and variety of input parameters fed to GitHub Copilot. To achieve this aim, we created an experimental setup for evaluating the generated code in terms of validity, correctness, and efficiency. Our results suggest that GitHub Copilot was able to generate valid code with a 91.5% success rate. In terms of code correctness, out of 164 problems, 47 (28.7%) were generated correctly, 84 (51.2%) partially correctly, and 33 (20.1%) incorrectly. Our empirical analysis shows that GitHub Copilot is a promising tool based on the results we obtained; however, further and more comprehensive assessment is needed in the future. (en_US)
dc.identifier.doi: 10.1145/3558489.3559072 (en_US)
dc.identifier.isbn: 978-1-4503-9860-2
dc.identifier.uri: http://hdl.handle.net/11693/111572
dc.language.iso: English (en_US)
dc.publisher: Association for Computing Machinery (en_US)
dc.relation.isversionof: https://doi.org/10.1145/3558489.3559072 (en_US)
dc.source.title: International Conference on Software Engineering (en_US)
dc.subject: GitHub Copilot (en_US)
dc.subject: Code generation (en_US)
dc.subject: Code completion (en_US)
dc.subject: AI pair programmer (en_US)
dc.subject: Empirical study (en_US)
dc.title: Assessing the quality of GitHub Copilot’s code generation (en_US)
dc.type: Conference Paper (en_US)
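
The abstract reports two kinds of results: validity (91.5% of generations were valid code) and a three-way correctness classification (correct / partially correct / incorrect) over 164 problems. As a rough, hypothetical illustration only, and not the paper’s actual experimental setup, the sketch below shows one way such checks could be implemented for generated Python solutions: syntactic validity via parsing, and a correctness bucket based on how many test cases pass. The sample solution, the add entry point, and the test cases are all made up for this example.

    # Hypothetical sketch: classify a generated solution as valid/invalid and as
    # correct / partially correct / incorrect. Names and tests are illustrative only,
    # not taken from the paper's experimental setup.
    import ast

    def is_valid(source: str) -> bool:
        """Validity: the generated code must at least parse as Python."""
        try:
            ast.parse(source)
            return True
        except SyntaxError:
            return False

    def classify_correctness(source: str, entry_point: str, test_cases) -> str:
        """Correctness: run the candidate against test cases and bucket the result."""
        namespace = {}
        try:
            exec(source, namespace)  # assumes trusted or sandboxed execution
            func = namespace[entry_point]
        except Exception:
            return "incorrect"
        passed = 0
        for args, expected in test_cases:
            try:
                if func(*args) == expected:
                    passed += 1
            except Exception:
                pass  # a crashing or failing case simply counts as not passed
        if passed == len(test_cases):
            return "correct"
        if passed > 0:
            return "partially correct"
        return "incorrect"

    # Illustrative usage with a made-up generated solution and tests.
    generated = "def add(a, b):\n    return a + b\n"
    tests = [((1, 2), 3), ((0, 0), 0), ((-1, 1), 0)]
    print(is_valid(generated), classify_correctness(generated, "add", tests))

In a setup of this kind, “correct” would mean all test cases pass, “partially correct” at least one, and “incorrect” none, which mirrors the three buckets reported in the abstract.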

Files

Original bundle
Name: Assessing_the_quality_of_GitHub_copilot’s_code_generation.pdf
Size: 1.13 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.69 KB
Description: Item-specific license agreed upon to submission