Assessing the quality of GitHub copilot’s code generation

Yetiştiren, Burak; Özsoy, Işık; Tüzün, Eray

Assessing the quality of GitHub copilot’s code generation

Date

2022-11-09

Authors

Yetiştiren, Burak

Özsoy, Işık

Tüzün, Eray

BUIR Usage Stats

15
views

206
downloads

Citation Stats

Abstract

The introduction of GitHub’s new code generation tool, GitHub Copilot, seems to be the first well-established instance of an AI pair-programmer. GitHub Copilot has access to a large number of open-source projects, enabling it to utilize more extensive code in various programming languages than other code generation tools. Although the initial and informal assessments are promising, a systematic evaluation is needed to explore the limits and benefits of GitHub Copilot. The main objective of this study is to assess the quality of generated code provided by GitHub Copilot. We also aim to evaluate the impact of the quality and variety of input parameters fed to GitHub Copilot. To achieve this aim, we created an experimental setup for evaluating the generated code in terms of validity, correctness, and efficiency. Our results suggest that GitHub Copilot was able to generate valid code with a 91.5% success rate. In terms of code correctness, out of 164 problems, 47 (28.7%) were correctly, while 84 (51.2%) were partially correctly, and 33 (20.1%) were incorrectly generated. Our empirical analysis shows that GitHub Copilot is a promising tool based on the results we obtained, however further and more comprehensive assessment is needed in the future.

Source Title

International Conference on Software Engineering

Publisher

Association for Computing Machinery

Keywords

GitHub Copilot, Code generation, Code completion, AI pair programmer, Empirical study

Permalink

http://hdl.handle.net/11693/111572

Published Version (Please cite this version)

https://doi.org/10.1145/3558489.3559072

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Conference Paper

Full item page

Assessing the quality of GitHub copilot’s code generation

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Assessing the quality of GitHub copilot’s code generation

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type