Abstract
Software development has an important human side: developers' feelings have a significant impact on software development and could affect developers' quality, productivity, and performance. In this paper, we explore a process for finding and understanding the effects of toxic emotions and relating them to code quality. We propose a tool and a sentiment dataset, a clean set of commit messages built from SonarQube code quality metrics and toxic comments obtained from GitHub. Moreover, we perform a preliminary statistical analysis of the dataset. We apply natural language processing techniques to identify toxic developer sentiments in commits that could impact code quality. Our study describes the data retrieval process along with the tools used to perform the preliminary analysis. The preliminary dataset is available in CSV format to facilitate queries on the data and to support in-depth investigation of the factors that impact developer emotions. Preliminary results suggest a relationship between toxic comments and code quality that may affect the quality of the software project. Future research will develop a complete dataset and an in-depth analysis for efficiency validation experiments, along with a linear regression. Finally, we will estimate code quality as a function of developers' toxic comments.
Original language | English |
---|---|
Title of host publication | Proceedings of the 17th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2022 |
Editors | Hermann Kaindl, Mike Mannion, Leszek Maciaszek |
Publisher | Science and Technology Publications, Lda |
Pages | 335-343 |
Number of pages | 9 |
ISBN (Electronic) | 9789897585685 |
State | Published - 2022 |
Event | 17th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2022 - Virtual, Online. Duration: 25 Apr 2022 → 26 Apr 2022 |
Publication series
Name | International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE - Proceedings |
---|---|
ISSN (Electronic) | 2184-4895 |
Conference
Conference | 17th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2022 |
---|---|
City | Virtual, Online |
Period | 25/04/22 → 26/04/22 |
Bibliographical note
Publisher Copyright: Copyright © 2022 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved.
Keywords
- Commits
- GitHub
- Sentiments Analysis
- Software Engineering
- Software Quality
- SonarQube
- Toxic Comment Classification