Exploring the Impact of Toxic Comments in Code Quality

Jaime Sayago-Heredia, Gustavo Chango, Ricardo Pérez-Castillo, Mario Piattini

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Software development has an important human-side, which implies that developers' feelings have a significant impact to software development and could affect developers' quality, productivity, and performance. In this paper, we explore the process to find, understand and relate the effects of toxic emotions on code quality. We propose a tool and sentiments dataset, a clean set of commit messages, extracted from SonarQube code quality metrics and toxic comments obtained from GitHub. Moreover, we perform a preliminary statistical analysis of the dataset. We apply natural language processing techniques to identify toxic developer sentiments on commits that could impact code quality. Our study describes data retrieval process along with tools used for performing a preliminary analysis. The preliminary dataset is available in CSV format to facilitate queries on the data and to investigate in depth factors that impact developer emotions. Preliminary results imply that there is a relationship between toxic comments and code quality that may affect the quality of the software project. Future research will be the development of a complete dataset and an in-depth analysis for efficiency validation experiments along with a linear regression. Finally, we will estimate the code quality as a function of developers' toxic comments.

Original languageEnglish
Title of host publicationProceedings of the 17th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2022
EditorsHermann Kaindl, Mike Mannion, Leszek Maciaszek, Leszek Maciaszek
PublisherScience and Technology Publications, Lda
Pages335-343
Number of pages9
ISBN (Electronic)9789897585685
DOIs
StatePublished - 2022
Event17th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2022 - Virtual, Online
Duration: 25 Apr 202226 Apr 2022

Publication series

NameInternational Conference on Evaluation of Novel Approaches to Software Engineering, ENASE - Proceedings
ISSN (Electronic)2184-4895

Conference

Conference17th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2022
CityVirtual, Online
Period25/04/2226/04/22

Bibliographical note

Publisher Copyright:
Copyright © 2022 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved.

Keywords

  • Commits
  • GitHub
  • Sentiments Analysis
  • Software Engineering
  • Software Quality
  • SonarQube
  • Toxic Comment Classification

Fingerprint

Dive into the research topics of 'Exploring the Impact of Toxic Comments in Code Quality'. Together they form a unique fingerprint.

Cite this