TY - JOUR
T1 - Analyzing the Correlation Between Toxic Comments and Code Quality
AU - Sayago-Heredia, Jaime
AU - Sailema, Gustavo Chango
AU - Pérez-Castillo, Ricardo
AU - Piattini, Mario
N1 - Publisher Copyright:
© 2024 The Author(s). Journal of Software: Evolution and Process published by John Wiley & Sons Ltd.
PY - 2024/11/12
Y1 - 2024/11/12
N2 - Software development has a relevant human side, and this could, for example, imply that developers' feelings have an impact on certain aspects of software development such as quality, productivity, or performance. This paper explores the effects of toxic emotions on code quality and presents the SentiQ tool, which gathers and analyzes sentiments from commit messages (obtained from GitHub) and code quality measures (obtained from SonarQube). The SentiQ tool we proposed performs a sentiment analysis (based on natural language processing techniques) and relates the results to the code quality measures. The datasets extracted are then used as the basis on which to conduct a preliminary case study, which demonstrates that there is a relationship between toxic comments and code quality that may affect the quality of the whole software project. This has resulted in the drafting of a predictive model to validate the correlation of the impact of toxic comments on code quality. The main implication of this work is that these results could, in the future, make it possible to estimate code quality as a function of developers' toxic comments.
AB - Software development has a relevant human side, and this could, for example, imply that developers' feelings have an impact on certain aspects of software development such as quality, productivity, or performance. This paper explores the effects of toxic emotions on code quality and presents the SentiQ tool, which gathers and analyzes sentiments from commit messages (obtained from GitHub) and code quality measures (obtained from SonarQube). The SentiQ tool we proposed performs a sentiment analysis (based on natural language processing techniques) and relates the results to the code quality measures. The datasets extracted are then used as the basis on which to conduct a preliminary case study, which demonstrates that there is a relationship between toxic comments and code quality that may affect the quality of the whole software project. This has resulted in the drafting of a predictive model to validate the correlation of the impact of toxic comments on code quality. The main implication of this work is that these results could, in the future, make it possible to estimate code quality as a function of developers' toxic comments.
KW - commit analysis
KW - GitHub
KW - sentiments analysis
KW - software engineering
KW - software quality
KW - SonarQube
KW - toxic comment classification
UR - http://www.scopus.com/inward/record.url?scp=85208794089&partnerID=8YFLogxK
U2 - 10.1002/smr.2739
DO - 10.1002/smr.2739
M3 - Article
SN - 1532-060X
JO - Journal of software: Evolution and Process
JF - Journal of software: Evolution and Process
ER -