Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments

The IT community contains a vast amount of code snippets, but the quality of these codes cannot be guaranteed, causing inconvenience for users in discovering and utilizing them. If quality evaluation can be conducted on these codes to provide reference for users, it will greatly improve the user exp...

Full description

Bibliographic Details
Published in:Proceedings - 2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security Companion, QRS-C 2023
Main Author: Qin H.; Xue P.; Ma X.; Gao B.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85186748279&doi=10.1109%2fQRS-C60940.2023.00038&partnerID=40&md5=ebba20b23b426a383e96bb1d7013024e
id 2-s2.0-85186748279
spelling 2-s2.0-85186748279
Qin H.; Xue P.; Ma X.; Gao B.
Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments
2023
Proceedings - 2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security Companion, QRS-C 2023


10.1109/QRS-C60940.2023.00038
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85186748279&doi=10.1109%2fQRS-C60940.2023.00038&partnerID=40&md5=ebba20b23b426a383e96bb1d7013024e
The IT community contains a vast amount of code snippets, but the quality of these codes cannot be guaranteed, causing inconvenience for users in discovering and utilizing them. If quality evaluation can be conducted on these codes to provide reference for users, it will greatly improve the user experience. Existing code quality evaluation methods primarily analyze the source code, which is more suitable for complete program in software projects and not applicable to fragmented codes in IT communities. To address this issue, this paper proposes a code quality evaluation model(CQCS) based on the sentiment analysis of user comments. The model collects user sentiment comments from the IT community as training data using a semi-supervised approach, balances the data using the Synthetic Minority Oversampling Technique(SMOTE), applies Word2Vec for text vectorization, performs sentiment analysis on the user comment using a Long Short-Term Memory(LSTM) model, and finally evaluates code quality according to the sentiment analysis results. A higher proportion of positive sentiment data indicates better code quality. The performance and practicality of the CQCS model are experimentally evaluated using data obtained from the Chinese Software Developer Network(CSDN) developer community. The experimental results demonstrate the practicality and effectiveness of the proposed method. © 2023 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Qin H.; Xue P.; Ma X.; Gao B.
spellingShingle Qin H.; Xue P.; Ma X.; Gao B.
Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments
author_facet Qin H.; Xue P.; Ma X.; Gao B.
author_sort Qin H.; Xue P.; Ma X.; Gao B.
title Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments
title_short Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments
title_full Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments
title_fullStr Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments
title_full_unstemmed Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments
title_sort Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments
publishDate 2023
container_title Proceedings - 2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security Companion, QRS-C 2023
container_volume
container_issue
doi_str_mv 10.1109/QRS-C60940.2023.00038
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85186748279&doi=10.1109%2fQRS-C60940.2023.00038&partnerID=40&md5=ebba20b23b426a383e96bb1d7013024e
description The IT community contains a vast amount of code snippets, but the quality of these codes cannot be guaranteed, causing inconvenience for users in discovering and utilizing them. If quality evaluation can be conducted on these codes to provide reference for users, it will greatly improve the user experience. Existing code quality evaluation methods primarily analyze the source code, which is more suitable for complete program in software projects and not applicable to fragmented codes in IT communities. To address this issue, this paper proposes a code quality evaluation model(CQCS) based on the sentiment analysis of user comments. The model collects user sentiment comments from the IT community as training data using a semi-supervised approach, balances the data using the Synthetic Minority Oversampling Technique(SMOTE), applies Word2Vec for text vectorization, performs sentiment analysis on the user comment using a Long Short-Term Memory(LSTM) model, and finally evaluates code quality according to the sentiment analysis results. A higher proportion of positive sentiment data indicates better code quality. The performance and practicality of the CQCS model are experimentally evaluated using data obtained from the Chinese Software Developer Network(CSDN) developer community. The experimental results demonstrate the practicality and effectiveness of the proposed method. © 2023 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809677889139900416