Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments
The IT community contains a vast amount of code snippets, but the quality of these codes cannot be guaranteed, causing inconvenience for users in discovering and utilizing them. If quality evaluation can be conducted on these codes to provide reference for users, it will greatly improve the user exp...
Published in: | Proceedings - 2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security Companion, QRS-C 2023 |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2023
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85186748279&doi=10.1109%2fQRS-C60940.2023.00038&partnerID=40&md5=ebba20b23b426a383e96bb1d7013024e |
id |
2-s2.0-85186748279 |
---|---|
spelling |
2-s2.0-85186748279 Qin H.; Xue P.; Ma X.; Gao B. Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments 2023 Proceedings - 2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security Companion, QRS-C 2023 10.1109/QRS-C60940.2023.00038 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85186748279&doi=10.1109%2fQRS-C60940.2023.00038&partnerID=40&md5=ebba20b23b426a383e96bb1d7013024e The IT community contains a vast amount of code snippets, but the quality of these codes cannot be guaranteed, causing inconvenience for users in discovering and utilizing them. If quality evaluation can be conducted on these codes to provide reference for users, it will greatly improve the user experience. Existing code quality evaluation methods primarily analyze the source code, which is more suitable for complete program in software projects and not applicable to fragmented codes in IT communities. To address this issue, this paper proposes a code quality evaluation model(CQCS) based on the sentiment analysis of user comments. The model collects user sentiment comments from the IT community as training data using a semi-supervised approach, balances the data using the Synthetic Minority Oversampling Technique(SMOTE), applies Word2Vec for text vectorization, performs sentiment analysis on the user comment using a Long Short-Term Memory(LSTM) model, and finally evaluates code quality according to the sentiment analysis results. A higher proportion of positive sentiment data indicates better code quality. The performance and practicality of the CQCS model are experimentally evaluated using data obtained from the Chinese Software Developer Network(CSDN) developer community. The experimental results demonstrate the practicality and effectiveness of the proposed method. © 2023 IEEE. Institute of Electrical and Electronics Engineers Inc. English Conference paper |
author |
Qin H.; Xue P.; Ma X.; Gao B. |
spellingShingle |
Qin H.; Xue P.; Ma X.; Gao B. Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments |
author_facet |
Qin H.; Xue P.; Ma X.; Gao B. |
author_sort |
Qin H.; Xue P.; Ma X.; Gao B. |
title |
Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments |
title_short |
Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments |
title_full |
Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments |
title_fullStr |
Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments |
title_full_unstemmed |
Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments |
title_sort |
Code Quality Evaluation in IT Communities Based on Sentiment Analysis of User Comments |
publishDate |
2023 |
container_title |
Proceedings - 2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security Companion, QRS-C 2023 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/QRS-C60940.2023.00038 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85186748279&doi=10.1109%2fQRS-C60940.2023.00038&partnerID=40&md5=ebba20b23b426a383e96bb1d7013024e |
description |
The IT community contains a vast amount of code snippets, but the quality of these codes cannot be guaranteed, causing inconvenience for users in discovering and utilizing them. If quality evaluation can be conducted on these codes to provide reference for users, it will greatly improve the user experience. Existing code quality evaluation methods primarily analyze the source code, which is more suitable for complete program in software projects and not applicable to fragmented codes in IT communities. To address this issue, this paper proposes a code quality evaluation model(CQCS) based on the sentiment analysis of user comments. The model collects user sentiment comments from the IT community as training data using a semi-supervised approach, balances the data using the Synthetic Minority Oversampling Technique(SMOTE), applies Word2Vec for text vectorization, performs sentiment analysis on the user comment using a Long Short-Term Memory(LSTM) model, and finally evaluates code quality according to the sentiment analysis results. A higher proportion of positive sentiment data indicates better code quality. The performance and practicality of the CQCS model are experimentally evaluated using data obtained from the Chinese Software Developer Network(CSDN) developer community. The experimental results demonstrate the practicality and effectiveness of the proposed method. © 2023 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677889139900416 |