Summary: | Sentiment analysis, sometimes known as opinion mining, is a technique or process for finding and obtaining information on a certain topic, such as opinions and attitudes, from both written and spoken language. Most of the netizens in Malaysia have unique dialects or slang and commonly use abbreviations while communicating, resulting in a more efficient exchange of information, which makes it impossible to capture the text messages and, as a result, makes it more difficult to categorise the polarity of the text. This paper develops a Chinese monolingual sentiment classifier for social media data using a corpus-based approach. It is implemented as a web-based system utilising the Flask Framework, which uses front-end and back-end programming languages including HTML, CSS, and Python. A vast corpus of Chinese texts from social media is collected. Then an open-source dataset with sentiment labels is utilised. The anticipated outcome of improving sentiment analysis is effectively detecting and categorising sentiment expressions in Chinese text. A 90% accuracy rate was achieved with a Chinese sentiment classifier for social media texts. The Support Vector Machine model is implemented within a Flask-based web application. Although the classifier shows proficiency in determining positive and negative sentiments, it still needs additional refinement to interpret neutral tones and complex expressions. © 2024 IEEE.
|