Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model
Question and answer websites such as Quora, Stack Overflow, Yahoo Answers and Answer Bag are used by professionals. Multiple users post questions on these websites to get the answers from domain specific professionals. These websites are multilingual meaning they are available in many different lang...
الحاوية / القاعدة: | Intelligent Automation and Soft Computing |
---|---|
المؤلف الرئيسي: | |
التنسيق: | مقال |
اللغة: | English |
منشور في: |
Tech Science Press
2023
|
الوصول للمادة أونلاين: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85135005700&doi=10.32604%2fiasc.2023.023277&partnerID=40&md5=248d0a5695a8d7f8c9425cd718f8a96a |
id |
Naveed H.; Sohail A.; Zain J.M.; Saleem N.; Ali R.F.; Anwar S. |
---|---|
spelling |
Naveed H.; Sohail A.; Zain J.M.; Saleem N.; Ali R.F.; Anwar S. 2-s2.0-85135005700 Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model 2023 Intelligent Automation and Soft Computing 35 1 10.32604/iasc.2023.023277 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85135005700&doi=10.32604%2fiasc.2023.023277&partnerID=40&md5=248d0a5695a8d7f8c9425cd718f8a96a Question and answer websites such as Quora, Stack Overflow, Yahoo Answers and Answer Bag are used by professionals. Multiple users post questions on these websites to get the answers from domain specific professionals. These websites are multilingual meaning they are available in many different languages. Current problem for these types of websites is to handle meaningless and irrelevant content. In this paper we have worked on the Quora insincere questions (questions which are based on false assumptions or questions which are trying to make a statement rather than seeking for helpful answers) dataset in order to identify user insincere questions, so that Quora can eliminate those questions from their platform and ultimately improve the communication among users over the platform. Previously, a research was carried out with recurrent neural network and pretrained glove word embeddings, that achieved the F1 score of 0.69. The proposed study has used a pre-trained ULMFiT model. This model has outperformed the previous model with an F1 score of 0.91, which is much higher than the previous studies. © 2023, Tech Science Press. All rights reserved. Tech Science Press 10798587 English Article All Open Access; Hybrid Gold Open Access |
author |
2-s2.0-85135005700 |
spellingShingle |
2-s2.0-85135005700 Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model |
author_facet |
2-s2.0-85135005700 |
author_sort |
2-s2.0-85135005700 |
title |
Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model |
title_short |
Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model |
title_full |
Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model |
title_fullStr |
Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model |
title_full_unstemmed |
Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model |
title_sort |
Detection of Toxic Content on Social Networking Platforms Using Fine Tuned ULMFiT Model |
publishDate |
2023 |
container_title |
Intelligent Automation and Soft Computing |
container_volume |
35 |
container_issue |
1 |
doi_str_mv |
10.32604/iasc.2023.023277 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85135005700&doi=10.32604%2fiasc.2023.023277&partnerID=40&md5=248d0a5695a8d7f8c9425cd718f8a96a |
description |
Question and answer websites such as Quora, Stack Overflow, Yahoo Answers and Answer Bag are used by professionals. Multiple users post questions on these websites to get the answers from domain specific professionals. These websites are multilingual meaning they are available in many different languages. Current problem for these types of websites is to handle meaningless and irrelevant content. In this paper we have worked on the Quora insincere questions (questions which are based on false assumptions or questions which are trying to make a statement rather than seeking for helpful answers) dataset in order to identify user insincere questions, so that Quora can eliminate those questions from their platform and ultimately improve the communication among users over the platform. Previously, a research was carried out with recurrent neural network and pretrained glove word embeddings, that achieved the F1 score of 0.69. The proposed study has used a pre-trained ULMFiT model. This model has outperformed the previous model with an F1 score of 0.91, which is much higher than the previous studies. © 2023, Tech Science Press. All rights reserved. |
publisher |
Tech Science Press |
issn |
10798587 |
language |
English |
format |
Article |
accesstype |
All Open Access; Hybrid Gold Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1828987867071774720 |