Homestay Recommender System Using Content-Based Filtering and K-Means Algorithm

The challenge of finding an ideal homestay that aligns with user preferences is significant due to the volume of available choices and the limitations of existing search filters on accommodation websites. Users face several issues when searching for homestays online, such as limited search filters,...

Full description

Bibliographic Details
Published in:2024 International Visualization, Informatics and Technology Conference, IVIT 2024
Main Author: Azman N.A.B.N.; Daud N.M.N.; Sabri N.M.; Abu Bakar N.A.A.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85207069403&doi=10.1109%2fIVIT62102.2024.10692708&partnerID=40&md5=2d4b4b0c710bd662069810473e14a09f
Description
Summary:The challenge of finding an ideal homestay that aligns with user preferences is significant due to the volume of available choices and the limitations of existing search filters on accommodation websites. Users face several issues when searching for homestays online, such as limited search filters, lack of location or landmark information, irrelevant recommendations, outdated or inaccurate homestay details, and time-consuming search processes. Additionally, websites often mix homestay listings with other types of accommodations, leading to user confusion and inefficient search experiences. The objective of this study is to investigate the requirement needed to develop a homestay recommender system using Content-based filtering and to evaluate the accuracy and performance of the homestay recommender system. The data used in this research, which contain the criteria of the homestays, were collected from the online accommodation website. Recommendation list is generated using Content-based filtering approach, which includes few techniques such as Term Frequency-Inverse Document Frequency (TF-IDF) to analyze the importance of terms in the dataset, Cosine Similarity to measure the similarity between the user's preferences and homestays characteristic and K-Means algorithm for the evaluation part. The system's performance is evaluated using precision, recall, F1-score, Root Mean Square Error (RMSE), and Mean Absolute Error (MAE). The prototype achieved an accuracy of 79.3%, a recall of 93%, and an F1 score of 86%. The RMSE and MAE were recorded at 9.95 and 9.42, respectively. As for future works, expanding the dataset and implementing a proper database to store the user profile could be done to enrich the recommendation process. © 2024 IEEE.
ISSN:
DOI:10.1109/IVIT62102.2024.10692708