The k-nearest neighbor modelling by varying Mahalanobis and correlation in distance metric for agarwood oil quality classification

Agarwood oil is well known for its unique scent and has many usages; as an incense, as ingredient in perfume, is burnt during religious ceremonies and is used in traditional medical preparation. Therefore, agarwood oil has high demand and is traded at different price based on its quality. Basically,...

Full description

Bibliographic Details
Published in:International Journal of Advances in Applied Sciences
Main Author: Jainalabidin N.S.M.; Amidon A.F.M.; Ismail N.; Yusoff Z.M.; Tajuddin S.N.; Taib M.N.
Format: Article
Language:English
Published: Intelektual Pustaka Media Utama 2022
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168616032&doi=10.11591%2fijaas.v11.i3.pp242-252&partnerID=40&md5=d77445f236c52b38c591b36e9f15b32e
Description
Summary:Agarwood oil is well known for its unique scent and has many usages; as an incense, as ingredient in perfume, is burnt during religious ceremonies and is used in traditional medical preparation. Therefore, agarwood oil has high demand and is traded at different price based on its quality. Basically, the oil quality is classified by using physical properties (odor and color) and this technique has several problems: not consistent in term of accuracy. Thus, this study presented a new technique to classify the quality of agarwood oil based on chemical properties. The work focused on the k-nearest neighbor (k-NN) modelling by varying Mahalanobis and correlation in distance metric for agarwood oil quality classification. It involved of 96 samples of agarwood oil, data pre-processing (data randomization, data normalization, and data division to testing and training datasets) and the development of k-NN model. The training dataset is used to train the k-NN model, and the testing dataset is used to test the developed model. During the model development, Mahalanobis and correlation are varied in k-NN distance metric. The k-NN values are ranging from 1 to 10. Several performance criteria including resubstitution error (closs), cross-validation error (kloss) and accuracy were applied to measure the performance of the built k-NN model. All the analytical work was performed via MATLAB software version R2020a. The result showed that the accuracy of Mahalanobis distance metric has a better performance compared to correlation from k = 1 to k = 5 with the value of 100.00%. This finding is important as it proved the capabilities of k-NN modelling in classifying the agarwood oil quality. Not limited to that, it also contributed to the agarwood oil research area as well as its industry. © 2022, Intelektual Pustaka Media Utama. All rights reserved.
ISSN:22528814
DOI:10.11591/ijaas.v11.i3.pp242-252