Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil

This study formulates a K-Nearest Neighbor (K-NN) model for the classification of Aquilaria (agarwood) oil grades by varying distance metrics: Mahalanobis, Correlation and Cosine. A total of 96 agarwood oil samples were analyzed using Gas chromatography-mass spectrometry (GC-MS), identifying key che...

全面介紹

書目詳細資料
發表在:JOURNAL OF ESSENTIAL OIL BEARING PLANTS
Main Authors: Yusoff, Zakiah Mohd; Ismail, Nurlaila; Sabri, Noor Aida Syakira Ahmad
格式: Article; Early Access
語言:English
出版: TAYLOR & FRANCIS LTD 2025
主題:
在線閱讀:https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001437517500001
author Yusoff
Zakiah Mohd; Ismail
Nurlaila; Sabri
Noor Aida Syakira Ahmad
spellingShingle Yusoff
Zakiah Mohd; Ismail
Nurlaila; Sabri
Noor Aida Syakira Ahmad
Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil
Plant Sciences
author_facet Yusoff
Zakiah Mohd; Ismail
Nurlaila; Sabri
Noor Aida Syakira Ahmad
author_sort Yusoff
spelling Yusoff, Zakiah Mohd; Ismail, Nurlaila; Sabri, Noor Aida Syakira Ahmad
Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil
JOURNAL OF ESSENTIAL OIL BEARING PLANTS
English
Article; Early Access
This study formulates a K-Nearest Neighbor (K-NN) model for the classification of Aquilaria (agarwood) oil grades by varying distance metrics: Mahalanobis, Correlation and Cosine. A total of 96 agarwood oil samples were analyzed using Gas chromatography-mass spectrometry (GC-MS), identifying key chemical compounds contributing to quality differentiation. The proposed model achieved 100% classification accuracy at k = 1 across all distance metrics with Mahalanobis distance consistently outperforming others as k increased. Cross-validation results demonstrated the lowest error rates for Mahalanobis followed by Correlation and Cosine, confirming its robustness in high-dimensional chemical datasets. These findings highlight the effectiveness of machine learning in agarwood oil grading and underscore the importance of selecting optimal distance metrics for improved classification accuracy. Future work could incorporate ensemble learning and advanced feature selection to further refine performance.
TAYLOR & FRANCIS LTD
0972-060X
0976-5026
2025


10.1080/0972060X.2025.2469677
Plant Sciences
Green Submitted
WOS:001437517500001
https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001437517500001
title Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil
title_short Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil
title_full Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil
title_fullStr Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil
title_full_unstemmed Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil
title_sort Formulation of K-nearest neighbor model by varying the distance metrics of mahalanobis, correlation and cosine in discriminating different grades of Aquilaria oil
container_title JOURNAL OF ESSENTIAL OIL BEARING PLANTS
language English
format Article; Early Access
description This study formulates a K-Nearest Neighbor (K-NN) model for the classification of Aquilaria (agarwood) oil grades by varying distance metrics: Mahalanobis, Correlation and Cosine. A total of 96 agarwood oil samples were analyzed using Gas chromatography-mass spectrometry (GC-MS), identifying key chemical compounds contributing to quality differentiation. The proposed model achieved 100% classification accuracy at k = 1 across all distance metrics with Mahalanobis distance consistently outperforming others as k increased. Cross-validation results demonstrated the lowest error rates for Mahalanobis followed by Correlation and Cosine, confirming its robustness in high-dimensional chemical datasets. These findings highlight the effectiveness of machine learning in agarwood oil grading and underscore the importance of selecting optimal distance metrics for improved classification accuracy. Future work could incorporate ensemble learning and advanced feature selection to further refine performance.
publisher TAYLOR & FRANCIS LTD
issn 0972-060X
0976-5026
publishDate 2025
container_volume
container_issue
doi_str_mv 10.1080/0972060X.2025.2469677
topic Plant Sciences
topic_facet Plant Sciences
accesstype Green Submitted
id WOS:001437517500001
url https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001437517500001
record_format wos
collection Web of Science (WoS)
_version_ 1828987784416722944