Analysis of distance metric variations in KNN for agarwood oil compounds differentiation

This paper presents the analysis of distance metric variations in KNN for agarwood oil compounds differentiation. The work involved of the development of k-Nearest Neighbor (KNN) by varying the distance metrics. The input is abundances (%) of agarwood oil compounds and the output is agarwood oil qua...

Full description

Bibliographic Details
Published in:Proceedings - 2017 IEEE Conference on Systems, Process and Control, ICSPC 2017
Main Author: Samad M.E.M.; Ismail N.; Rahiman M.H.F.; Taib M.N.; Ali N.A.M.; Tajuddin S.N.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2017
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85050690023&doi=10.1109%2fSPC.2017.8313038&partnerID=40&md5=4e783dbad59a3418dbc443efc24dfa29
id 2-s2.0-85050690023
spelling 2-s2.0-85050690023
Samad M.E.M.; Ismail N.; Rahiman M.H.F.; Taib M.N.; Ali N.A.M.; Tajuddin S.N.
Analysis of distance metric variations in KNN for agarwood oil compounds differentiation
2017
Proceedings - 2017 IEEE Conference on Systems, Process and Control, ICSPC 2017
2018-January

10.1109/SPC.2017.8313038
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85050690023&doi=10.1109%2fSPC.2017.8313038&partnerID=40&md5=4e783dbad59a3418dbc443efc24dfa29
This paper presents the analysis of distance metric variations in KNN for agarwood oil compounds differentiation. The work involved of the development of k-Nearest Neighbor (KNN) by varying the distance metrics. The input is abundances (%) of agarwood oil compounds and the output is agarwood oil quality either high or low. The data is divided into two parts; training and testing dataset with ratio of 80% and 20% respectively. The training dataset is used to develop the KNN model from K equal to 1 until K equal to 5, and the testing dataset is used to test the developed model. During the training, distance metric parameters were varied using Euclidean, City-block, Cosine, and Correlation. The performance of each parameter was recorded and observed. All the analytical works are performed automatically via MATLAB software version R2014b. The results showed that, among four distance metric variations, Euclidean and City-block yield 100% accuracy for both training and testing dataset. After that, 89.5% of accuracy was obtained by Cosine and Correlation. In general, the accuracy yielded by all distance metrics is above 80.00% and indicating a good KNN model. This finding proved the capability of KNN in differentiating the agarwood oil compounds to high or low qualities. The results in this study are important and contributed to further research work in agarwood oil grading system. © 2017 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Samad M.E.M.; Ismail N.; Rahiman M.H.F.; Taib M.N.; Ali N.A.M.; Tajuddin S.N.
spellingShingle Samad M.E.M.; Ismail N.; Rahiman M.H.F.; Taib M.N.; Ali N.A.M.; Tajuddin S.N.
Analysis of distance metric variations in KNN for agarwood oil compounds differentiation
author_facet Samad M.E.M.; Ismail N.; Rahiman M.H.F.; Taib M.N.; Ali N.A.M.; Tajuddin S.N.
author_sort Samad M.E.M.; Ismail N.; Rahiman M.H.F.; Taib M.N.; Ali N.A.M.; Tajuddin S.N.
title Analysis of distance metric variations in KNN for agarwood oil compounds differentiation
title_short Analysis of distance metric variations in KNN for agarwood oil compounds differentiation
title_full Analysis of distance metric variations in KNN for agarwood oil compounds differentiation
title_fullStr Analysis of distance metric variations in KNN for agarwood oil compounds differentiation
title_full_unstemmed Analysis of distance metric variations in KNN for agarwood oil compounds differentiation
title_sort Analysis of distance metric variations in KNN for agarwood oil compounds differentiation
publishDate 2017
container_title Proceedings - 2017 IEEE Conference on Systems, Process and Control, ICSPC 2017
container_volume 2018-January
container_issue
doi_str_mv 10.1109/SPC.2017.8313038
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85050690023&doi=10.1109%2fSPC.2017.8313038&partnerID=40&md5=4e783dbad59a3418dbc443efc24dfa29
description This paper presents the analysis of distance metric variations in KNN for agarwood oil compounds differentiation. The work involved of the development of k-Nearest Neighbor (KNN) by varying the distance metrics. The input is abundances (%) of agarwood oil compounds and the output is agarwood oil quality either high or low. The data is divided into two parts; training and testing dataset with ratio of 80% and 20% respectively. The training dataset is used to develop the KNN model from K equal to 1 until K equal to 5, and the testing dataset is used to test the developed model. During the training, distance metric parameters were varied using Euclidean, City-block, Cosine, and Correlation. The performance of each parameter was recorded and observed. All the analytical works are performed automatically via MATLAB software version R2014b. The results showed that, among four distance metric variations, Euclidean and City-block yield 100% accuracy for both training and testing dataset. After that, 89.5% of accuracy was obtained by Cosine and Correlation. In general, the accuracy yielded by all distance metrics is above 80.00% and indicating a good KNN model. This finding proved the capability of KNN in differentiating the agarwood oil compounds to high or low qualities. The results in this study are important and contributed to further research work in agarwood oil grading system. © 2017 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1792585534282924032