Analysis of distance metric variations in KNN for agarwood oil compounds differentiation
This paper analyzes how the choice of distance metric affects k-Nearest Neighbor (KNN) classification of agarwood oil compounds. KNN models were developed with varying distance metrics. The inputs are the abundances (%) of agarwood oil compounds, and the output is agarwood oil qua...
Published in: | Proceedings - 2017 IEEE Conference on Systems, Process and Control, ICSPC 2017 |
---|---|
Main Author: | Samad M.E.M. |
Format: | Conference paper |
Language: | English |
Published: | Institute of Electrical and Electronics Engineers Inc., 2017 |
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85050690023&doi=10.1109%2fSPC.2017.8313038&partnerID=40&md5=4e783dbad59a3418dbc443efc24dfa29 |
id |
2-s2.0-85050690023 |
---|---|
author |
Samad M.E.M.; Ismail N.; Rahiman M.H.F.; Taib M.N.; Ali N.A.M.; Tajuddin S.N. |
title |
Analysis of distance metric variations in KNN for agarwood oil compounds differentiation |
publishDate |
2017 |
container_title |
Proceedings - 2017 IEEE Conference on Systems, Process and Control, ICSPC 2017 |
container_volume |
2018-January |
doi_str_mv |
10.1109/SPC.2017.8313038 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85050690023&doi=10.1109%2fSPC.2017.8313038&partnerID=40&md5=4e783dbad59a3418dbc443efc24dfa29 |
description |
This paper analyzes how the choice of distance metric affects k-Nearest Neighbor (KNN) classification of agarwood oil compounds. KNN models were developed with varying distance metrics. The inputs are the abundances (%) of agarwood oil compounds, and the output is the agarwood oil quality, either high or low. The data were divided into training and testing datasets in a ratio of 80% to 20%. The training dataset was used to develop KNN models for K = 1 to K = 5, and the testing dataset was used to test the developed models. During training, the distance metric was varied among Euclidean, City-block, Cosine, and Correlation, and the performance of each was recorded. All analytical work was performed in MATLAB R2014b. Among the four distance metrics, Euclidean and City-block yielded 100% accuracy on both the training and testing datasets, while Cosine and Correlation yielded 89.5% accuracy. The accuracy of all distance metrics was above 80%, indicating a good KNN model. These findings demonstrate the capability of KNN to differentiate agarwood oil compounds into high or low quality, and they contribute to further research on agarwood oil grading systems. © 2017 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
language |
English |
format |
Conference paper |
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677606778306560 |
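
The procedure summarized in the description (an 80/20 train/test split, K from 1 to 5, and four distance metrics) can be sketched in a few lines. This is only an illustration under stated assumptions: the paper used MATLAB R2014b, whereas this sketch uses plain Python; the abundance values below are invented toy data, not the paper's dataset; and the helper names (`knn_predict`, `evaluate`) are hypothetical. Cosine and correlation distances are taken as one minus the cosine similarity and one minus the Pearson correlation, which is the usual convention. The accuracies it prints say nothing about the 100% and 89.5% figures reported in the record.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Straight-line distance between two abundance vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cityblock(a, b):
    """Sum of absolute differences (Manhattan distance)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def cosine(a, b):
    """One minus the cosine similarity. Undefined for all-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def correlation(a, b):
    """One minus the Pearson correlation. Undefined for constant vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return cosine([x - mean_a for x in a], [y - mean_b for y in b])


METRICS = {
    "euclidean": euclidean,
    "cityblock": cityblock,
    "cosine": cosine,
    "correlation": correlation,
}


def knn_predict(train_x, train_y, query, k, metric):
    """Majority vote among the k nearest training samples.

    Ties go to the label that appears first among the k neighbours,
    i.e. the label of the closest sample in the tied group.
    """
    ranked = sorted(zip(train_x, train_y), key=lambda p: metric(p[0], query))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Invented toy data: abundances (%) of three hypothetical compounds per sample.
HIGH = [[9.8, 1.2, 0.6], [10.5, 0.9, 0.4], [11.1, 1.5, 0.7],
        [9.2, 1.0, 0.5], [10.0, 1.3, 0.8]]
LOW = [[1.1, 6.2, 3.1], [0.8, 5.7, 2.9], [1.4, 6.8, 3.5],
       [0.9, 6.0, 2.7], [1.2, 5.5, 3.3]]

# Fixed 80/20 split: the first four samples of each class train, the last tests.
TRAIN_X = HIGH[:4] + LOW[:4]
TRAIN_Y = ["high"] * 4 + ["low"] * 4
TEST_X = HIGH[4:] + LOW[4:]
TEST_Y = ["high", "low"]


def evaluate():
    """Return {(metric_name, k): test accuracy} for K = 1..5."""
    results = {}
    for name, metric in METRICS.items():
        for k in range(1, 6):
            hits = sum(
                knn_predict(TRAIN_X, TRAIN_Y, x, k, metric) == y
                for x, y in zip(TEST_X, TEST_Y)
            )
            results[(name, k)] = hits / len(TEST_Y)
    return results


if __name__ == "__main__":
    for (name, k), acc in evaluate().items():
        print(f"{name:12s} K={k}  test accuracy={acc:.2f}")
```

Keeping each metric as a plain function makes the comparison a matter of swapping one argument, which mirrors the study's design of holding the classifier fixed while varying only the distance metric.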