Performance comparison of k nearest neighbor classifier with different distance functions
In the field of pattern recognition, K Nearest Neighbor is the classifier algorithm that use distance function to measure similarity between two samples. The well-known distance function used is the Euclidean distance which sees all samples including noisy or outliers with equal important. Euclidean...
Published in: | AIP Conference Proceedings |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
American Institute of Physics
2024
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85188428801&doi=10.1063%2f5.0192229&partnerID=40&md5=a5c61175518729d3d2350a9798e65ad9 |
id |
2-s2.0-85188428801 |
---|---|
spelling |
2-s2.0-85188428801 Mukahar N. Performance comparison of k nearest neighbor classifier with different distance functions 2024 AIP Conference Proceedings 2895 1 10.1063/5.0192229 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85188428801&doi=10.1063%2f5.0192229&partnerID=40&md5=a5c61175518729d3d2350a9798e65ad9 In the field of pattern recognition, K Nearest Neighbor is the classifier algorithm that use distance function to measure similarity between two samples. The well-known distance function used is the Euclidean distance which sees all samples including noisy or outliers with equal important. Euclidean distance is highly influenced by the noisy sample or outliers, and the value returned by similarity metrics may be affected which in turn it will deteriorate the classification performance. This paper conducts experimental comparisons of several distance functions in the KNN classification including Manhattan, Angular, Chebyshev, Cosine, Euclidean, Histogram, Kalmogorov, Mahalanobis, Match and Minkowski. Evaluation of the distance function are made on the 31 selected real-world datasets of different natures from UCI repository and the results show that Manhattan performs better over other distance functions by achieving classification accuracy at 84.63%. © 2024 AIP Publishing LLC. American Institute of Physics 0094243X English Conference paper All Open Access; Bronze Open Access |
author |
Mukahar N. |
spellingShingle |
Mukahar N. Performance comparison of k nearest neighbor classifier with different distance functions |
author_facet |
Mukahar N. |
author_sort |
Mukahar N. |
title |
Performance comparison of k nearest neighbor classifier with different distance functions |
title_short |
Performance comparison of k nearest neighbor classifier with different distance functions |
title_full |
Performance comparison of k nearest neighbor classifier with different distance functions |
title_fullStr |
Performance comparison of k nearest neighbor classifier with different distance functions |
title_full_unstemmed |
Performance comparison of k nearest neighbor classifier with different distance functions |
title_sort |
Performance comparison of k nearest neighbor classifier with different distance functions |
publishDate |
2024 |
container_title |
AIP Conference Proceedings |
container_volume |
2895 |
container_issue |
1 |
doi_str_mv |
10.1063/5.0192229 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85188428801&doi=10.1063%2f5.0192229&partnerID=40&md5=a5c61175518729d3d2350a9798e65ad9 |
description |
In the field of pattern recognition, K Nearest Neighbor is the classifier algorithm that use distance function to measure similarity between two samples. The well-known distance function used is the Euclidean distance which sees all samples including noisy or outliers with equal important. Euclidean distance is highly influenced by the noisy sample or outliers, and the value returned by similarity metrics may be affected which in turn it will deteriorate the classification performance. This paper conducts experimental comparisons of several distance functions in the KNN classification including Manhattan, Angular, Chebyshev, Cosine, Euclidean, Histogram, Kalmogorov, Mahalanobis, Match and Minkowski. Evaluation of the distance function are made on the 31 selected real-world datasets of different natures from UCI repository and the results show that Manhattan performs better over other distance functions by achieving classification accuracy at 84.63%. © 2024 AIP Publishing LLC. |
publisher |
American Institute of Physics |
issn |
0094243X |
language |
English |
format |
Conference paper |
accesstype |
All Open Access; Bronze Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677673800138752 |