Optimizing the K-Nearest Neighbor Algorithm for Text Categorization
R Angeeshwari Angeeshwari
Paper Contents
Abstract
K-Nearest Neighbor (KNN) classification algorithm is one of the simplest methods of data mining. It has been widely used in classification, regression, and pattern recognition. The traditional KNN method has some shortcomings such as large amount of sample computation and strong dependence on the sample library capacity. In this paper, a method of representative sample optimization based on CURE algorithm is proposed. Based on this, presenting a quick algorithm QKNN (Quick k-nearest neighbor) to find the nearest k neighbor samples, which greatly reduces the similarity calculation. The experimental results show that this algorithm can effectively reduce the number of samples and speed up the search for the k nearest neighbor samples to improve the performance of the algorithm.
Copyright
Copyright © 2024 R Angeeshwari. This is an open access article distributed under the Creative Commons Attribution License.