Associate professor
Supervisor of Master's Candidates
Hits:
Affiliation of Author(s):计算机科学与工程学院
Journal:Journal of Computers
Funded by:自选课题
Key Words:density peak, ESD anomaly detection, linear regression, near information
Abstract:The fast searching clustering algorithm of the density peak is a simple and efficient density-based clustering algorithm. However, there are shortcomings such as the setting of the truncation distance cd is too sensitive, the similarity measure is too simple, and the artificial selection of the cluster center points is subjective. To deal with these problems, this paper proposes a new density peak clustering algorithm KE-DPC (KNN-ESD-density-peak-cluster) that can automatically select the cluster center points. First, the algorithm uses the near information to adjust the distribution of data samples, and optimizes the similarity measurement criterion in combination with Euclidean distance. Then the local density calculation formula is redefined according to the number of neighbor samples, thereby avoiding the setting of the sensitive cd . Finally, the sample distribution on the decision map is fitted by linear regression to obtain the Residual set, and the cluster center point is automatically obtained according to the characteristics of the Residual analysis in ESD anomaly detection, removing the subjectivity of artificial selection. The experimental results of the artificial data set and UCI standard set show that the KE-DPC algorithm is better than K-means, DBSCAN, DPC, A-DPC and other algorithms.
Co-author:崔世琦,liyong,刘慧
First Author:Kevin Liu
Indexed by:Journal paper
Volume:31
Issue:6
Page Number:2
ISSN No.:1991-1599
Translation or Not:no
Date of Publication:2020-06-01