所属单位:计算机科学与工程学院
发表刊物:Journal of Computers
项目来源:自选课题
关键字:density peak, ESD anomaly detection, linear regression, near information
摘要:The fast searching clustering algorithm of the density peak is a simple and efficient density-based clustering algorithm. However, there are shortcomings such as the setting of the truncation distance cd is too sensitive, the similarity measure is too simple, and the artificial selection of the cluster center points is subjective. To deal with these problems, this paper proposes a new density peak clustering algorithm KE-DPC (KNN-ESD-density-peak-cluster) that can automatically select the cluster center points. First, the algorithm uses the near information to adjust the distribution of data samples, and optimizes the similarity measurement criterion in combination with Euclidean distance. Then the local density calculation formula is redefined according to the number of neighbor samples, thereby avoiding the setting of the sensitive cd . Finally, the sample distribution on the decision map is fitted by linear regression to obtain the Residual set, and the cluster center point is automatically obtained according to the characteristics of the Residual analysis in ESD anomaly detection, removing the subjectivity of artificial selection. The experimental results of the artificial data set and UCI standard set show that the KE-DPC algorithm is better than K-means, DBSCAN, DPC, A-DPC and other algorithms.
合写作者:崔世琦,李勇,刘慧
第一作者:刘冰
论文类型:期刊论文
卷号:31
期号:6
页面范围:2
ISSN号:1991-1599
是否译文:否
发表时间:2020-06-01
