高级检索
李蓟涛, 梁永全. 基于最小生成树的分割区域密度聚类算法[J]. 计算机辅助设计与图形学学报, 2019, 31(9): 1628-1635. DOI: 10.3724/SP.J.1089.2019.17716
引用本文: 李蓟涛, 梁永全. 基于最小生成树的分割区域密度聚类算法[J]. 计算机辅助设计与图形学学报, 2019, 31(9): 1628-1635. DOI: 10.3724/SP.J.1089.2019.17716
Li Jitao, Liang Yongquan. Segmentation Region Density Clustering Algorithm Based on Minimum Spanning Tree[J]. Journal of Computer-Aided Design & Computer Graphics, 2019, 31(9): 1628-1635. DOI: 10.3724/SP.J.1089.2019.17716
Citation: Li Jitao, Liang Yongquan. Segmentation Region Density Clustering Algorithm Based on Minimum Spanning Tree[J]. Journal of Computer-Aided Design & Computer Graphics, 2019, 31(9): 1628-1635. DOI: 10.3724/SP.J.1089.2019.17716

基于最小生成树的分割区域密度聚类算法

Segmentation Region Density Clustering Algorithm Based on Minimum Spanning Tree

  • 摘要: 针对传统密度聚类算法因使用全局变量导致对不平衡数据集的适应能力较差的问题,提出了一种基于最小生成树的密度聚类算法.首先进行数据集密度峰值计算,用于估计全局密度;然后通过密度聚类将数据集划分为高密度区域和低密度区域;接着构建和分割最小生成树对低密度区域内样本进行关联挖掘,用于将高密度区域与低密度区域互联;最后计算簇密度并以此作为特征进行簇合并,得到聚类结果.该算法结合图论知识,将数据按密度特征进行分块后合并处理,克服了传统密度聚类算法存在的局限性.通过选取多个不平衡人工数据集和UCI数据集对该算法进行测试,验证了该算法的有效性与鲁棒性.

     

    Abstract: To solve the problem that the traditional density clustering algorithms have poor adaptability to imbalanced data sets due to the use of global variables,a density clustering algorithm based on minimum spanning tree is proposed.Firstly,a data set density peaks calculation is used to estimate global density.Secondly,density clustering aims to separate the high-density clusters and low-density area.Thirdly,the minimum spanning tree is constructed and segmented to mining the associations within low density areas,and construct interconnection between high density areas and low density areas.Finally,compute all clusters’density as feature of merging the clusters,and obtain the result.This algorithm combines the knowledge of graph theory,processing the data set by segmentation and combination according to density feature,so that overcomes the limitations of traditional density clustering algorithms.By selecting multiple imbalanced artificial data sets and UCI data sets for test,we verify the effectiveness and robustness of this algorithm.

     

/

返回文章
返回