运筹与管理2025,Vol.34Issue(9):77-83,7.DOI:10.12005/orms.2025.0278
基于改进蜣螂优化算法的K-means聚类
K-means Clustering Based on Improved Dung Beetle Optimizer
摘要
Abstract
Cluster analysis is a data analysis method used to group similar data points into different categories or clusters.It is an unsupervised learning approach that does not require pre-defined class labels but rather auto-matically classifies data points based on their similarity.Through cluster analysis,similar samples are assigned to the same group,revealing similarities and differences among samples,and providing a preliminary classification of the data.Cluster analysis has been widely applied in various fields such as data mining,image processing,natural language processing,and market segmentation. K-means clustering algorithm is the most commonly used algorithm in cluster analysis due to its simplicity,scalability,suitability for high-dimensional data,and robustness.However,K-means algorithm is highly sensi-tive to the initial selection of cluster centers,and improper initialization can lead to inaccurate or unstable cluste-ring results.Swarm intelligence algorithms,which are stochastic search algorithms capable of escaping local opti-ma,have been adopted by researchers to optimize clustering algorithms and have shown promising results.Dung beetle optimization algorithm(DBO)is a swarm intelligence optimization algorithm proposed in 2022,inspired by the rolling,dancing,foraging,stealing,and reproduction behaviors of dung beetles.Compared to classical algorithms like particle swarm optimization and whale optimization algorithm,DBO exhibits better optimization performance.However,like other swarm intelligence algorithms,the dung beetle optimization algorithm may suffer from uneven distribution and lack of population diversity during the initialization of the population.Addi-tionally,during the rolling phase where positions are updated,the algorithm relies solely on the worst value for updating,resulting in a weaker global exploration capability. To overcome the limitations of K-means clustering's heavy reliance on initial cluster centers,a novel K-means clustering algorithm based on an improved beetle optimization algorithm,called POTDBO-K-means,is proposed in this study.Firstly,the beetle optimization algorithm is enhanced by incorporating a Piecewise Linear Chaotic Map(PWLCM)to improve population diversity,enhance solution accuracy,and accelerate conver-gence.Secondly,inspired by the osprey optimization algorithm for position recognition and fishing strategy,replacing the dung beetle optimization algorithm's rolling stage strategy with its global exploration strategy can compensate for the algorithm's reliance on only the worst value and its inability to communicate with other dung beetles during the rolling stage,thereby enhancing the algorithm's global exploration capability.Then,a dynamically selected adaptive t-distribution perturbation is introduced to increase both global exploitation and local search capabilities.The effectiveness and superiority of the improved dung beetle optimizer are verified through experiments on CEC2017 test functions.Finally,the improved dung beetle optimizer is combined with the K-means clustering algorithm and compared with other K-means clustering algorithms enhanced by swarm intelligence algorithms proposed by other researchers.The comparison is conducted on six UCI datasets with different characteristics.The simulation results demonstrate that the POTDBO-K-means algorithm exhibits faster convergence,stronger optimization ability,and higher clustering accuracy. In future work,the proposed POTDBO-K-means clustering algorithm can be applied to address challenging problems such as credit risk assessment,potential customer segmentation for the automotive industry,and user profiling for insurance products.Furthermore,further research will be conducted to combine swarm intelligence algorithms with K-means clustering in order to improve the convergence speed and clustering accuracy of the K-means algorithm.关键词
蜣螂优化算法/PWLCM映射/K-means聚类算法/自适应t分布Key words
dung beetle optimization algorithm/PWLCM mapping/K-means clustering algorithm/adaptive t-distribution分类
信息技术与安全科学引用本文复制引用
马志海,刘升..基于改进蜣螂优化算法的K-means聚类[J].运筹与管理,2025,34(9):77-83,7.基金项目
国家自然科学基金资助项目(61673258,61075115) (61673258,61075115)
上海市自然科学基金资助项目(19ZR1421600) (19ZR1421600)