首页|期刊导航|广西科学|CDKM:基于K-means聚类的因果分解方法

CDKM:基于K-means聚类的因果分解方法

韦慧娴韦程东陈少凡何国源李冶通

广西科学2025，Vol.32Issue(1)：121-131,11.

广西科学2025，Vol.32Issue(1)：121-131,11.DOI:10.13656/j.cnki.gxkx.20240709.001

CDKM:基于K-means聚类的因果分解方法

CDKM:A Causal Decomposition Method Based on K-means Clustering

韦慧娴 ¹韦程东 ¹陈少凡 ²何国源 ³李冶通¹

作者信息

1. 南宁师范大学数学与统计学院,广西南宁 530100
2. 广西科学院《广西科学》编辑部,广西南宁 530007
3. 贺州学院经济与管理学院,广西贺州 542899
折叠

摘要

Abstract

Redundant conditional independence tests have seriously affected the efficiency and accuracy of con-straint-based methods in causal discovery.To solve this problem,a causal decomposition method based on K-means clustering(CDKM)is proposed.CDKM divides the original causal discovery problem into multiple sub-causal discovery problems by using K-means clustering and then merges the sub-causal networks to ob-tain a complete causal network.Specifically,CDKM first uses K-means clustering to divide the original varia-ble set into k clusters and then adds two nodes with the smallest correlation distance to the current cluster from other clusters to each cluster to obtain updated k clusters.After that,it discovers causality in each clus-ter and obtain various sub-causal networks.Finally,it merges all the sub-causal networks to obtain a com-plete causal network.CDKM avoids the decomposition using high-order conditional independence tests and reduces redundant conditional independence tests.Compared with recursive constraint-based methods,CDKM can divide the original variable set into any segments.Experimental results on 8 datasets show that CDKM can greatly accelerate causal discovery,reduce time complexity,and achieve higher accuracy than baseline models.

关键词

因果发现/因果分解/K-means聚类/因果网络/条件独立性测试

Key words

causal discovery/causal decomposition/K-means clustering/causal network/conditional independ-ence test

分类

信息技术与安全科学

引用本文复制引用

韦慧娴,韦程东,陈少凡,何国源,李冶通..CDKM:基于K-means聚类的因果分解方法[J].广西科学,2025,32(1):121-131,11.

基金项目

国家自然科学基金项目(11561010)资助. （11561010）

广西科学

OA北大核心

ISSN：1005-9164

访问量0

下载量0

段落导航