计算机应用与软件2025,Vol.42Issue(6):167-177,11.DOI:10.3969/j.issn.1000-386x.2025.06.022
基于最优传输理论和格兰杰因果关系检验的文本分类优化
TEXT CLASSIFICATION OPTIMIZATION METHOD BASED ON OPTIMAL TRANSPORT THEORY AND GRANGER CAUSALITY TEST
摘要
Abstract
Deep learning and related pre-training models have achieved good performance in text classification tasks.The conflict between the generalization requirements of the model and the limited data scale is becoming more and more serious.Gradient descent is used to optimize network parameters,which requires that the network transformation must be continuously differentiable.In addition,the optimization process is easy to be trapped into local minimum values.Based on Granger causality test and optimal transport theory,a performance optimization method for deep learning pre-training models is proposed.The randomization algorithm was combined with the data-driven probability distribution algorithm to generate effective features on a small sample dataset based on the Granger causality test.Based on the optimal transport theory,the optimal combination of effective features was learned to compatible with the instability caused by the transmission mapping between continuous and non-continuous high-dimensional manifold structures.The experimental results show that compared with BERT and TextGCN,the accuracy rates on Chinese and English datasets are both improved.关键词
预训练/最优传输理论/数据分布/文本分类Key words
Pre-training/Optimal transport theory/Data distribution/Text classification分类
信息技术与安全科学引用本文复制引用
李静娟,邢凯,聂挺..基于最优传输理论和格兰杰因果关系检验的文本分类优化[J].计算机应用与软件,2025,42(6):167-177,11.基金项目
国家自然科学基金项目(61332004). (61332004)