计算机工程2011,Vol.37Issue(21):124-125,130,3.DOI:10.3969/j.issn.1000-3428.2011.21.042
基于iTopicModel的关联文本分类算法
Relational Text Classification Algorithm Based on iTopicModel
摘要
Abstract
In order to solve the problem that traditional text classification methods do not emphasize the links among text documents enough , this paper proposes a novel text classification algorithm TC-iTM based on iTopicModel. TC-iTM uses the probability that the labeled documents are assigned to each topic to judge the category that each topic represents. TC-iTM classifies unlabelled documents by using the probability that the documents are assigned to each topic and the text information of these documents. Experimental result shows that TC-iTM outperforms the traditional text classification methods when links among documents are important to the categories of the documents in document network.关键词
文本分类/文档网络/主题模型/EM算法Key words
text classification/document network/topic model/EM algorithm分类
信息技术与安全科学引用本文复制引用
梁鹏鹏,柴玉梅,王黎明..基于iTopicModel的关联文本分类算法[J].计算机工程,2011,37(21):124-125,130,3.基金项目
国家自然科学基金资助项目(60970083) (60970083)