| 注册
首页|期刊导航|计算机工程与应用|基于词条之间关联关系的文档聚类

基于词条之间关联关系的文档聚类

任建华 沈炎彬 孟祥福 王伟

计算机工程与应用2016,Vol.52Issue(7):86-90,5.
计算机工程与应用2016,Vol.52Issue(7):86-90,5.DOI:10.3778/j.issn.1002-8331.1404-0502

基于词条之间关联关系的文档聚类

Document clustering based on association relations between terms

任建华 1沈炎彬 1孟祥福 1王伟1

作者信息

  • 1. 辽宁工程技术大学 电子与信息工程学院,辽宁 葫芦岛 125105
  • 折叠

摘要

Abstract

For the existing vector space model to omit making insufficient semantic relationships between terms in docu-ments representation, this paper proposes a novel document vector representation approach based association relationship. In terms of generalized vector space model, it captures the frequent co-occurrence semantic relations between terms, and then analyzes the correlation between related terms based on association rules, digging out the potential relevance of semantic relationships between terms in the document. It represents documents with linear weighting co-occurrence semantic rela-tions with association semantic. Experimental results show that, compared with the BOW model and GVSM model, the clustering results using association rules document vector represented are more accurate.

关键词

文档聚类/关联关系/词条同现/文档相似度/潜在语义

Key words

document clustering/association/terms co-occurrence/document similarity/latent semantic

分类

信息技术与安全科学

引用本文复制引用

任建华,沈炎彬,孟祥福,王伟..基于词条之间关联关系的文档聚类[J].计算机工程与应用,2016,52(7):86-90,5.

基金项目

国家青年科学基金(No.61003162);辽宁省教育厅一般项目(No.L2013131)。 ()

计算机工程与应用

OA北大核心CSCDCSTPCD

1002-8331

访问量0
|
下载量0
段落导航相关论文