Research on Chinese Text Clustering Based on Sentence Structure Analysis
基于句法结构分析的中文文本聚类方法研究

尹积栋 1, 谢茶花 1, 彭崧 1, 刘红 1, 曾昭虎 1

计算机与数字工程 (Computer & Digital Engineering), 2018, Vol. 46, Issue 5: 933-935, 1067, 4. DOI: 10.3969/j.issn.1672-9722.2018.05.017

Author information

  • 1. 吉安职业技术学院 (Ji'an Vocational and Technical College), Ji'an 343000

Abstract

Most existing K-means clustering algorithms are based on data carriers and are difficult to apply to Chinese text clustering analysis. This paper introduces a new text clustering method based on sentence structure analysis, which can accurately calculate semantic similarity and cluster texts. The method combines the advantages of an improved K-means algorithm with sentence structure analysis, which is defined to reduce the complexity of the text set and thereby improve the accuracy of the semantic similarity calculation between texts. Experimental results show that the method achieves a higher precision (0.96) than some widely used clustering methods.

Keywords

text clustering (文本聚类) / K-means / sentence structure analysis (句法结构分析)

Classification

Information Technology and Security Science

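Cite this article

尹积栋, 谢茶花, 彭崧, 刘红, 曾昭虎. Research on Chinese Text Clustering Based on Sentence Structure Analysis [J]. 计算机与数字工程 (Computer & Digital Engineering), 2018, 46(5): 933-935, 1067, 4.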

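Funding

Supported by the Science and Technology Research Project of the Jiangxi Provincial Department of Education, "Research and Application of Text Clustering Methods Based on Sentence Structure Analysis" (No. GJJ151492).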

Journal: 计算机与数字工程 (Computer & Digital Engineering)

OA / CSTPCD

ISSN: 1672-9722
