| 注册
首页|期刊导航|计算机工程与应用|使用多元语义特征的评论文本主题聚类

使用多元语义特征的评论文本主题聚类

李亚红 王素格 李德玉

计算机工程与应用2013,Vol.49Issue(2):188-193,6.
计算机工程与应用2013,Vol.49Issue(2):188-193,6.DOI:10.3778/j.issn.1002-8331.1210-0169

使用多元语义特征的评论文本主题聚类

Exploiting multiple semantic features for comment text topic clustering

李亚红 1王素格 1李德玉2

作者信息

  • 1. 山西大学计算机与信息技术学院,太原030006
  • 2. 山西大学计算机与信息技术学院,计算智能与中文信息处理教育部重点实验室,太原030006
  • 折叠

摘要

Abstract

The feature is a key to the tasks of emotional analysis and opinion mining. Particularly for unsupervised text clustering task, the text feature quality directly affects the clustering results. This paper studies three kinds of semantic features, namely nouns features, noun phrase features, semantic role features and their role on the text topic clustering. And considering the compatibility between the different features, a method is proposed to eliminate redundant features. The method can effectively remove redundant features to improve the clustering accuracy. Also another method is proposed based on semantic role labeling to directly and effectively locate word features for topic clustering. The experimental results indicate that the method is direct and effective, and a new approach to feature selection method is provided.

关键词

文本主题聚类/名词特征/短语特征/语义角色特征/相容关系

Key words

text topic clustering/ nouns features/ nouns phrase features/ semantic role features/ compatibility relation

分类

信息技术与安全科学

引用本文复制引用

李亚红,王素格,李德玉..使用多元语义特征的评论文本主题聚类[J].计算机工程与应用,2013,49(2):188-193,6.

基金项目

国家自然科学基金(No.61175067,No.60970014,No.61272095) (No.61175067,No.60970014,No.61272095)

山西省自然科学基金(No.2010011021-1) (No.2010011021-1)

山西省科技攻关项目(No.20110321027-02). (No.20110321027-02)

计算机工程与应用

OACSCDCSTPCD

1002-8331

访问量0
|
下载量0
段落导航相关论文