Hierarchical relation mining of Chinese text based on mixed cosine similarity

Dong Yangyi (董洋溢), Li Weihua (李伟华), Yu Hui (于会)

Application Research of Computers (计算机应用研究), 2017, Vol. 34, Issue 5: 1406-1409, 4. DOI: 10.3969/j.issn.1001-3695.2017.05.029

Dong Yangyi (董洋溢) ¹, Li Weihua (李伟华) ¹, Yu Hui (于会) ¹

Author information

  • 1. School of Computer Science, Northwestern Polytechnical University, Xi'an 710072, China

Abstract

Hierarchical relations are among the most important relations between concepts in Chinese text, and determining them correctly is fundamental to the automatic construction of domain ontologies and to text data mining. This paper first enumerates the candidate hierarchical relations and constructs a kernel-function classifier based on the semantic cosine similarity of part-of-speech semantic sequences and relation words, so that the mining problem becomes a classification problem over hierarchical relations. The classifier is then trained on manually built templates. Finally, the Chinese text is preprocessed and passed to the kernel-function classifier, which judges each candidate hierarchical relation. Experiments on Chinese text from the domain of Air Force weapons and equipment show that the method is simple and reliable, with good accuracy and recall.
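
The abstract does not give the similarity formula, so the following is only a minimal sketch of what a "mixed" cosine similarity could look like, not the authors' implementation. It assumes that each candidate is represented by a part-of-speech tag sequence plus a list of relation words, that both are compared as bag-of-counts vectors, and that the two cosine scores are blended with a weight `alpha`. The names `mixed_cosine`, `classify`, `alpha`, and `threshold`, the tag set, and the sample templates are all hypothetical. A nearest-template rule stands in here for the paper's kernel-function classifier.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as dissimilar to everything
    return dot / (norm_a * norm_b)


def mixed_cosine(pos_a, words_a, pos_b, words_b, alpha=0.5):
    """Blend POS-sequence similarity and relation-word similarity.

    alpha is an assumed mixing weight in [0, 1]; the paper's actual
    weighting is not stated in the abstract.
    """
    pos_sim = cosine(Counter(pos_a), Counter(pos_b))
    word_sim = cosine(Counter(words_a), Counter(words_b))
    return alpha * pos_sim + (1 - alpha) * word_sim


def classify(candidate, templates, threshold=0.6, alpha=0.5):
    """Label a candidate by its most similar manual template.

    candidate: (pos_tags, relation_words)
    templates: list of (pos_tags, relation_words, label)
    Returns (label or None, best score); None means no template
    reached the assumed threshold.
    """
    best_label, best_score = None, 0.0
    for pos, words, label in templates:
        score = mixed_cosine(candidate[0], candidate[1], pos, words, alpha)
        if score > best_score:
            best_label, best_score = label, score
    if best_score < threshold:
        return None, best_score
    return best_label, best_score


# Hypothetical hand-written templates: (POS tags, relation words, label).
TEMPLATES = [
    (["n", "v", "n"], ["包括"], "hierarchical"),
    (["n", "v", "n"], ["攻击"], "non-hierarchical"),
]

label, score = classify((["n", "v", "n"], ["包括"]), TEMPLATES)
print(label, round(score, 3))
```

Cosine similarity is itself a valid kernel, and a weighted sum of kernels with non-negative weights is again a kernel, so a blend of this shape could in principle be plugged into a kernel classifier in place of the nearest-template rule used in this sketch.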

Key words

natural language processing/hierarchical relations/text mining/mixed cosine similarity/ontology construction

Classification

Information technology and security science

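Cite this article

Dong Yangyi, Li Weihua, Yu Hui. Hierarchical relation mining of Chinese text based on mixed cosine similarity [J]. Application Research of Computers, 2017, 34(5): 1406-1409, 4.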

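Funding

National Ministries and Commissions Fund, Intelligent Information Processing Supporting Technology Project (513150703)

Natural Science Foundation of Shaanxi Province (2015JM6290)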

Application Research of Computers (计算机应用研究)

OA; Peking University Core (北大核心); CSCD; CSTPCD

ISSN 1001-3695
