计算机应用研究 (Application Research of Computers), 2017, Vol. 34, Issue 5: 1406-1409, 4. DOI: 10.3969/j.issn.1001-3695.2017.05.029
基于混合余弦相似度的中文文本层次关系挖掘
Hierarchical relation mining of Chinese text based on mixed cosine similarity
Abstract
Hierarchical relations are among the most important relations between concepts in Chinese text. Identifying them correctly is fundamental to automatic domain ontology construction, text data mining, and related tasks. This paper first enumerates the possible candidate hierarchical relations and constructs a kernel function classifier based on the semantic cosine similarity of part-of-speech semantic sequences and relation words, which turns the mining problem into a classification problem over hierarchical relations. The classifier is then trained on manually built templates. Finally, preprocessed Chinese text is fed to the kernel function classifier, which decides which of the candidate hierarchical relations hold. Experiments on Chinese text from the domain of Air Force weapons and equipment show that the method is simple and reliable, with good accuracy and recall.
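The abstract does not give the similarity formula or the kernel, so the following is only a minimal sketch of how a mixed cosine similarity over part-of-speech sequences and relation words might be combined and used to label a candidate. The weighting parameter `alpha`, the bag-of-tags representation, the toy templates, and the nearest-template decision rule (used here in place of the paper's kernel function classifier) are all assumptions made for illustration.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def mixed_cosine(pos_a, words_a, pos_b, words_b, alpha=0.5):
    """Weighted mix of POS-tag similarity and relation-word similarity.

    alpha is a hypothetical weight; the abstract does not specify one.
    """
    pos_sim = cosine(Counter(pos_a), Counter(pos_b))
    word_sim = cosine(Counter(words_a), Counter(words_b))
    return alpha * pos_sim + (1 - alpha) * word_sim


def classify(pos_tags, relation_words, templates, alpha=0.5):
    """Return the label of the most similar template.

    A nearest-template rule stands in for the kernel function classifier.
    """
    best = max(
        templates,
        key=lambda t: mixed_cosine(pos_tags, relation_words, t[0], t[1], alpha),
    )
    return best[2]


# Hypothetical manual templates: (POS tags, relation words, label).
TEMPLATES = [
    (("n", "v", "n"), ("包括",), "hierarchical"),
    (("n", "c", "n"), ("和",), "non-hierarchical"),
]

label = classify(("n", "v", "n"), ("包括",), TEMPLATES)
print(label)
```

In this sketch a candidate whose tag pattern and relation word both match the first template is labelled `hierarchical`; a real system would replace the toy templates with tagged output from a Chinese segmenter and could pass the same similarity to a kernel-based classifier.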
Keywords
natural language processing / hierarchical relations / text mining / mixed cosine similarity / ontology construction
Classification
Information Technology and Security Science
Cite this article
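Dong Yangyi (董洋溢), Li Weihua (李伟华), Yu Hui (于会). Hierarchical relation mining of Chinese text based on mixed cosine similarity [J]. 计算机应用研究 (Application Research of Computers), 2017, 34(5): 1406-1409, 4.
Funding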
National Ministries and Commissions Fund, Intelligent Information Processing Supporting Technology Project (513150703)
Natural Science Foundation of Shaanxi Province (2015JM6290)