计算机工程与科学2012,Vol.34Issue(2):172-175,4.DOI:10.3969/j.issn.1007-130X.2012.02.031
一种基于知网的句子相似度计算方法
A Method of Sentence Similarity Computing Based on Hownet
摘要
Abstract
Sentence similarity is the basis of document similarity, and sentence similarity computing plays an important role in the field of natural language processing. The current methods of sentence similarity computing neglect the influence of sentence structure. On the basis of the interrelated research, this paper proposes an improved method of similarity comparison. The semantic tree of sememe is constructed according to the description of entity conception in the Hownet, the semantic similarity of sememe is computed based on the relative positions in the sememe tree. Calculating of sentence similarity is based on surface similarity and semantic similarity. Under the same test conditions, the experiments show that the proposed method is much closer to the people's comprehension to the meanings of the sentences.关键词
句子相似度/知网/表层相似度/语义偏移量Key words
sentence similarity/hownet/surface similarity/semantic offset similarity分类
信息技术与安全科学引用本文复制引用
程传鹏,吴志刚..一种基于知网的句子相似度计算方法[J].计算机工程与科学,2012,34(2):172-175,4.基金项目
河南省教育厅自然科学资助项目(2008B520046) (2008B520046)