计算机工程Issue(12):177-181,5.DOI:10.3969/j.issn.1000-3428.2014.12.033
基于《知网》的词语语义相似度算法
Word Semantic Similarity Algorithm Based on HowNet
摘要
Abstract
The word semantic similarity computation is widely used in information retrieval,text clustering,word sense disambiguation,etc. This paper proposes an improved method of word semantic similarity computation based on HowNet. A new sememe classification is proposed,and sememe is divided into first basic sememe,other basic sememe and indirect sememe. A new variable coefficient of homonym similarity computation is proposed according to the effect of different sememes. Unlike previous sense similarity calculation method,according to the influence of different sememes to sense similarity calculation,different sememes similarity calculation method of sense similarity is proposed in this paper. It uses the highest item combination of the first basic sememe to calculate the word semantic similarity and removes other combinations with lower similarity. Experimental results show that the improved method effectively improves computational efficiency and precision of word semantic similarity.关键词
义原/义项/词语语义相似度/知识描述语言Key words
sememe/homonym/word semantic similarity/knowledge representation language分类
信息技术与安全科学引用本文复制引用
王小林,王东,杨思春,邰伟鹏,郑啸..基于《知网》的词语语义相似度算法[J].计算机工程,2014,(12):177-181,5.基金项目
国家自然科学基金资助项目(61003311) (61003311)
安徽省高校省级自然科学基金资助项目(KJ2011A040)。 (KJ2011A040)