Digital Library Forum (数字图书馆论坛), Issue (6): 39-45, 7. DOI: 10.3772/j.issn.1673-2286.2017.06.006
Measurement of Text Semantic Similarity on the Basis of Concept Vector (基于概念向量的文本语义相似度方法探索)
Abstract
Building on previous studies of concept semantic similarity, this paper proposes a method for measuring text semantic similarity based on concept vectors. First, concepts or terms are mined from the texts. Second, the concepts or terms are transformed into concept vectors according to the hierarchical structure of the vocabulary. Finally, the semantic similarity of the concepts or terms is measured, and the text semantic similarity is computed from it. Experiments were run on the TREC-05 genomics track data. The results show that the concept-vector method of text semantic similarity performs better than cosine similarity and agrees more closely with expert evaluation.
Keywords
Concept Vector/Semantic Similarity/Text Similarity
Classification
Social Sciences
Cite this article
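Guo Hongmei, Yuan Guohua, Hu Zhengyin. Measurement of Text Semantic Similarity on the Basis of Concept Vector [J]. Digital Library Forum, 2017, (6): 39-45, 7.
Funding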
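*This research was funded by the ISTIC-EBSCO Joint Laboratory for Literature Big Data Discovery Services, under the fund project "Research on a Text Topic Identification Method Based on Clique Subgroup Clustering".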
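The pipeline outlined in the abstract (concepts, then concept vectors derived from a vocabulary hierarchy, then concept similarity, then text similarity) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy hierarchy `PARENT`, the choice to encode a concept as the set of nodes on its path to the root, the use of cosine between those path vectors as the concept similarity, and the best-match averaging used for text similarity. The paper's actual weighting and aggregation may differ.

```python
import math

# Hypothetical toy vocabulary hierarchy (child -> parent); not from the paper.
PARENT = {
    "disease": None,
    "neoplasm": "disease",
    "carcinoma": "neoplasm",
    "sarcoma": "neoplasm",
    "infection": "disease",
}


def concept_vector(concept):
    """Return the set of nodes on the path from the concept up to the root.

    The set acts as a sparse binary vector: a node has value 1 if it is
    in the set and 0 otherwise. (Assumed encoding, for illustration.)
    """
    path = set()
    node = concept
    while node is not None:
        path.add(node)
        node = PARENT[node]
    return path


def concept_similarity(a, b):
    """Cosine similarity between the binary path vectors of two concepts."""
    va, vb = concept_vector(a), concept_vector(b)
    return len(va & vb) / math.sqrt(len(va) * len(vb))


def text_similarity(concepts_a, concepts_b):
    """Symmetric average of each concept's best match in the other text."""
    if not concepts_a or not concepts_b:
        return 0.0

    def directed(src, dst):
        best = [max(concept_similarity(s, d) for d in dst) for s in src]
        return sum(best) / len(src)

    return (directed(concepts_a, concepts_b) + directed(concepts_b, concepts_a)) / 2


def term_cosine(concepts_a, concepts_b):
    """Baseline: cosine over binary bag-of-terms vectors (exact matches only)."""
    a, b = set(concepts_a), set(concepts_b)
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))
```

In this toy setting, two texts mentioning only `carcinoma` and only `sarcoma` share no terms, so `term_cosine` returns 0, while `text_similarity` returns 2/3 because the two concepts share the ancestors `neoplasm` and `disease`. This only illustrates why a hierarchy-aware measure can differ from plain cosine; it does not reproduce the paper's experiment.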