| 注册
首页|期刊导航|数字图书馆论坛|基于概念向量的文本语义相似度方法探索

基于概念向量的文本语义相似度方法探索

郭红梅 袁国华 胡正银

数字图书馆论坛Issue(6):39-45,7.
数字图书馆论坛Issue(6):39-45,7.DOI:10.3772/j.issn.1673-2286.2017.06.006

基于概念向量的文本语义相似度方法探索

Measurement of Text Semantic Similarity on the Basis of Concept Vector

郭红梅 1袁国华 1胡正银2

作者信息

  • 1. 中国科学院文献情报中心,北京100190
  • 2. 中国科学院成都文献情报中心,成都610041
  • 折叠

摘要

Abstract

Based on the previous studies on the concept semantic similarity, this paper proposed measurement of text semantic similarity on the basis of concept vector. First, mining the concepts or terms from the texts. Second, transforming concepts or terms into concept vector followed by hierarchical structure of vocabulary. At last, measuring the sematic similarity of concepts or terms and further measuring the text semantic similarity. The paper used TREC-05 genomics track data to experiment. The results showed that the method of text semantic similarity on the basis of concept vector was bet er than cosine, which was more closely to expert evaluation result.

关键词

概念向量/语义相似度/文本相似度

Key words

Concept Vector/Semantic Similarity/Text Similarity

分类

社会科学

引用本文复制引用

郭红梅,袁国华,胡正银..基于概念向量的文本语义相似度方法探索[J].数字图书馆论坛,2017,(6):39-45,7.

基金项目

*本研究得到ISTIC-EBSCO文献大数据发现服务联合实验室基金项目"基于clique子团聚类的文本主题识别方法研究"资助. ()

数字图书馆论坛

OACSSCICSTPCD

1673-2286

访问量0
|
下载量0
段落导航相关论文