计算机应用与软件2016,Vol.33Issue(3):43-47,5.DOI:10.3969/j.issn.1000-386x.2016.03.009
一种基于 Wikipedia 的词汇语义关联度计算方法
A WIKIPEDIA-BASED LEXICAL SEMANTIC RELATEDNESS CALCULATION METHOD
摘要
Abstract
Calculating the semantic relatedness between words is one of the key issues of information retrieval and natural language processing,for this issue,we presented WGR,an improved semantic relatedness calculation method based on Wikipedia.The method uses Wikipedia dataset as the background knowledge base,integrates on the basis of traditional method the layout information in Wikipedia articles,and processes the backward link and forward link of Wiki concepts with different methods.Besides,it introduces the resources of Google search,after classification and sieving,it uses LDA modelling to calculate the semantic relatedness,and finally integrates the results from two datasets to get WGR semantic relatedness.Through experimental analysis,WGR achieves better accuracy in comparison with existing algorithms.关键词
语义关联度/文章网络/布局信息/维基百科/隐含狄利克雷分布/谷歌Key words
Semantic relatedness/Article referenced network/Layout information/Wikipedia/Latent Dirichlet allocation (LDA)/Google分类
信息技术与安全科学引用本文复制引用
汪志伟,朱福喜,刘世超..一种基于 Wikipedia 的词汇语义关联度计算方法[J].计算机应用与软件,2016,33(3):43-47,5.基金项目
国家自然科学基金项目(61272277)。 ()