计算机应用与软件2012,Vol.29Issue(2):282-284,3.
一种改进的基于向量空间文本相似度算法的研究与实现
RESEARCH AND IMPLEMENTATION OF AN IMPROVED VSM-BASED TEXT SIMILARITY ALGORITHM
摘要
Abstract
Aiming at the shortcoming of traditional VSM-based text similarity algorithm, an improved algorithm of text similarity is proposed in this paper. It fully takes into account the effect of same feature words between texts on the similarity of text, therefore effectively reduces the interference of the texts with lower similarity. Simulative experiment and system running results have attested the new algorithm in its effectiveness and accuracy.关键词
向量空间/文本相似度/特征词/覆盖度Key words
Vector space/Test similarity/Feature words/Coverage分类
信息技术与安全科学引用本文复制引用
李连,朱爱红,苏涛..一种改进的基于向量空间文本相似度算法的研究与实现[J].计算机应用与软件,2012,29(2):282-284,3.