东南大学学报(英文版)2006,Vol.22Issue(3):389-393,5.
文本聚类中基于本体的相似性测度
Ontology-based similarity measure for text clustering
摘要
Abstract
A method that combines category-based and keyword-based concepts for a better information retrieval system is introduced.To improve document clustering,a document similarity measure based on cosine vector and keywords frequency in documents is proposed,but also with an input ontology.The ontology is domain specific and includes a list of keywords organized by degree of importance to the categories of the ontology,and by means of semantic knowledge,the ontology can improve the effects of document similarity measure and feedback of information retrieval systems.Two approaches to evaluating the performance of this similarity measure and the comparison with standard cosine vector similarity measure are also described.关键词
相似性测度/文本聚类/本体/信息检索系统Key words
similarity measure/text clustering/ontology/information retrieval system分类
信息技术与安全科学引用本文复制引用
颜端武,李晓鹏,王磊,成晓..文本聚类中基于本体的相似性测度[J].东南大学学报(英文版),2006,22(3):389-393,5.基金项目
The Young Teachers Scientific Research Foundation (YTSRF) of Nanjing University of Science and Technology in the Year of 2005-2006. (YTSRF)