Ontology-based similarity measure for text clustering

Yan Duanwu ¹, Li Xiaopeng ², Wang Lei ¹, Cheng Xiao ¹

Journal of Southeast University (English Edition), 2006, Vol. 22, Issue 3: 389-393, 5.

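Author information

1. Department of Information Management, Nanjing University of Science and Technology, Nanjing 210094
2. Library, Nanjing University of Science and Technology, Nanjing 210094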

Abstract

A method that combines category-based and keyword-based concepts to build a better information retrieval system is introduced. To improve document clustering, a document similarity measure is proposed that is based on the cosine vector measure and on keyword frequency in documents, and that also takes an ontology as input. The ontology is domain-specific and includes a list of keywords organized by their degree of importance to the categories of the ontology. By supplying semantic knowledge, the ontology can improve both the document similarity measure and the feedback of information retrieval systems. Two approaches to evaluating the performance of this similarity measure, and a comparison with the standard cosine vector similarity measure, are also described.
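The abstract does not give the formula, so the following is only a minimal sketch of the general idea, not the authors' measure. It assumes that the ontology can be reduced to a table of per-keyword importance weights that scale term frequencies before the cosine is taken. The names `ONTOLOGY_WEIGHTS`, `weighted_vector`, and `cosine_similarity`, the weight values, and the two toy documents are all hypothetical.

```python
import math
from collections import Counter

# Hypothetical ontology fragment: keyword -> importance weight.
# The values are illustrative assumptions, not taken from the paper.
ONTOLOGY_WEIGHTS = {"ontology": 3.0, "clustering": 2.0}


def cosine_similarity(vec_a, vec_b):
    """Standard cosine similarity between two sparse term vectors (dict-like)."""
    dot = sum(w * vec_b.get(term, 0.0) for term, w in vec_a.items())
    norm_a = math.sqrt(sum(w * w for w in vec_a.values()))
    norm_b = math.sqrt(sum(w * w for w in vec_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)


def weighted_vector(tokens, keyword_weights):
    """Scale each term frequency by its ontology weight (1.0 if not a keyword)."""
    counts = Counter(tokens)
    return {term: n * keyword_weights.get(term, 1.0) for term, n in counts.items()}


doc_a = "ontology clustering of text documents".split()
doc_b = "ontology clustering for web pages".split()

# Baseline: plain term-frequency cosine.
plain = cosine_similarity(Counter(doc_a), Counter(doc_b))

# Assumed variant: ontology keywords count for more than ordinary terms.
weighted = cosine_similarity(
    weighted_vector(doc_a, ONTOLOGY_WEIGHTS),
    weighted_vector(doc_b, ONTOLOGY_WEIGHTS),
)

print(f"plain cosine:    {plain:.4f}")
print(f"weighted cosine: {weighted:.4f}")
```

On these two toy documents the plain cosine is about 0.40 and the weighted one about 0.81, because the shared terms are exactly the ones the assumed ontology marks as important. How the paper actually organizes keywords by category and importance may differ from this flat weight table.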

Key words

similarity measure/text clustering/ontology/information retrieval system

Classification

Information technology and security science

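Cite this article

Yan Duanwu, Li Xiaopeng, Wang Lei, Cheng Xiao. Ontology-based similarity measure for text clustering[J]. Journal of Southeast University (English Edition), 2006, 22(3): 389-393, 5.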

Funding

The Young Teachers Scientific Research Foundation (YTSRF) of Nanjing University of Science and Technology, 2005-2006.

Journal of Southeast University (English Edition)

ISSN: 1003-7985
