东南大学学报(英文版)2006,Vol.22Issue(3):324-329,6.
一种基于概念的信息检索方法
Concept-based approach for information retrieval
摘要
Abstract
A concept-based approach is expected to resolve the word sense ambiguities in information retrieval and apply the semantic importance of the concepts,instead of the term frequency,to representing the contents of a document.Consequently,a formalized document framework is proposed.The document framework is used to express the meaning of a document with the concepts which are expressed by high semantic importance.The framework consists of two parts:the "domain" information and the "situation & background" information of a document.A document-extracting algorithm and a two-stage smoothing method are also proposed.The quantification of the similarity between the query and the document framework depends on the smoothing method.The experiments on the TREC6 collection demonstrate the feasibility and effectiveness of the proposed approach in information retrieval tasks.The average recall level precision of the model using the proposed approach is about 10% higher than that of traditional ones.关键词
信息检索/概念/语义知识/内容表示Key words
information retrieval/concept/semantic knowledge/content representation分类
信息技术与安全科学引用本文复制引用
吴晨,张全,贾宁..一种基于概念的信息检索方法[J].东南大学学报(英文版),2006,22(3):324-329,6.基金项目
The National Basic Research Program of China (973 Program)(No.2004CB318104),the Knowledge Innovation Program of Chinese Academy of Sciences (No.13CX04). (973 Program)