Journal of Southeast University (English Edition), 2008, Vol. 24, Issue 3: 312-314, 3.
Latent semantic analysis for query interfaces of deep web sites
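Mao Qinjiao (茅琴娇) 1, Feng Boqin (冯博琴) 1, Pan Shanliang (潘善亮) 2
Author information
- 1. Department of Computer Science and Technology, Xi'an Jiaotong University, Xi'an 710049
- 2. College of Information Science and Engineering, Ningbo University, Ningbo 315211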
Abstract
Latent semantic analysis is a simple and effective way to further improve the efficiency of search engines and to give them the ability to search, index and locate information in the deep web. Through latent semantic analysis of the attributes in the query interfaces and the unique entrances of deep web sites, the hidden semantic structure information can be retrieved and a certain degree of dimension reduction can be achieved. Using this semantic structure information, the contents of a site can be inferred and the similarity measures among deep web sites can be revised. Experimental results show that latent semantic analysis revises and improves the semantic understanding of query forms in the deep web, overcoming the shortcomings of keyword-based methods. The approach can be used to search effectively for the site most similar to any given site and to obtain a list of sites that conform to user-specified restrictions.
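The idea summarized in the abstract can be illustrated with a minimal sketch: build an attribute-by-site matrix from query-interface attributes, truncate its singular value decomposition, and compare sites in the reduced space. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy attributes and site names, the binary weighting, the choice of k = 2, the use of cosine similarity, and the helper names `latent_site_vectors`, `cosine_similarity_matrix` and `most_similar_site`.

```python
import numpy as np

# Hypothetical toy data (not from the paper): rows are query-interface
# attributes, columns are deep web sites; 1 means the attribute appears
# in that site's query form.
attributes = ["title", "author", "isbn", "make", "model", "price"]
sites = ["books_a", "books_b", "cars_a", "cars_b"]
A = np.array([
    [1, 1, 0, 0],  # title
    [1, 0, 0, 0],  # author
    [0, 1, 0, 0],  # isbn
    [0, 0, 1, 1],  # make
    [0, 0, 1, 0],  # model
    [1, 1, 1, 1],  # price
], dtype=float)


def latent_site_vectors(matrix, k):
    """Return one k-dimensional latent vector per site (per column)."""
    # Thin SVD: matrix = U @ diag(s) @ Vt, singular values in descending order.
    _, s, vt = np.linalg.svd(matrix, full_matrices=False)
    # Keep the k largest singular values; scale the site coordinates by them.
    return (np.diag(s[:k]) @ vt[:k]).T


def cosine_similarity_matrix(vectors):
    """Pairwise cosine similarity between the rows of `vectors`."""
    norms = np.linalg.norm(vectors, axis=1, keepdims=True)
    unit = vectors / norms
    return unit @ unit.T


def most_similar_site(similarity, index):
    """Index of the site most similar to `index`, excluding itself."""
    row = similarity[index].copy()
    row[index] = -np.inf
    return int(np.argmax(row))


site_vecs = latent_site_vectors(A, k=2)
sim = cosine_similarity_matrix(site_vecs)
best = most_similar_site(sim, sites.index("books_a"))
print(sites[best])
```

In this toy matrix the two book sites share only some of their attributes, so a plain keyword overlap would rate them as partially similar; in the rank-2 latent space they land on the same direction, which is the kind of revised similarity measure the abstract describes.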
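Keywords
deep web / information retrieval / latent semantic analysis / singular value decomposition
Classification
Information technology and security science
Cite this article
Mao Qinjiao, Feng Boqin, Pan Shanliang. Latent semantic analysis for query interfaces of deep web sites [J]. Journal of Southeast University (English Edition), 2008, 24(3): 312-314, 3.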