计算机应用与软件Issue(1):57-61,5.DOI:10.3969/j.issn.1000-386x.2014.01.016
基于链接关系的Web页面相似度搜索
LINK RELATION-BASED WEB PAGES SIMILARITY SEARCH
摘要
Abstract
Web pages similarity search plays important role in many research fields such as Web news recommendation and approximate query,etc.SimRank is a classical similarity computation model,however,it is not adaptable to large Webpage networks because its space and time cost is very high.Utilising the characteristic of SimRank in fast convergence,we propose an efficient Web pages similarity search (WSR)method.It pre-computes 1-hop iterative similarity matrix,and then conducts online computation of 2-hop iterative similarities of the given querying pages and other pages according to the computed 1-hop iterative similarity matrix.The pre-computation and online query processing efficiencies are further improved by static pruning on Web network.Experimental result shows that the WSR evidently reduces the storage cost and pre-computation time cost,and has higher accuracy and fast query responding time.关键词
Web页面网络/相似度搜索/SimRankKey words
Web page network/Similarity search/SimRank分类
信息技术与安全科学引用本文复制引用
靳黛露,张月琴,张明西..基于链接关系的Web页面相似度搜索[J].计算机应用与软件,2014,(1):57-61,5.基金项目
山西省自然科学基金项目(2012011014-2)。 ()