| 注册
首页|期刊导航|计算机应用与软件|基于链接关系的Web页面相似度搜索

基于链接关系的Web页面相似度搜索

靳黛露 张月琴 张明西

计算机应用与软件Issue(1):57-61,5.
计算机应用与软件Issue(1):57-61,5.DOI:10.3969/j.issn.1000-386x.2014.01.016

基于链接关系的Web页面相似度搜索

LINK RELATION-BASED WEB PAGES SIMILARITY SEARCH

靳黛露 1张月琴 1张明西2

作者信息

  • 1. 太原理工大学计算机科学与技术学院 山西 太原030024
  • 2. 复旦大学计算机科学技术学院 上海201203
  • 折叠

摘要

Abstract

Web pages similarity search plays important role in many research fields such as Web news recommendation and approximate query,etc.SimRank is a classical similarity computation model,however,it is not adaptable to large Webpage networks because its space and time cost is very high.Utilising the characteristic of SimRank in fast convergence,we propose an efficient Web pages similarity search (WSR)method.It pre-computes 1-hop iterative similarity matrix,and then conducts online computation of 2-hop iterative similarities of the given querying pages and other pages according to the computed 1-hop iterative similarity matrix.The pre-computation and online query processing efficiencies are further improved by static pruning on Web network.Experimental result shows that the WSR evidently reduces the storage cost and pre-computation time cost,and has higher accuracy and fast query responding time.

关键词

Web页面网络/相似度搜索/SimRank

Key words

Web page network/Similarity search/SimRank

分类

信息技术与安全科学

引用本文复制引用

靳黛露,张月琴,张明西..基于链接关系的Web页面相似度搜索[J].计算机应用与软件,2014,(1):57-61,5.

基金项目

山西省自然科学基金项目(2012011014-2)。 ()

计算机应用与软件

OACSCDCSTPCD

1000-386X

访问量5
|
下载量0
段落导航相关论文