Computer Applications and Software, 2011, Vol. 28, Issue (1): 272-273, 301, 3.
STUDY AND IMPROVEMENT ON LINKAGE SIMILARITY-BASED WEB MINING ALGORITHM
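Yang Yifan (杨益凡) 1, Zhu Ming (朱明) 1, Li Huahu (李华虎) 1
Author information
- 1. School of Computer Science and Technology, Donghua University, Shanghai 201620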
Abstract
Working from the classification of Web mining patterns, this paper studies and analyzes HITS, a Web structure mining algorithm based on link analysis. When HITS builds its expanded set from the root set, it considers only the in-links and out-links of pages and ignores how similar the linked pages are. To address this shortcoming, an improved algorithm, DS-HITS, is proposed. It introduces several kinds of weights reflecting page similarity into the processing of the expanded set, so that the hub (core) values and authority values of the acquired pages are improved significantly. Finally, the search results of DS-HITS and HITS are compared on the initial data of Webla's open source project.
Keywords
Web mining / HITS algorithm / DS-HITS algorithm
Cite this article
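Yang Yifan, Zhu Ming, Li Huahu. Study and Improvement on Linkage Similarity-Based Web Mining Algorithm [J]. Computer Applications and Software, 2011, 28(1): 272-273, 301, 3.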
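
As a rough illustration of the idea summarized in the abstract, the sketch below runs the standard HITS hub/authority iteration and optionally scales each link by a similarity weight. The abstract does not give the DS-HITS weight formulas, so the `weights` mapping, the function names `hits` and `_normalize`, and the toy graph are all assumptions made for illustration; this is not the authors' implementation.

```python
import math


def hits(edges, weights=None, iterations=50):
    """Return (hub, authority) score dicts for a directed link graph.

    edges:   iterable of (source, target) page pairs.
    weights: optional dict mapping (source, target) to a similarity weight.
             Hypothetical stand-in for the DS-HITS weights; edges missing
             from the dict default to 1.0, which gives plain HITS.
    """
    edges = list(edges)
    weights = weights or {}
    nodes = {n for edge in edges for n in edge}
    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}

    for _ in range(iterations):
        # Authority update: weighted sum of hub scores of pages linking in.
        new_auth = {n: 0.0 for n in nodes}
        for s, t in edges:
            new_auth[t] += weights.get((s, t), 1.0) * hub[s]
        # Hub update: weighted sum of authority scores of pages linked to.
        new_hub = {n: 0.0 for n in nodes}
        for s, t in edges:
            new_hub[s] += weights.get((s, t), 1.0) * new_auth[t]
        # Rescale both score vectors to unit Euclidean length.
        auth = _normalize(new_auth)
        hub = _normalize(new_hub)
    return hub, auth


def _normalize(scores):
    """Scale scores to unit Euclidean norm (left unchanged if all zero)."""
    norm = math.sqrt(sum(v * v for v in scores.values())) or 1.0
    return {n: v / norm for n, v in scores.items()}


if __name__ == "__main__":
    links = [("a", "c"), ("b", "c"), ("a", "d")]
    hub, auth = hits(links)
    # Down-weight the a -> d link, as a similarity weight might.
    hub_w, auth_w = hits(links, weights={("a", "d"): 0.1})
    print(sorted(auth.items()))
    print(sorted(auth_w.items()))
```

In this toy graph, lowering the weight of one link reduces the authority score of the page it points to, which is the kind of effect a similarity weight is meant to have on pages in the expanded set.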