Computer and Modernization, Issue (11): 35-38, 4. DOI: 10.3969/j.issn.1006-2475.2012.11.010
Clustering Algorithm Based on Search Results
Optimization of Search Results Based on Clustering Algorithm
罗钊航 (Luo Zhaohang) 1, 李旭伟 (Li Xuwei) 1
Author information
- 1. College of Computer Science, Sichuan University, Chengdu, Sichuan 610065
Abstract
Search engine results today contain many redundant pages and are not classified. This paper proposes an optimization algorithm for web search results, based on an improved DBSCAN (density-based spatial clustering of applications with noise) algorithm, that clusters and classifies the results. The algorithm selects, from all search results, the web pages whose search weight exceeds a given threshold, extracts each page's feature values and candidate keywords, and compares page similarity to eliminate as many duplicate and redundant pages as possible. It also classifies the pages according to their candidate keywords, which can improve the precision of the search engine and user satisfaction with it.
Keywords
DBSCAN algorithm / page similarity / clustering / redundant pages
Classification
Information Technology and Security Science
Cite this article
罗钊航 (Luo Zhaohang), 李旭伟 (Li Xuwei). Clustering Algorithm Based on Search Results [J]. Computer and Modernization, 2012, (11): 35-38, 4.
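The pipeline outlined in the abstract (filter by search weight, drop near-duplicate pages, then cluster and label by candidate keywords) can be illustrated with a minimal sketch. This is not the paper's improved DBSCAN: it uses the textbook DBSCAN procedure with Jaccard distance over keyword sets, and the page fields (`url`, `weight`, `keywords`), the function names, and all thresholds are hypothetical choices made only for illustration.

```python
from collections import Counter


def jaccard(a, b):
    """Jaccard similarity of two keyword sets (1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def filter_and_dedupe(pages, min_weight, dup_threshold):
    """Keep pages with weight >= min_weight, then drop near-duplicates.

    Pages are visited from highest to lowest weight, so the best-ranked
    copy of a duplicated page is the one that survives.
    """
    kept = []
    for page in sorted(pages, key=lambda p: p["weight"], reverse=True):
        if page["weight"] < min_weight:
            continue
        if any(jaccard(page["keywords"], k["keywords"]) >= dup_threshold
               for k in kept):
            continue
        kept.append(page)
    return kept


def dbscan(points, eps, min_pts, dist):
    """Textbook DBSCAN; returns one label per point, -1 meaning noise."""
    unseen, noise = None, -1
    labels = [unseen] * len(points)

    def neighbors(i):
        # A point counts as its own neighbor (distance 0).
        return [j for j in range(len(points))
                if dist(points[i], points[j]) <= eps]

    cluster = 0
    for i in range(len(points)):
        if labels[i] is not unseen:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = noise
            continue
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == noise:
                labels[j] = cluster      # former noise becomes a border point
            if labels[j] is not unseen:
                continue
            labels[j] = cluster
            reachable = neighbors(j)
            if len(reachable) >= min_pts:  # j is a core point: keep expanding
                queue.extend(reachable)
        cluster += 1
    return labels


def cluster_pages(pages, eps, min_pts):
    """Cluster pages and name each cluster after its most frequent keyword."""
    keyword_sets = [p["keywords"] for p in pages]
    labels = dbscan(keyword_sets, eps, min_pts,
                    lambda a, b: 1.0 - jaccard(a, b))
    members = {}
    for page, label in zip(pages, labels):
        members.setdefault(label, []).append(page)
    named = {}
    for label, group in members.items():
        if label == -1:
            name = "unclustered"
        else:
            counts = Counter(k for p in group for k in sorted(p["keywords"]))
            name = counts.most_common(1)[0][0]
        named.setdefault(name, []).extend(p["url"] for p in group)
    return named
```

A caller would pass a list of dictionaries such as `{"url": ..., "weight": ..., "keywords": {...}}` to `filter_and_dedupe` and feed the surviving pages to `cluster_pages`. Pages that DBSCAN marks as noise are grouped under `"unclustered"`, and two clusters that happen to share the same top keyword are merged under that one name.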