计算机与现代化Issue(2):128-130,3.DOI:10.3969/j.issn.1006-2475.2012.02.034
一种基于单模型的网页净化方法
A Method of Web Page Purification Based on Single Model
干文敏 1李俊 1李剑2
作者信息
- 1. 南京航空航天大学计算机科学与技术学院,江苏南京 210016
- 2. 南昌陆军学院战斗实验室,江西南昌 330103
- 折叠
摘要
Abstract
In order to obtain and handle with the information in Web pages effectively, this paper proposes the algorithm of Web page purification based on improved DOM tree and BP neural network. This algorithm establishes block tree by DOM tree and Web content using HTMLPareer. Because of the evident numerical characteristics in sub-blocks of Web-pages, it can establish noisy purify-model by BP neural network. As a result, it can make the Web-page purification more modelling, also it can get a more effective result.关键词
网页净化/DOM树/内容块/神经网络Key words
Web page purification/DOM tree/content block/neural network分类
信息技术与安全科学引用本文复制引用
干文敏,李俊,李剑..一种基于单模型的网页净化方法[J].计算机与现代化,2012,(2):128-130,3.