Software Guide, 2016, Vol. 15, Issue (3): 76-80, 5. DOI: 10.11907/rjdk.1511495
Design and Implementation of a Crawler System Based on Chrome Extension
Web Crawler System Based on Chrome Extension
Wei Shaopeng 1, Xia Xiaoling 1
Author information
- 1. School of Computer Science and Technology, Donghua University, Shanghai 201620
Abstract
This article introduces a new crawler system based on a Chrome extension, designed to improve the efficiency of data collection from web pages and to reduce the system resources the crawler consumes. The system uses the Chrome browser to parse web pages, which prevents the crawler from being blocked by the target site, handles asynchronously loaded pages, and yields structured data. By choosing between the common user extension and the server extension, the system supports both unattended active crawling and the capture of information while users are browsing web pages. The front end and back end are separated throughout the system, and programming to interfaces gives it high extensibility. Finally, the efficiency and feasibility of the approach are verified by collecting the premiership schedule from the Sodasoccer website.
Key words
Web Crawler/Chrome Extension/Netty
Classification
Information Technology and Security Science
Cite this article
Wei Shaopeng, Xia Xiaoling. Design and Implementation of a Crawler System Based on Chrome Extension [J]. Software Guide, 2016, 15(3): 76-80, 5.