首页|期刊导航|软件导刊|基于Chrome扩展的爬虫系统设计与实现

基于Chrome扩展的爬虫系统设计与实现

魏少鹏夏小玲

软件导刊2016，Vol.15Issue(3)：76-80,5.

软件导刊2016，Vol.15Issue(3)：76-80,5.DOI:10.11907/rjdk.1511495

基于Chrome扩展的爬虫系统设计与实现

Web Crawler System Based on Chrome Extension

魏少鹏 ¹夏小玲¹

作者信息

1. 东华大学计算机科学与技术学院，上海201620
折叠

摘要

Abstract

This article introduces a new crawler system based on Chromeextension in order to improve data collection effi‐ciency from web pages and reduce consumption of the system resource from crawler .This crawler system uses chrome browser to analysis web pages to prevent shielding of crawling object and asynchronous loading of web pages problems ,as well as the realization of structured data .Unattended active crawl can be achieved ,and information can be grabbed at the time when users are browsing Web pages by selecting the common user extension and server extension .Front and back is separated in the whole system ,and Program To Interface is used to cover it high expansibility .Finally ,verifying efficien‐cy and feasibility of the program by gaining premiership schedule from Sodasoccer website .

关键词

爬虫系统/Chrome扩展/Netty

Key words

Web Crawler/Chrome Extension/Netty

分类

信息技术与安全科学

引用本文复制引用

魏少鹏,夏小玲..基于Chrome扩展的爬虫系统设计与实现[J].软件导刊,2016,15(3):76-80,5.

软件导刊

ISSN：1672-7800

访问量0

下载量0

段落导航