一种基于网页信息抽取的OA期刊资源采集方法研究

Huang Zheng (黄政), Zhang Xuefu (张学福)

Digital Library Forum (数字图书馆论坛), 2017, Issue (5): 25-32, 8. DOI: 10.3772/j.issn.1673-2286.2017.05.004

A Research on Open Access Journal Resource Acquisition Method Based on Web Information Extraction

Huang Zheng (黄政)¹, Zhang Xuefu (张学福)¹

Author information

  • 1. Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081

Abstract

Open access (OA) journal resources have important academic value. However, some OA journals do not support the OAI-PMH protocol, so their resources cannot be harvested through it. Based on the characteristics of OA journal resources, this paper proposes an acquisition strategy that does not rely on OAI-PMH. From the perspective of web resource description, it summarizes the characteristics of OA journal resources and classifies them. After assessing how well web information collection methods apply to OA journal resources, it proposes an acquisition method based on extracting metadata from OA journal web pages and designs an acquisition system. In an empirical study of 10 domestic and foreign OA journals that do not provide OAI-PMH, a total of 45,785 papers were collected, indicating that the method can be applied effectively to such resources. The research enriches the acquisition methods available for OA journals and offers guidance for harvesting OA journals that do not follow the OAI-PMH protocol.
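
To make the idea of web metadata extraction concrete, the following is a minimal sketch and not the authors' system. It assumes, hypothetically, that an article page exposes its metadata in `<meta name="citation_*">` tags (a convention many journal sites use for scholarly indexing); the abstract does not say which page elements the paper actually targets. The names `MetaTagExtractor`, `extract_article_metadata`, and `SAMPLE_PAGE` are illustrative only.

```python
from html.parser import HTMLParser


class MetaTagExtractor(HTMLParser):
    """Collects <meta name="citation_*" content="..."> tags from one page.

    Assumption: the page uses citation_* meta tags; real journal sites
    may need site-specific extraction rules instead.
    """

    def __init__(self):
        super().__init__()
        self.metadata = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        name = attr.get("name") or ""
        content = attr.get("content")
        if name.startswith("citation_") and content:
            # Repeated fields such as citation_author accumulate in a list.
            self.metadata.setdefault(name, []).append(content.strip())


def extract_article_metadata(html):
    """Return a dict mapping each citation_* field to its list of values."""
    parser = MetaTagExtractor()
    parser.feed(html)
    return parser.metadata


# Hypothetical article page used only for illustration.
SAMPLE_PAGE = """
<html><head>
<meta name="citation_title" content="An Example Article">
<meta name="citation_author" content="Author One">
<meta name="citation_author" content="Author Two">
<meta name="citation_journal_title" content="Example Journal">
<meta name="viewport" content="width=device-width">
</head><body></body></html>
"""

if __name__ == "__main__":
    print(extract_article_metadata(SAMPLE_PAGE))
```

In a collection pipeline of the kind the abstract describes, a function like this would run on each article page after it has been fetched; pages without such tags would need per-site rules.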

Key words

Open Access Journal / Open Access Journal Resource Acquisition / Web Information Acquisition / Open Access Journal Resource Acquisition System

Classification

Social sciences

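Cite this article

Huang Zheng, Zhang Xuefu. A Research on Open Access Journal Resource Acquisition Method Based on Web Information Extraction [J]. Digital Library Forum, 2017, (5): 25-32, 8.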

Digital Library Forum (数字图书馆论坛)

OA / CSSCI / CSTPCD

ISSN 1673-2286
