Frontiers of Data & Computing, 2024, Vol. 6, Issue (2): 156-164 (9 pages). DOI: 10.11871/jfdc.issn.2096-742X.2024.02.014
耦合异构性的科学数据中心数据总量指标评价分析
Evaluation and Analysis of Data Volume Index Coupled with Heterogeneity in Scientific Data Centers
Abstract
[Objective] Data volume is an important index for measuring the resource integration and service capability of scientific data centers. However, because scientific data centers differ in discipline, level, affiliated system, construction time, and other background factors, directly comparing data volume across centers is unfair. [Methods] Using the data resource volumes collected by scientific data centers, together with data on the related heterogeneity factors gathered from public service platforms such as "China Science and Technology Resource Sharing Net" and "Science Data Center of CAS", this article quantifies the heterogeneity factors with dummy variables and analyzes their impact on data volume through correlation analysis. A panel data model of data volume coupled with the heterogeneity of scientific data centers is then constructed using hypothesis testing, the least squares dummy variable (LSDV) method, and other statistical methods. [Results] The proposed model eliminates the differences in the data volume index caused by the heterogeneity of scientific data centers and enables a differentiated horizontal comparison of data volume across various types of scientific data centers. [Conclusions] This study proposes, for the first time, a heterogeneity adjustment method for the data volume of scientific data centers, which provides an important reference for the systematic and scientific evaluation of scientific data centers.
Keywords
scientific data centers/heterogeneity/data volume/panel data model/least squares dummy variable method
Cite this article
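GAO Mengxu, WANG Yueyue, WU Xinqian, CHEN Zugang, SHI Lei, WANG Ruidan. Evaluation and Analysis of Data Volume Index Coupled with Heterogeneity in Scientific Data Centers [J]. Frontiers of Data & Computing, 2024, 6(2): 156-164.
Funding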
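National Natural Science Foundation of China, General Program: "Research on Evaluation Methods for Scientific Data Centers Based on Dynamic and Heterogeneous Scenarios" (72074017)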