现代信息科技2024,Vol.8Issue(1):54-58,5.DOI:10.19850/j.cnki.2096-4706.2024.01.011
铁路数据分布式湖仓一体架构分析与设计
Analysis and Design of Railway Data Distributed Lake Warehouse Integrated Architecture
摘要
Abstract
A scientific and reasonable data resource classification method and an effective data lake architecture system can support the efficient storage,organization,and utilization of railway full business data,and further support and optimize various operational businesses.This paper first provides a brief analysis of the existing data lake architecture,determining the concept of integrated lake and warehouse,and categorizing railway data by theme to meet business processing needs;secondly,a railway data distributed lake warehouse integrated architecture is designed,elaborating on the architecture and functions of the sub lake warehouses at the railway bureau level and the overall lake warehouses of China Railway Group,as well as the data flow process between the two;finally,the characteristics and existing problems of the designed architecture are analyzed,providing a reference for further constructing an effective railway operation data lake.关键词
铁路大数据/数据治理/数据湖/湖仓一体/分布式架构Key words
railway big data/data governance/data lake/integrated lake and warehouse/distributed architecture分类
信息技术与安全科学引用本文复制引用
李国华,邹丹,李海军,孙思齐,王建强..铁路数据分布式湖仓一体架构分析与设计[J].现代信息科技,2024,8(1):54-58,5.基金项目
中国国家铁路集团有限公司科技研究开发计划课题(P2021S012) (P2021S012)