| 注册
首页|期刊导航|计算机应用研究|数据湖技术研究综述

数据湖技术研究综述

蔡珉官 王朋

计算机应用研究2023,Vol.40Issue(12):3529-3538,10.
计算机应用研究2023,Vol.40Issue(12):3529-3538,10.DOI:10.19734/j.issn.1001-3695.2023.05.0173

数据湖技术研究综述

Survey of data lake technology research

蔡珉官 1王朋2

作者信息

  • 1. 延边大学信息化中心,吉林延吉 133002
  • 2. 延边大学工学院,吉林延吉 133002
  • 折叠

摘要

Abstract

Traditional data storage technologies are no longer suitable for data analysis and application in the era of big data.The emergence of the concept of data lake effectively solves the problems of high data storage costs,low flexibility,and hetero-geneous data diversification.Currently,the research on data lake is still in the early stage,and there is a lack of comprehensive research and discussion covering the entire process of data processing.In order to understand data lake technology more comprehensively,this paper reviewed the research results of data lake technology in recent years.Firstly,it sorted out the deve-lopment history and concepts of data lake,and compared them with other similar concepts.Secondly,it investigated the data lake architecture,and divided the key technologies of the data lake into storage,data ingestion,data maintenance,data exploration,and data governance according to the architecture of characteristics.It analyzed and discussed the latest research progress,technical solutions,research deficiencies,and future research directions of key technologies.Finally,it investigated the typical applications of data lake in various application fields,providing references for implementers of data lake in various industries.

关键词

数据湖/元数据管理/数据组织/数据发现/数据探索

Key words

data lake/metadata management/data organization/data discovery/data exploration

分类

信息技术与安全科学

引用本文复制引用

蔡珉官,王朋..数据湖技术研究综述[J].计算机应用研究,2023,40(12):3529-3538,10.

基金项目

吉林省教育厅基金资助项目(JJKH20220540CY,JJKH20230622KJ) (JJKH20220540CY,JJKH20230622KJ)

计算机应用研究

OA北大核心CSCDCSTPCD

1001-3695

访问量0
|
下载量0
段落导航相关论文