Computer Engineering, 2012, Vol. 38, Issue 23: 47-50, 4. DOI: 10.3969/j.issn.1000-3428.2012.23.011
Study of Real-time Heterogeneous Data Integration Model Based on Reverse Cleaning
Abstract
Heterogeneous data integration faces two problems: keeping the target data updated in real time, and the quality of the source data itself. To address them, this paper presents a real-time heterogeneous data integration model based on reverse data cleaning, built on adapters, XML, and reverse data cleaning technology. The model processes heterogeneous data in two main ways. On the one hand, real-time threads extract, clean, and save original data that has been newly added or modified. On the other hand, a reverse cleaning process uses valid data held in the platform or obtained through integration to fix errors and missing values in the original data. Experimental results show that the model improves the quality of the target data and the original data simultaneously.
Key words
heterogeneous data / data integration / reverse cleaning / Extract, Transform, Load (ETL) process / adapter / data quality
Classification
Information Technology and Security Science
Cite this article
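Tang Yu, Chen Hao, Ye Bailong. Study of Real-time Heterogeneous Data Integration Model Based on Reverse Cleaning [J]. Computer Engineering, 2012, 38(23): 47-50, 4.
Funding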
National Natural Science Foundation of China (61070194)
National Innovation Fund (11C26214305383)