We use cookies to improve your experience with our site.

一种基于数据虚拟化的新型ETL方法

A New ETL Approach Based on Data Virtualization

  • 摘要: 基于传统ETL的数据仓库通常使用数据源层、临时缓存层、数据集成层、数据访问层来满足不同用户的需求。在构建数据仓库时,ETL过程非常耗时,所以有必要改善其性能。本文提出了一种新型的ETL方法,称之为TEL(转换-抽取-装载)。该TEL方法利用虚拟表,在数据抽取和装载之前完成虚拟转换任务,避免应用临时数据存储区来暂存抽取过来的多源数据。此外,该TEL方法可以通过数据对象化,把数据缓存到内存中来进一步改善查询性能。通过构建基准数据对TEL性能进行了评测,实验结果表明TEL方法是可行的、实用的。

     

    Abstract: ETL (Extract-Transform-Load) usually includes three phases: extraction, transformation, and loading. In building data warehouse, it plays the role of data injection and is the most time-consuming activity. Thus it is necessary to improve the performance of ETL. In this paper, a new ETL approach, TEL (Transform-Extract-Load) is proposed. The TEL approach applies virtual tables to realize the transformation stage before extraction stage and loading stage, without data staging area or staging database which stores raw data extracted from each of the disparate source data systems. The TEL approach reduces the data transmission load, and improves the performance of query from access layers. Experimental results based on our proposed benchmarks show that the TEL approach is feasible and practical

     

/

返回文章
返回