Date Added: Mar 2011
The design and implementation of an ETL (Extract-TransformLoad) process for a data warehouse proceeds from a conceptual model to a logical model, and then a physical model and implementation. The conceptual model conveys at a high level the data sources and targets, and the transformation steps from sources to targets. The current state of the art is to express the conceptual model informally using text descriptions and diagrams. This makes the process of deriving a logical model time-consuming and error-prone. The authors' work is towards a system that covers the whole ETL life-cycle by injecting several layers of optimization and validation throughout the whole process starting with the business level objectives and ending with flow execution.