Modelling of Data Extraction in ETL Processes Using UML 2.0
The topic of data warehousing encompasses architectures, algorithms, and tools for bringing together selected data from multiple databases or other information sources into a single repository called a data warehouse. Extraction-transformation-loading (ETL) tools are pieces of software responsible for extracting data from several sources, cleaning and customising it, and inserting it into a data warehouse. This paper proposes an object-oriented approach to modelling the data extraction stage of the ETL process. The data extraction scenario consists of a data staging area, heterogeneous information sources, wrappers, monitors, an integrator, and a source identifier. All of the aforementioned entities have been modelled using Unified Modelling Language (UML) 2.0. A banking system is used as an application to illustrate the modelling.
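
To make the extraction scenario concrete, the entities listed above can be sketched as plain classes. This is only an illustrative sketch, not the paper's UML model: the responsibilities assigned to each class (a wrapper translating source-specific records into a common layout, a monitor detecting changed records, an integrator collecting records in the staging area) are assumptions drawn from common data warehousing usage, and every class, field, and method name is hypothetical. The source identifier is reduced here to a `source_id` tag on each record.

```python
from dataclasses import dataclass, field
from typing import Dict, List

Record = Dict[str, object]


@dataclass
class Wrapper:
    """Assumed role: translate one source's native records into a common layout."""
    source_id: str
    field_map: Dict[str, str]  # native field name -> common field name

    def translate(self, native: Record) -> Record:
        common = {self.field_map[k]: v for k, v in native.items() if k in self.field_map}
        # Stand-in for the source identifier: tag each record with its origin.
        common["source_id"] = self.source_id
        return common


@dataclass
class Monitor:
    """Assumed role: report records that are new or changed since the last poll."""
    key: str
    seen: Dict[object, Record] = field(default_factory=dict)

    def changes(self, snapshot: List[Record]) -> List[Record]:
        changed = []
        for rec in snapshot:
            if self.seen.get(rec[self.key]) != rec:
                changed.append(rec)
                self.seen[rec[self.key]] = dict(rec)  # remember a copy
        return changed


@dataclass
class Integrator:
    """Assumed role: collect translated records in the data staging area."""
    staging_area: List[Record] = field(default_factory=list)

    def load(self, records: List[Record]) -> None:
        self.staging_area.extend(records)


def extract(snapshot: List[Record], monitor: Monitor,
            wrapper: Wrapper, integrator: Integrator) -> int:
    """Run one extraction pass for a single source; return the number of records staged."""
    changed = monitor.changes(snapshot)
    integrator.load([wrapper.translate(r) for r in changed])
    return len(changed)
```

In a banking setting, one might give each source system (for example, a hypothetical core banking database and a card system) its own `Wrapper` and `Monitor` while sharing a single `Integrator`, so that heterogeneous sources converge on one staging area.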