Query Processing in Distributed Data Warehouse Using Scheduling Algorithms
Data warehouse is a centralized repository for analyzing and storing huge amount of data. In distributed data warehouse, data can be shared across multiple data repositories which belong to one or more organizations. Query sorting is one of the issues for formatting the number of queries that can be selected together. Reducing the usual completion period of a random order is a common concern. In this paper, the authors are dealing three scheduling algorithms for query scheduling and the performance report based on processing time and memory size is also evaluated. The algorithms discussed are Optimal Resource Constraints (ORC), Grouping based Fine-grained Job Scheduling (GFJS) and Heuristic Algorithm (HA). ORC allocates queries according to their processor's capabilities.