DataGarage: Warehousing Massive Amounts of Performance Data on Commodity Servers
Contemporary datacenters house tens of thousands of servers. The servers are closely monitored for operating conditions and utilizations by collecting their performance data (e.g., CPU utilization). In this paper, the authors show that existing database and le-system solutions are not suitable for warehousing performance data collected from a large number of servers because of the scale and the complexity of performance data. They describe the design and implementation of DataGarage, a performance data warehousing system that they have developed at Microsoft.