University of Idaho
Organizations of all sizes are shifting their IT infrastructures to the cloud because of its cost efficiency and convenience. Because of the on-demand nature of the Infrastructure as a Service (IaaS) clouds, hundreds of thousands of Virtual Machines (VMs) may be deployed and terminated in a single large cloud data center each day. In this paper, the authors propose a content-based scheduling algorithm for the placement of VMs in data centers. They take advantage of the fact that it is possible to find identical disk blocks in different VM disk images with similar operating systems by scheduling VMs with high content similarity on the same hosts. That allows the user to reduce the amount of data transferred when deploying a VM on a destination host.