Debugging Large Scale Applications in a Virtualized Environment
With the advent of petascale machines comprising hundreds of thousands of processors, debugging parallel applications is becoming increasingly challenging. Aside from the complicated techniques required to debug applications at such scale, it is often difficult to gain access to these machines for a sufficient period of time, if at all. Some existing parallel debuggers are capable of handling these machines, but they still require the whole machine to be allocated. In this paper, the authors present an innovative approach to debugging at such extreme scales. By leveraging the concept of object-based processor virtualization, the technique enables debugging of even a million-processor execution under a simulated environment using only a relatively small cluster.