ClusterWatch: Flexible, Lightweight Monitoring for High-end GPGPU Clusters

The ClusterWatch middleware provides runtime flexibility in what system-level metrics are monitored, how frequently such monitoring is done, and how metrics are combined to obtain reliable information about the current behavior of GPGPU clusters. Interesting attributes of ClusterWatch are the ease with which different metrics can be added to the system - by simply deploying additional \"Cluster spies,\" the ability to filter and process monitoring metrics at their sources, to reduce data movement overhead, flexibility in the rate at which monitoring is done, efficient movement of monitoring data into backend stores for long-term or historical analysis, and most importantly, specific support for monitoring the behavior and use of the GPGPUs used by applications.

Provided by: Georgia Institute of Technology Topic: Data Management Date Added: Sep 2013 Format: PDF

Find By Topic