Stardust: Tracking Activity in a Distributed Storage System
Source: Association for Computing Machinery
Performance monitoring in most distributed systems provides minimal guidance for tuning, problem diagnosis, and decision making. Stardust is a monitoring infrastructure that replaces traditional performance counters with end-to-end traces of requests and allows for efficient querying of performance metrics. Such traces better inform key administrative performance challenges by enabling, for example, extraction of per-workload, per-resource demand information and per-workload latency graphs. This paper reports on the experience building and using end-to-end tracing as an on-line monitoring tool in a distributed storage system.
| Format: | Size: | 577.20 | |
| Date: | Jun 2006 |



