Pingmesh: A Large-Scale System for Data Center Network Latency Measurement and Analysis
Can users get network latency between any two servers at any time in large-scale data center networks? The collected latency data can then be used to address a series of challenges: telling if an application perceived latency issue is caused by the network or not, defining and tracking network Service Level Agreement (SLA), and automatic network troubleshooting. The authors have developed the Pingmesh system for largescale data center network latency measurement and analysis to answer the above question affirmatively. Pingmesh has been running in Microsoft data centers for more than four years, and it collects tens of terabytes of latency data per day.