Dealer: Dynamic Request Splitting for Performance-Sensitive Applications in Multi-Cloud Environments
Enterprises are increasingly deploying their applications in the cloud given the cost-saving advantages, and the potential to geo-distribute applications to ensure resilience and better service experience. However, a key unknown is whether it is feasible to meet the stringent response time requirements of enterprise applications using the cloud. The authors make several contributions. First, they show through empirical measurement studies that there is significant short-term variability in application workload and response times of individual components; however the response times of the same component in different data-centers are often uncorrelated. This leads one to argue that there are potential latency savings if work related to a poorly performing component is dynamically reassigned to a replica in a remote data-center.