Enhancing Application Robustness in Cloud Data Centers
The authors propose OX, a runtime system that uses application-level availability constraints and application topologies discovered on the fly to make cloud applications more resilient to infrastructure anomalies. OX allows application owners to specify groups of highly available virtual machines, following component roles and replication semantics. To discover application topologies, OX transparently monitors network traffic among virtual machines. Based on this information, OX builds online topology graphs for applications and incrementally partitions these graphs across the infrastructure to enforce availability constraints and optimize communication between virtual machines.
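
The abstract does not spell out OX's partitioning algorithm, so the following is only an illustrative sketch of the general idea: aggregate observed traffic into a weighted topology graph, then place virtual machines so that members of the same availability group never share a failure domain while heavily communicating machines are co-located where possible. The greedy heuristic, the function names (`build_topology`, `place`), the flow-record format, and the notion of a fixed-capacity "failure domain" are all assumptions made for this example and are not taken from OX.

```python
from collections import defaultdict


def build_topology(flows):
    """Aggregate (src_vm, dst_vm, bytes) records into an undirected weighted graph.

    The record format is hypothetical; a real monitor would supply its own.
    """
    graph = defaultdict(lambda: defaultdict(int))
    for src, dst, nbytes in flows:
        if src == dst:
            continue  # ignore self-traffic
        graph[src][dst] += nbytes
        graph[dst][src] += nbytes
    return graph


def place(vms, graph, groups, domains, capacity):
    """Greedy placement (an assumed heuristic, not OX's algorithm).

    Each VM goes to the failure domain where it exchanges the most traffic
    with already-placed VMs, skipping domains that are full or that already
    host another member of the VM's availability group.
    """
    group_of = {vm: name for name, members in groups.items() for vm in members}
    placement = {}
    load = {d: 0 for d in domains}

    def total_traffic(vm):
        return sum(graph.get(vm, {}).values())

    # Place the heaviest communicators first; break ties by name for determinism.
    for vm in sorted(vms, key=lambda v: (-total_traffic(v), v)):
        best, best_score = None, -1
        siblings = [m for m in groups.get(group_of.get(vm), ()) if m != vm]
        for d in domains:
            if load[d] >= capacity:
                continue  # domain is full
            if any(placement.get(s) == d for s in siblings):
                continue  # would violate the availability constraint
            score = sum(w for peer, w in graph.get(vm, {}).items()
                        if placement.get(peer) == d)
            if score > best_score:
                best, best_score = d, score
        if best is None:
            raise ValueError(f"no feasible failure domain for {vm}")
        placement[vm] = best
        load[best] += 1
    return placement


# Hypothetical two-tier application with two replicas per tier.
flows = [("web1", "db1", 500), ("web2", "db2", 400),
         ("web1", "web2", 10), ("db1", "db2", 50)]
groups = {"web": ["web1", "web2"], "db": ["db1", "db2"]}
topology = build_topology(flows)
placement = place(["web1", "web2", "db1", "db2"], topology, groups,
                  domains=["rackA", "rackB"], capacity=2)
```

In this toy input, the replicas of each tier end up in different racks, and each web server shares a rack with the database it talks to most. An incremental variant would re-run placement only for virtual machines whose traffic pattern or constraints changed, which is closer in spirit to the incremental partitioning the abstract describes.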