University of New Orleans Fund
When cloud consumers are performing rolling upgrade operations on cloud applications, they may encounter failures due to cloud uncertainty. For example, unreliable cloud API calls can make the rolling upgrade operation fail in unpredictable and subtle ways. This paper proposes two recovery strategies for recovering from rolling upgrade failures. The strategies are compensated undo & redo and reparation. The authors evaluated their recovery strategies on Asgard-based rolling upgrade operation on Amazon cloud based on two evaluation metrics: MTTR and service performance. The experiment results show that their strategies perform better than the recovery mechanisms provided by Asgard itself.