KTH - Royal Institute of Technology
Over the last decade, the number, size and complexity of large-scale networked systems has been growing fast, and this trend is expected to accelerate. The best known example of a large-scale networked system is probably the internet, while large datacenters for cloud services are the most recent ones. In such environments, a key challenge is to develop scalable and adaptive technologies for management functions. This paper addresses the challenge by engineering several protocols for distributed monitoring and resource management that are suitable for large-scale networked systems.