University of Illinois at Urbana Champaign
Cloud computing is enabling groups of academic collaborators, groups of business partners, etc., to come together in an ad-hoc manner. This paper focuses on the group-based data transfer problem in such settings. Each participant source site in such a group has a large dataset, which may range in size from gigabytes to terabytes. This data needs to be transferred to a single sink site in a manner that reduces both total dollar costs incurred by the group as well as the total transfer latency of the collective dataset.