Keynote talk: Adaptive Resource Allocation in the Cloud

Srikanth Kandula, Microsoft Research, Redmond

Carefully allocating resources can improve throughput, lower latency and offer more predictable service. In this talk, I will present three recent examples and point out future directions.

With SWAN, we show that given responsive networks and responsive applications adapting who gets to send how much, when, and along which network paths can improve network utilization without losing out on business priorities. We show how SWAN can be incorporated into the wide-area network of enterprises that have a global datacenter footprint. With Kwiken, we show how to improve the tail latency of datacenter services which are built as workflows over many components by appropriately allocating additional resources across the various stages in the workflow. Interestingly, we also cast incompleteness (i.e., returning partial results) as a resource and show that small amounts of incompleteness can improve latency by a lot. Finally, with RoPE, we show how execution plans for jobs in big data clusters can improve given additional information about properties of the user code, data and how the code and data interact. We also describe a system that extracts such properties at scale.

Special issue of Elsevier Journal of Computer and System Sciences

News:

Nov 16th: Program and proceedings now available.

Important dates:

Submission deadline:
October 11, 2013

Author notification:
October 30th, 2013

Early registration:
November 3rd, 2013

Camera ready version:
November 6th, 2013