K2: Reading quickly from storage across many datacenters

Khiem Ngo, Haonan Lu, Wyatt Lloyd

June 2021

Abstract

The infrastructure available to large-scale and medium-scale web services now spans dozens of geographically dispersed datacenters. Deploying across many datacenters has the potential to significantly reduce end-user latency by serving users nearer their location. However, deploying across many datacenters requires the backend storage system be partially replicated. In turn, this can sacrifice the low latency benefits of many datacenters, especially when a storage system provides guarantees on what operations will observe.

We present the K2 storage system that provides lower latency for large-scale and medium-scale web services using partial replication of data over many datacenters with strong guarantees: causal consistency, read-only transactions, and write-only transactions. K2 provides the best possible worst-case latency for partial replication, a single round trip to remote datacenters, and often avoids sending any requests to far away datacenters using a novel replication approach, write-only transaction algorithm, and read-only transaction algorithm.

Type

Conference paper

Publication

51st IEEE/IFIP International Conference on Dependable Systems and Networks (DSN ‘21)

K2: Reading quickly from storage across many datacenters

Abstract

Khiem Ngo

Research Scientist