Cloudy Forecast: How Predictable is Communication Latency in the Cloud? (2309.13169v1)

Published 22 Sep 2023 in cs.DC and cs.NI

Abstract: Many systems and services rely on timing assumptions for the performance and availability of critical aspects of their operation, such as timeouts for failure detectors or optimizations to concurrency control mechanisms. Many of these assumptions depend on the ability of different components to communicate on time: a delay in communication may trigger the failure detector or cause the system to enter a less-optimized execution mode. Unfortunately, these timing assumptions are often set with little regard to the actual communication guarantees of the underlying infrastructure, in particular the variability of communication delays between processes on different nodes or servers. This variability is especially pronounced for systems deployed in the public cloud, since the cloud is a utility shared by many users and organizations, making it prone to higher performance variance due to the noisy neighbor syndrome. In this work, we present Cloud Latency Tester (CLT), a simple tool that measures the variability of communication delays between nodes to help engineers set proper values for their timing assumptions. We also provide an observational analysis of running CLT in three major cloud providers and share the lessons we learned.
