paper

Dapper, a Large-Scale Distributed Systems Tracing Infrastructure

  • Authors:

📜 Abstract

Dapper is a production distributed systems tracing infrastructure developed at Google. It allows Google engineers to analyze complex end-to-end events instead of individual server-side actions. Dapper enables the increasing integration of tracing information into Google's operations, both in real-time and offline. Its most important design features are application-level transparency and pervasive deployment across all of our systems, with low overhead. While Dapper was first developed several years ago, it continues to grow and evolve alongside Google's overall systems and software, and is widely used for analysis at all levels of Google's technical operations.

✨ Summary

The paper “Dapper, a Large-Scale Distributed Systems Tracing Infrastructure” by Benjamin H. Sigelman et al. discusses the development of an infrastructure at Google designed for tracing distributed systems. It highlights Dapper’s ability to facilitate the analysis of complex end-to-end events across Google’s systems with minimal overhead.

Dapper has had a significant influence on the development of distributed tracing techniques and tools beyond Google. It laid the groundwork for subsequent tracing and observability solutions such as Zipkin, a popular distributed tracing system inspired by Dapper (source). The principles and practices described in the paper have contributed to the standardization of distributed tracing protocols and methodologies used in modern cloud-native environments.

Furthermore, Dapper’s concepts have been foundational for the OpenTracing project (source), which provides APIs for distributed tracing, and OpenTelemetry, an observability framework for cloud-native software (source). These projects have widespread usage in the industry, influencing how performance monitoring and tracing are implemented in current distributed systems.

The impact of this work can be seen in the pervasive adoption of distributed tracing in observability stacks across various industries, marking a shift in performance analysis and system monitoring paradigms. The concepts initiated by Dapper continue to inform and inspire the expanding body of research and industrial applications focused on distributed systems observability and performance management.