paper

Harvest, Yield and Scalable Tolerant Systems

  • Authors:

📜 Abstract

As the Internet grows larger and more diverse, software developers are moving away from small-scale monolithic systems. Their focus now is on building high-performance, dependable distributed or cluster-based systems, which can serve as robust Internet services. However, distributed systems are prone to problems such as partial failures, network partitions, network congestion and slow or lossy networks. We introduce two new metrics, harvest and yield, that help analyze a system's ability to deliver certain qualities of service. Harvest is a measure of the completeness of the result set, while yield takes into account traditional availability. We present architecture and software design principles for high-yield, high-harvest, and scalable tolerant systems. To illustrate our approach, we describe the design and implementation of the Internet services of the Inktomi Corporation Traffic Server, which demonstrates how our design principles can be applied in practice.

✨ Summary

The paper “Harvest, Yield and Scalable Tolerant Systems” by Armando Fox and Eric A. Brewer, published in October 1999, introduces the concepts of “harvest” and “yield” as new metrics for evaluating the performance and robustness of distributed systems. The authors designed these concepts to quantify the trade-offs between availability and result completeness in network systems.

The paper has significant influence as it informs the design of distributed systems emphasizing fault tolerance and scalability, with particular reference to Internet services like those used by Inktomi Corporation. However, specific citations of this work in contemporary research or documented industry applications are not prominent in easily accessible academic databases or industry archives. Despite this, the principles discussed resonate with ongoing discussions around designing distributed services with consideration towards scaling and fault tolerance, echoing across many architectural designs in modern cloud computing.

No concrete academic paper or industry report directly references back to this seminal work within the current accessible databases. Nonetheless, the concepts introduced align well with established principles in distributed system design and continue to influence in a broader, less explicit capacity.