Papers tagged recovery mechanisms

Simple Testing Can Prevent Most Critical Failures: An Analysis of Production Failures in Distributed Data-Intensive Systems