A DNA-Based Archival Storage System
📜 Abstract
DNA is an attractive medium for archival storage because it is extremely dense, with a raw limit of about 1 exabyte per cubic millimeter, and long-lasting, with observed half-life of over 500 years. In this paper, we demonstrate a complete DNA-based storage system that encompasses all the key components, including coding across oligonucleotide pools to increase logical density and a novel strategy for random access that scales with data size. To evaluate our design, we stored and retrieved 144 words, with an average 1.5 errors per word, and solved a 150-1,016-error correction problem using a new approach for decoding DNA data. Our work demonstrates that DNA storage is plausible, cost-effective, and feasible for high-density archiving, providing a viable solution to the unsustainable scaling of traditional storage systems.
✨ Summary
This paper details a DNA-based archival storage system that represents a breakthrough in data density, longevity, and cost-effectiveness compared to traditional storage systems. It introduces innovations in coding across oligonucleotide pools to enhance logical data density and a novel random access strategy that is scalable with data size. Notable is the demonstration of encoding and successfully retrieving 144 words, showcasing the potential of DNA for large-scale storage solutions.
A web search reveals multiple citations in subsequent research that explore and build upon these innovations. For instance, “DNA Data Storage: Research Landscape and Future Trends” by Ceze et al., 2019 describes the impact of DNA storage systems on future storage technologies, referencing this work as pivotal. Additionally, “The Prospects of DNA Data Storage,” 2020 IEEE articulates the technological advancements and future trends influenced by the methodologies demonstrated in this paper. Despite these references, broad industrial application is still in early stages, reflecting the innovative nature and potential future significance of this research.