The exponential growth of data generated by humanity, particularly with the rise of artificial intelligence, has spurred scientific exploration into alternative storage mediums. DNA, the fundamental building block of life and nature’s inherent data storage mechanism, has emerged as a promising candidate. Its remarkable density allows it to store information at least 1,000 times more compactly than traditional solid-state hard drives. Previous demonstrations of DNA’s storage capacity have included encoding entire literary works, musical compositions, and even episodes of television shows into minuscule amounts of DNA. While these endeavors were primarily research-oriented or publicity stunts, they highlighted the potential of DNA data storage.
The field of DNA data storage has recently witnessed a significant step towards commercialization with the release of the first commercially available book encoded in DNA. Asimov Press, in collaboration with Catalog, a Boston-based biotechnology company, has launched an anthology of essays and science fiction stories stored within strands of DNA. For a modest price, consumers can acquire both a physical copy of the book and a metal capsule containing the DNA-encoded version. This marks a pivotal moment in the transition of DNA data storage from the realm of research to tangible consumer products.
Catalog employed a novel approach to encode the book into DNA, leveraging a method known as combinatorial assembly. This technique, likened to the Gutenberg printing press, involves creating a library of DNA snippets representing bits of information. These snippets can then be combined in various arrangements to encode the desired data. This method differs from the traditional approach of directly translating binary code into DNA sequences, offering potentially higher efficiency and scalability. Catalog manufactured these DNA snippets in bulk and used enzymes to assemble them into the desired sequences, significantly reducing the cost and time associated with traditional DNA synthesis.
The encoding process for the book, involving the creation of approximately 500,000 unique DNA molecules representing the 240 pages of text, incurred a cost in the low thousands of dollars. However, the cost-effectiveness of this method lies in the ability to produce numerous copies from the initial encoding. Utilizing the tools of molecular biology allows for facile replication of the DNA, thus driving down the per-unit cost for mass production. This represents a significant advantage over other DNA storage methods that rely on expensive and time-consuming synthesis for each copy.
The commercial landscape of DNA data storage is still nascent, but other companies are also exploring this field. Biomemory, a French company, previously offered a DNA storage card capable of storing a small amount of data, equivalent to a short email. While the price point was relatively high due to the costs associated with DNA synthesis, the offering served as a market test to gauge consumer interest and demonstrate the feasibility of their technology. Catalog’s approach, with its combinatorial assembly method, aims to address the cost barrier and make DNA data storage more accessible.
Following the encoding process performed by Catalog, the DNA molecules were dehydrated, transforming them into a powder form, and shipped to France. Imagene, a biological storage company, encapsulated the DNA powder within stainless steel capsules designed for long-term preservation. These capsules contain an inert atmosphere devoid of oxygen and moisture, ensuring the stability and integrity of the DNA for potentially thousands of years. This packaging method ensures the longevity and safe storage of the encoded information, safeguarding against environmental degradation and allowing for retrieval even after extensive periods. The development of robust and reliable storage methods is crucial for the widespread adoption of DNA data storage technology.