Nature                          volume  634, pages  824–832 (2024 )Cite this article                      DNA storag

Parallel molecular data storage by printing epigenetic bits on DNA

submited by
Style Pass
2024-10-23 16:00:05

Nature volume  634, pages 824–832 (2024 )Cite this article

DNA storage has shown potential to transcend current silicon-based data storage technologies in storage density, longevity and energy consumption1,2,3. However, writing large-scale data directly into DNA sequences by de novo synthesis remains uneconomical in time and cost4. We present an alternative, parallel strategy that enables the writing of arbitrary data on DNA using premade nucleic acids. Through self-assembly guided enzymatic methylation, epigenetic modifications, as information bits, can be introduced precisely onto universal DNA templates to enact molecular movable-type printing. By programming with a finite set of 700 DNA movable types and five templates, we achieved the synthesis-free writing of approximately 275,000 bits on an automated platform with 350 bits written per reaction. The data encoded in complex epigenetic patterns were retrieved high-throughput by nanopore sequencing, and algorithms were developed to finely resolve 240 modification patterns per sequencing reaction. With the epigenetic information bits framework, distributed and bespoke DNA storage was implemented by 60 volunteers lacking professional biolab experience. Our framework presents a new modality of DNA data storage that is parallel, programmable, stable and scalable. Such an unconventional modality opens up avenues towards practical data storage and dual-mode data functions in biomolecular systems.

The markedly expanding global data-sphere has posed an imminent challenge on large-scale data storage and an urgent need for better storage materials5,6. Inspired by the way genetic information is preserved in nature, DNA has been recently considered a promising biomaterial for digital data storage owing to its extraordinary storage density and durability1,2,3. In current DNA storage, data is typically transcoded into nucleotide base sequences, and writing depends on de novo synthesis in which nucleotides are added one-by-one in predetermined orders7. Although de novo synthesis technologies have advanced continuously in throughput and efficiency4,8, the serial synthesis process essentially limits the writing speed and the length of synthesized DNA, and prevents a substantial cost reduction in data writing8,9 (Supplementary Fig. 1).

Leave a Comment