DNA used to encode a book and other digital information

ScienceRocks

Democrat all the way!
Mar 16, 2010
59,455
6,793
1,900
The Good insane United states of America
DNA used to encode a book and other digital information
August 17, 2012
by Lin Edwards (Phys.org) -- A team of researchers in the US has successfully encoded a 5.27 megabit book using DNA microchips, and they then read the book using DNA sequencing. Their experiments show that DNA could be used for long-term storage of digital information.

George Church and Sriram Kosuri of Harvard’s Wyss Institute for Biologically Inspired Engineering, and colleagues, encoded Church’s book “Regenesis” of around 53,400 words into DNA sequences, along with 11 images in JPG format and a JavaScript program. This is 1,000 times more data than has been encoded in DNA previously. DNA is made up of nucleotides, and in theory at least each nucleotide can be used to encode two bits of data. This means that the density is a massive 1 million gigabits per cubic millimeter, and only four grams of DNA could theoretically store all the digital data created annually. This is much denser than digital storage media such as flash drives, and more stable, since the DNA sequences could be read thousands of years after they were encoded. The experiment’s success lay in the strategy of encoding the data in short sequences of DNA rather than long ones, and this reduced the difficulty and cost of writing and reading the data.

Dr Kosuri said the process was analogous to storing data on a hard drive, where data is written in small blocks called sectors. They first converted the book, program and images to HTML and then translated this into a sequence of 5.27 million 0s and 1s, and these 5.27 megabits were then sequenced into sections of nucleotides 96 bits long using one DNA nucleotide for one bit. The nucleotide bases A and C encoded for 0, while G and T encoded for 1. Each block also contained a 19 bit address to encode the block’s place in the overall sequence. Multiple copies of each block were synthesized to help in error correction. After the book and other information was encoded into the DNA, drops of DNA were attached to microarray chips for storage. The chips were kept at 4°C for three months and then dissolved and sequenced. Each copy of each block of nucleotides was sequenced up to 3,000 times so that a consensus could be reached. In this way they reduced the bit errors in the 5.27 megabits to just 10.

The procedure, described in a paper in the journal Science, cannot be used for rewritable data but could be used for very long-term storage of data. One advantage of using DNA is that a much greater density of information can be stored, but another major advantage is that DNA is a biological molecule that will always be able to be read biologically without special equipment such as CD or DVD players that can quickly become obsolete. The main disadvantage of this system is that at the moment the technologies used to synthesize and sequence DNA are far too expensive for it to be a practical system for everyday use. Another problem is that while DNA has been sequenced from sources such as mummies thousands of years old, the DNA tends to be fragmented, and work needs to be done on improving the stability of DNA over centuries and longer.

Read more at: DNA used to encode a book and other digital information
 
  • Thread starter
  • Banned
  • #2
Scientists successfully store 700TB of data in one gram of DNA


Traditional platter-based hard drives and flash-based solid state drives might dominate the storage landscape today, but in the future, you'll be storing more data than you could possibly sift through within your very own DNA.

George Church and Sri Kosuri, two Harvard Wyss Institute scientists have successfully managed to store 700TB of data in one gram of DNA.

ExtremeTech's Sebastian Anthony breaks it down like so:



Instead of binary data being encoded as magnetic regions on a hard drive platter, strands of DNA that store 96 bits are synthesized, with each of the bases (TGAC) representing a binary value (T and G = 1, A and C = 0).

To read the data stored in DNA, you simply sequence it — just as if you were sequencing the human genome — and convert each of the TGAC bases back into binary. To aid with sequencing, each strand of DNA has a 19-bit address block at the start (the red bits in the image below) — so a whole vat of DNA can be sequenced out of order, and then sorted into usable data using the addresses.


In addition to the obvious size advantages that would allow you to store a virtually unlimited amount of documents, photos, videos, music, etc., stashing data in DNA is great because it's not susceptible to temperature for long-term storage. What that means is your secret collection of smutty videos will still be accessible hundreds of thousands of years after you're dead.

As for the disadvantages, Church says that reading and writing data in DNA is "slower than in other media, however, which makes it better suited for archival storage of massive amounts of data, rather than for quick retrieval or data processing."

While the breakthrough is an utter success, like all advancements, it could be a while before the technology trickles down to consumers.

Harvard Medical School, via ExtremeTech

http://hms.harvard.edu/content/writing-book-dna
 
Last edited:

Forum List

Back
Top