Difference between revisions of "Team:UESTC-software/Description"

 
(8 intermediate revisions by 3 users not shown)
Line 25: Line 25:
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Design">Design</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Design">Design</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Features">Features</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Features">Features</a></li>
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Model">Model</a></li>
+
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Model">Modeling</a></li>
 
                          
 
                          
 
                         <li class="three-nav"><a href="https://2016.igem.org/Team:UESTC-software/Proof">Proof</a></li>
 
                         <li class="three-nav"><a href="https://2016.igem.org/Team:UESTC-software/Proof">Proof</a></li>
 
                         <li class="three-nav"><a href="https://2016.igem.org/Team:UESTC-software/Demonstrate">Results</a></li>
 
                         <li class="three-nav"><a href="https://2016.igem.org/Team:UESTC-software/Demonstrate">Results</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Future">Future</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Future">Future</a></li>
 +
                        <li><a href="https://2016.igem.org/Team:UESTC-software/Parts">Parts</a></li>
 
                         <li class="three-nav"><a href="https://2016.igem.org/Team:UESTC-software/Extra_work">Extra Work—Bio2048</a></li>
 
                         <li class="three-nav"><a href="https://2016.igem.org/Team:UESTC-software/Extra_work">Extra Work—Bio2048</a></li>
 
                     </ul>
 
                     </ul>
 
                 </li>
 
                 </li>
                 <li><a href="https://2016.igem.org/Team:UESTC-software/Judging">JUDGING</a>
+
                 <li><a href="https://2016.igem.org/Team:UESTC-software/Judging?id=1">JUDGING</a>
 
                     <ul class="sub-nav">
 
                     <ul class="sub-nav">
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Medal_requirements">Medal Requirements</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Medal_requirements">Medal Requirements</a></li>
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Safety">Safety</a></li>
+
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Safety?">Safety</a></li>
 
                     </ul>
 
                     </ul>
 
                 </li>
 
                 </li>
                 <li><a href="https://2016.igem.org/Team:UESTC-software/Team">TEAM</a>
+
                 <li><a href="https://2016.igem.org/Team:UESTC-software/Team?id=2">TEAM</a>
 
                     <ul class="sub-nav">
 
                     <ul class="sub-nav">
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Members">Team</a></li>
+
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Members?id=2&index=0">Team</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Collaborations">Collaborations</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Collaborations">Collaborations</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Notebooks">Notebooks</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Notebooks">Notebooks</a></li>
Line 48: Line 49:
 
                 <li><a href="https://2016.igem.org/Team:UESTC-software/HP">HUMAN PRACTICES</a>
 
                 <li><a href="https://2016.igem.org/Team:UESTC-software/HP">HUMAN PRACTICES</a>
 
                     <ul class="sub-nav">
 
                     <ul class="sub-nav">
                        <li><a href="https://2016.igem.org/Team:UESTC-software/HP/Gold">Gold</a></li>
+
<li><a href="https://2016.igem.org/Team:UESTC-software/HP/Silver">Silver</a></li>                      
                        <li><a href="https://2016.igem.org/Team:UESTC-software/HP/Silver">Silver</a></li>
+
<li><a href="https://2016.igem.org/Team:UESTC-software/HP/Gold">Gold</a></li>
 +
                       
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Integrated_Practices">Integrated Practices</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Integrated_Practices">Integrated Practices</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Engagement">Engagement</a></li>
 
                         <li><a href="https://2016.igem.org/Team:UESTC-software/Engagement">Engagement</a></li>
Line 62: Line 64:
 
</header>
 
</header>
 
<div class="content-top">
 
<div class="content-top">
     <img src="https://static.igem.org/mediawiki/2016/3/30/UESTC-Software-attributions.png">
+
     <img src="https://static.igem.org/mediawiki/igem.org/f/fe/Uestc_software-project_description2.png">
     <p class="title">Attributions</p>
+
     <p class="title">Description</p>
 
</div>
 
</div>
 
<div class="detail-content">
 
<div class="detail-content">
                <p> In the past summer, fifteen members of the iGEM UESTC-Software team worked together to achieve our goal, to develop a biological information storage system. During the process, every one made an effort to go beyond the set goal. Moreover, the team’s success is not only a result of the concerted contribution of all members and but also due to the generous help from many other people.</p>
+
                 <h2 id="Abstract">Abstract</h2>
                 <h2 id="Member Attributions">Member Attributions</h2>
+
                 <p> Towards creating a platform for information storage in artificial DNA, our team built a user-friendly <b>web-based</b> tool Bio101 to encode and decode information in DNA sequences. Through a five-step process, compression, encryption, bit-to-nt conversion, indexing and validation (Fig.1), Bio101 can be used to encode conventional electronic computer files into nucleotide sequences which are ready for chemical synthesis and decode the sequencing results of a DNA sample reversely. In addition, we proposed a solution for <b>DNA-based file editing</b>.</p>
                <strong>Projects</strong>
+
                  
                <br>
+
                 <p class="img-p" style="font-size: 13px;"><img src="https://static.igem.org/mediawiki/2016/e/e8/Uestc-software-description-1.png"/><br/><B>Fig.1.</B>Five-step design of Bio101 </p>
                <strong style="font-style: italic;">Bio101 design</strong>
+
                 <p class="one-line" style="margin-bottom:0;">Back-end and Webpage Framework: Dongkai Pu, Jianwei Xu</p>
+
                <p class="one-line" style="margin:0;">Front-end: Jianwei Xu, Jun Li, Meng Liu</p>
+
                <p class="one-line" style="margin:0;">Modeling: Dongkai Pu, Meng Liu, Chenxiang Zheng, Hao Xu, Wei Zhao</p>
+
                <p class="one-line" style="margin:0;">Test and Validation: Zhongtian Ma, Jianwei Xu</p>
+
                <p class="one-line" style="margin:0;">Art Design: Xin Ma, Zining Wu</p>
+
                <p class="one-line" style="margin:0;">Documentation: Meng Liu, Jun Li, Yuening Yan, Caixi Xi</p>
+
                <strong style="font-style: italic;">Bio2048 design</strong>
+
                <p class="one-line" style="margin-bottom:0;">User Interface: Xin Ma</p>
+
                <p class="one-line" style="margin:0;">Programming: Xin Ma</p>
+
                <p class="one-line" style="margin:0;">Icon Degisn:Haobo Zhou</p>
+
                <p class="one-line" style="margin:0;">Hierarchical Concept: Zhongtian Ma</p>
+
       
+
               
+
                <strong>Human Practices and Collaborations</strong>
+
                <br>
+
                <strong style="font-style: italic;">Development of Data-Processing Tool for UESTC-China</strong>
+
                <p class="one-line" style="margin-bottom:0;">Programming and Documentation: Jianwei Xu, Dongkai Pu, Xin Ma</p>
+
                <strong style="font-style: italic;">Support for TMMU_China</strong>
+
                <p class="one-line" style="margin-bottom:0;">Programming and Documentation: Meng Liu</p>  
+
                <strong style="font-style: italic;">Collaboration with AHUT_ China</strong>
+
                <p class="one-line" style="margin-bottom:0;">Video Meeting: Jianwei Xu, Hao Xu</p>
+
 
+
                 <strong style="font-style: italic;">iGEM Southwest China Union Meetup</strong>
+
 
+
                 <p class="one-line" style="margin-bottom:0;">Presentation: Chenxiang Zheng</p>
+
                <p class="one-line" style="margin:0;">Poster: Zining Wu</p>
+
                <p class="one-line" style="margin:0;">Brief Introduction: Hui Che</p>
+
 
+
                <strong style="font-style: italic;">Meet-up with TMMU_China and Nanjing-China</strong>
+
                <p class="one-line" style="margin-bottom:0;">Presentation: Hui Che, Haobo Zhou</p>
+
                <p class="one-line" style="margin:0;">Poster: Zining Wu</p>
+
                <p class="one-line" style="margin:0;">Brief Introduction: Jianwei Xu</p>
+
 
+
                <strong style="font-style: italic;">Science Lecture</strong>
+
                <p class="one-line" style="margin-bottom:0;">Material Preparation: Caixi Xi, Hao Xu</p>
+
                <p class="one-line" style="margin:0;">Presentation: Yuening Yan, Hui Che</p>
+
                <p class="one-line" style="margin:0;">PowerPoint Slides Design: Zining Wu</p>
+
                <p class="one-line" style="margin:0;">Photo: Haobo Zhou</p>
+
                <p class="one-line" style="margin:0;">Documentation: Yuening Yan, Hui Che</p>
+
 
+
                <strong style="font-style: italic;">Lab-Open Day Event</strong>
+
                <p class="one-line" style="margin-bottom:0;">Design, Planning and Implementation: Hao Xu, Caixi Xi</p>
+
 
+
                <strong style="font-style: italic;">Sina Microblog Public Account</strong>
+
                <p class="one-line" style="margin-bottom:0;">Management: Zhongtian Ma</p>
+
 
+
                <strong style="font-style: italic;">Community Propaganda</strong>
+
                <p class="one-line" style="margin-bottom:0;">Design and Plan: Caixi Xi, Haobo Zhou</p>
+
 
+
              <strong>Art and design</strong>
+
              <br>
+
              <strong style="font-style: italic;">Wiki</strong>
+
              <p class="one-line" style="margin-bottom:0;">Programming: Meng Liu, Jun Li</p>
+
              <p class="one-line" style="margin-bottom:0;">Design: Haobo Zhou</p>
+
              <p class="one-line" style="margin-bottom:0;">Copywriting: Hui Che, Yuening Yan, Meng Liu, Caixi Xi, Wei Zhao, Zhongtian Ma, Chenxiang Zheng</p>
+
              <strong style="font-style: italic;">Team sign and uniform</strong>
+
              <p class="one-line" style="margin-bottom:0;">Inspiration: Haobo Zhou</p>
+
              <p class="one-line" style="margin-bottom:0;">Design and Draw: Zining Wu, Xin Ma</p>
+
  
 +
                <P>This coding tool can create a convenient DNA information workflow, so researchers can choose <b>any files</b> they want and focus on synthesizing DNA. The surprisingly simple idea has the potential to reshape the global face of data storage in the not-too-distant future, and our work contributes to practical areas can accelerate the step to success.</P>
 +
                <h2 id="Background">Background </h2>
 +
                <p>Living in an information explosion era, digital production, transmission and storage have not only revolutionized the way information is accessed and used, but also made information archiving an increasingly complex task<sup>[1]</sup>.Have you been perplexed by vast quantity of information? And have you ever imagined if there exists a <b>practical, high-capacity, low-maintenance,</b> and even <b>self-copy</b> information storage medium which would be still readable after thousands of years? It is not just a dreamy illusion anymore because of the appearance of DNA storage technology.</p>
  
                <h2 id="Special Thanks" >Special Thanks</h2>
 
                <strong>Professor Xianlong Wang, Primary PI</strong>
 
 
                  
 
                  
                 <p>Helping the team design and plan the project. His guidance to all 15 members is greatly appreciated. </p>
+
                 <p>DNA, one of the most miraculous masterpiece created by Nature as the stable genetic material, holds a great promise for <b>high-density</b>, <b>long-term</b> and <b>massive</b> information storage. For example, human genome, just 3 billion base pairs, encodes all of the complex biology information of human being, including appearance, metabolism, growth, development, production and many other delicate functions (Fig.2). </p>
  
                 <strong>Professor Fengbiao Guo, Secondary PI</strong>
+
                 <p class="img-p" style="font-size: 13px;"><img src="https://static.igem.org/mediawiki/igem.org/f/f9/Uestc-software-description-2.png"/><br/><B>Fig.2.</B> Genome controls the growth and development of human being.</p>
                <p>Giving the team precious advice on the presentation and wiki design. </p>
+
                 <p>Why do not we utilize DNA to store information? Researches indicate that it is extremely dense, and spectacularly high-capacity with a raw storage density limit of <b>1 exabyte/mm<sup>3</sup>(10<sup>9</sup>GB/mm<sup>3</sup>)<sup>[2]</sup></b>. In other words, every gram of DNA is equivalent to <b>14 thousand 50 GB blue-ray discs</b> or <b>233 x 3 TB hard-disks </b>which weighs more than 151 kg. Meanwhile, compared with current routinely used information storage media, DNA can be stored up to centuries (Fig.3)<sup>[2]</sup>. Grass et al developed an innovative preservation of digital information on DNA which presumably can last 1 million years in silica<sup>[3]</sup>. In conclusion, a great number of virtues make DNA an ideal archival material for information storage.</p>
 
+
                 <p class="img-p" style="font-size: 13px;"><img src="https://static.igem.org/mediawiki/igem.org/9/90/Uestc-software-description-3.png"/><br/><B>Fig.3.</B>DNA storage as the bottom level of the storage hierarchy<sup>[2]</sup>.</p>
                <strong>Ling Quan and Qiong Zhang, Instructors</strong>
+
                <h2 id="Biotechnology Availability">Biotechnology Availability</h2>
 
+
                <p>You may wonder how a piece of arbitrary information is stored into DNA molecules. On the physical reading side, it depends on the DNA sequencing technology (Fig.4) In 1977, Frederick Sanger adopted primer-extension strategy to develop the DNA sequencing method "DNA sequencing with chain-terminating inhibitors"<sup>[4]</sup>, and it directly facilitated human genome project<B>(HGP)</B>. But Sanger method is too expensive and takes too long time, such that HGP project spent<b> 3 billion dollars</b> over fifteen years <sup>[5]</sup>. Until next-generation sequencing <B>(NGS)</B>, also named high-throughput sequencing, technologies generated, the cost of vast sequencing became comprehensive and really acceptable. 454 pyrosequencing <sup>[6]</sup>, illumina (Solexa) sequencing <sup>[7]</sup>, SOLiD sequencing <sup>[8]</sup> are three mainly applied popular methods. Meanwhile the third generation nanopore DNA sequencing has appeared <sup>[9]</sup>. Owing to technology development, sequencing cost per genome exponential decrease from 1 million dollars in 2008 to just 1 thousand now <sup>[10]</sup>. Encouragingly, the cost will keep dropping, and sequencing would become as simple as reading information from a hard disk or compact optical disk.</p>
                 <p>Their leadership and directions on how to build a successful team is worth mentioning. They taught us how to become more friendly and united. Also, they gave us advice on issues regarding travelling to Boston and the contents of our presentation.</p>
+
                <p class="img-p" style="font-size: 13px;"><img src="https://static.igem.org/mediawiki/igem.org/d/d1/Uestc-software-description-4.png"/><br/><B>Fig.4.</B>The history of DNA sequencing technology progress.</p>
 
+
                 <p>On the other hand, writing information physically into DNA molecules depends on the chemical synthesis of artificial DNA sequences. Currently, oligonucleotide synthesis is used for preparing primers, gene probes, etc. The process is implemented as solid-phase synthesis using phosphoramidite method <sup>[11]</sup> and phosphoramidite building blocks derived from protected 2'-deoxynucleosides (dA, dC, dG, and T), ribonucleosides (A, C, G, and U), or chemically modified nucleosides. (Fig.5) Unfortunately, we still cannot synthesize any DNA molecules as we wish, for example long DNA segments or some possessing special second structures or high GC-content now are still hard or impossible to synthesize chemically. Although oligonucleotide synthesis has been proposed as early as in 1955 <sup>[12]</sup>, the technique is still very expensive, comparing the synthesis cost 0.04 US cent per base (add reference here.) with just <b>1 cent in reading 1 million bases</b>. Therefore, it is not surprising to find the writing of one megabyte of information cost $12,400 while the reading only costs $220 in a recent DNA storage experiment <sup>[14]</sup>.</p>
                <strong>Kaiyue Zhang, the team leader of 2015 UESTC-China, and Yaocong Duan, the team leader of 2015 UESTC-Software.</strong>
+
                 <p class="img-p" style="font-size: 13px;"><img src="https://static.igem.org/mediawiki/igem.org/a/ab/Uestc-software-description-5.png"/><br/><B>Fig.5.</B>Synthetic cycle for preparation of oligonucleotides by phosphoramidite method<sup>[13]</sup>.</p>
                <p>For giving us advice on building our team and disciplines, providing support outside of training, which encouraged us to be focused and produce a perfect project.</p>
+
                 <p>However, we believe as the rapid development of biotechnology the DNA synthesis will become cheap enough for the information storage usage. What remains to be solved is how to encode an arbitrary computer file into DNA sequences and decode the DNA sequencing information. There lacks a bridge between today’s electronic based information world and the future biotechnology based information world, and what we want to do is to connect them.</p>
 
+
                 <h2 id="References">References</h2>
                <strong>Xu Wang, System Administrator at Xingchen Studio</strong>
+
                <br> 
                <p>For building the server and helping us set up backstage when we had problem during visualization design.</p>
+
                <ul>
 
+
                <li style="font-size:13px;">[1] Goldman N, Bertone P, Chen S, et al. Towards practical, high-capacity, low-maintenance information storage in synthesized DNA[J]. Nature, 2013, 494(7435): 77-80.</li>
                <strong>Bohan Li, Ziyun Guan, Jingyi Wang, Huanghao Yang, Wei Liu, the Preliminary Team Members (sophomores of UESTC).</strong>
+
                <li style="font-size:13px;">[2] Bornholt J, Lopez R, Carmean D M, et al. A DNA-based archival storage system[C]//Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems. ACM, 2016: 637-649.</li>
                <p>For helping us with human practices including the implementation of the open-day visiting activity “To Be Scientists Tomorrow” and Community Advocacy. Their help is especially appreciated for taking photos, shooting videos and preparing materials. </p>
+
                <li style="font-size:13px;">[3] Grass R N, Heckel R, Puddu M, et al. Robust Chemical Preservation of Digital Information on DNA in Silica with Error‐Correcting Codes[J]. Angewandte Chemie International Edition, 2015, 54(8): 2552-2555. </li>  
 
+
                <li style="font-size:13px;">[4] Sanger F, Nicklen S, Coulson A R. DNA sequencing with chain-terminating inhibitors[J]. Proceedings of the National Academy of Sciences, 1977, 74(12): 5463-5467. </li>  
                <strong>Other teams taking part in the competition</strong>
+
                <li style="font-size:13px;">[5] Lander E S, Linton L M, Birren B, et al. Initial sequencing and analysis of the human genome[J]. Nature, 2001, 409(6822): 860-921. </li>  
                <br>
+
                 <li style="font-size:13px;">[6] Margulies M, Egholm M, Altman W E, et al. Genome sequencing in microfabricated high-density picolitre reactors[J]. Nature, 2005, 437(7057): 376-380. </li>  
                 <p><a href="https://2016.igem.org/Team:SCU-China" style="color:#3C9CD3;text-indent:28px;font-size:15px;" target="_blank"> <b>·SCU-China</b></a>,<a href="https://2016.igem.org/Team:Nanjing-China" style="color:#3C9CD3;text-indent:28px;font-size:15px;" target="_blank"><b>Nanjing-China</b></a>,<a href="https://2016.igem.org/Team:TMMU_China" style="color:#3C9CD3;text-indent:28px;font-size:15px;" target="_blank"> <b>TMMU_China</b></a>,<a href="https://2016.igem.org/Team:UESTC-China" style="color:#3C9CD3;text-indent:28px;font-size:15px;" target="_blank"> <b>UESTC-China</b></a>,<a href="https://2016.igem.org/Team:AHUT_China" style="color:#3C9CD3;text-indent:28px;font-size:15px;" target="_blank"><b>AHUT_China</b></a></p>
+
                 <li style="font-size:13px;">[7] Bentley D R, Balasubramanian S, Swerdlow H P, et al. Accurate whole human genome sequencing using reversible terminator chemistry[J]. nature, 2008, 456(7218): 53-59. </li>  
                 <p>For discussing our projects with us and putting forward some views and suggestions to improve our task. </p>
+
                 <li style="font-size:13px;">[8] Valouev A, Ichikawa J, Tonthat T, et al. A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning[J]. Genome research, 2008, 18(7): 1051-1063. </li>  
                 <p><a href="https://2016.igem.org/Team:UESTC-China" style="color:#3C9CD3;text-indent:28px;font-size:15px;" target="_blank"><b>·UESTC-China</b></a></p>
+
                 <li style="font-size:13px;">[9] Clarke J, Wu H C, Jayasinghe L, et al. Continuous base identification for single-molecule nanopore DNA sequencing[J]. Nature nanotechnology, 2009, 4(4): 265-270. </li>  
                 <p>For mentoring us to do wet-lab experiment and assisting us with two human practices, the lab-open day “To Be Scientists Tomorrow” and the science lecture in Puyang Middle School. </p>
+
                 <li style="font-size:13px;">[10] Kedes L, Liu E T. The Archon Genomics X PRIZE for whole human genome sequencing[J]. Nature genetics, 2010, 42(11): 917-918. </li>  
                 <p><a href="https://2016.igem.org/Team:AHUT_China" style="color:#3C9CD3;text-indent:28px;font-size:15px;" target="_blank"> <b>·AHUT_China</b></a></p>
+
                 <li style="font-size:13px;">[11] Reese C B. Oligo-and poly-nucleotides: 50 years of chemical synthesis[J]. Organic & biomolecular chemistry, 2005, 3(21): 3851-3868. </li>  
                <p>For doing dry-lab testing and offering testing report for our project. </p>
+
                 <li style="font-size:13px;">[12] Michelson A M, Todd A R. Nucleotides part XXXII. Synthesis of a dithymidine dinucleotide containing a 3′: 5′-internucleotidic linkage[J]. Journal of the Chemical Society (Resumed), 1955: 2632-2638. </li>  
 
+
                 <li style="font-size:13px;">[13] https://commons.wikimedia.org/wiki/File%3AOligocycle1.png </li>
                 <strong>Developers of the Following Open-Source Libraries and Packages</strong>
+
                <li style="font-size:13px;">[14] Goldman N, Bertone P, Chen S, et al. Towards practical, high-capacity, low-maintenance information storage in synthesized DNA[J]. Nature, 2013, 494(7435): 77-80. </li>
                 <p>Django, Owl-Carousel, Bootstrap, JQuery and ISAAC.</p>
+
                </ul>
 
+
                 <strong>Dean’s Office of University of Electronic Science and Technology of China</strong>
+
                 <p>It’s a great deal for us that the Dean’s Office provided our funds during the competition. The Dean’s Office covered the flight fees, accommodation, also Team Registration Fees and the Jamboree Attendance Fee, which greatly reduced our financial burden and motivated us to focus on the competition.</p>
+
 
+
                 <strong>Thanks again to all the people who helped us during the summer. With your support, we become a successful iGEM team. </strong>
+
                 <br>
+
                 <br>
+
                 <img src="https://static.igem.org/mediawiki/2016/7/7d/Uestc_software-mingxie1.PNG" style="width:50%;margin-left:200px;"></img>
+
 
</div>
 
</div>
 
<footer class="footer">
 
<footer class="footer">
 
     <div class="footer-top">
 
     <div class="footer-top">
 
         <p>FOLLOW US:
 
         <p>FOLLOW US:
             <a href="https://github.com/IGEM-UESTC-software" target="_blank"><img src="https://static.igem.org/mediawiki/igem.org/0/06/Uestc_software-github.png" /></a>
+
             <a href="https://github.com/igemsoftware2016/UESTC-Software-2016" target="_blank"><img src="https://static.igem.org/mediawiki/igem.org/0/06/Uestc_software-github.png" /></a>
 
             <a href="http://www.uestc.edu.cn/" target="_blank"><img src="https://static.igem.org/mediawiki/igem.org/a/a4/Uestc_software-school.png" /></a>
 
             <a href="http://www.uestc.edu.cn/" target="_blank"><img src="https://static.igem.org/mediawiki/igem.org/a/a4/Uestc_software-school.png" /></a>
 
             <a href="http://weibo.com/u/5621240588?refer_flag=1001030101_&is_hot=1" target="_blank"><img src="https://static.igem.org/mediawiki/igem.org/b/b1/Uestc_software-weibo.png" /></a>
 
             <a href="http://weibo.com/u/5621240588?refer_flag=1001030101_&is_hot=1" target="_blank"><img src="https://static.igem.org/mediawiki/igem.org/b/b1/Uestc_software-weibo.png" /></a>
Line 192: Line 128:
 
     <ul>
 
     <ul>
 
         <li>
 
         <li>
             <a href="#Member Attributions">
+
             <a href="#Abstract">
 
                 <span></span>
 
                 <span></span>
               Member Attributions
+
               Abstract
 
             </a>
 
             </a>
 
         </li>
 
         </li>
 
         <li>
 
         <li>
             <a href="#Special Thanks">
+
             <a href="#Background">
 
                 <span></span>
 
                 <span></span>
               Special Thanks
+
               Background
 +
            </a>
 +
        </li>
 +
        <li>
 +
            <a href="#Biotechnology Availability">
 +
                <span></span>
 +
                Biotechnology Availability
 +
            </a>
 +
        </li>
 +
        <li>
 +
            <a href="#References">
 +
                <span></span>
 +
                References
 
             </a>
 
             </a>
 
         </li>
 
         </li>

Latest revision as of 01:32, 20 October 2016

三级页面

Description

Abstract

Towards creating a platform for information storage in artificial DNA, our team built a user-friendly web-based tool Bio101 to encode and decode information in DNA sequences. Through a five-step process, compression, encryption, bit-to-nt conversion, indexing and validation (Fig.1), Bio101 can be used to encode conventional electronic computer files into nucleotide sequences which are ready for chemical synthesis and decode the sequencing results of a DNA sample reversely. In addition, we proposed a solution for DNA-based file editing.


Fig.1.Five-step design of Bio101

This coding tool can create a convenient DNA information workflow, so researchers can choose any files they want and focus on synthesizing DNA. The surprisingly simple idea has the potential to reshape the global face of data storage in the not-too-distant future, and our work contributes to practical areas can accelerate the step to success.

Background

Living in an information explosion era, digital production, transmission and storage have not only revolutionized the way information is accessed and used, but also made information archiving an increasingly complex task[1].Have you been perplexed by vast quantity of information? And have you ever imagined if there exists a practical, high-capacity, low-maintenance, and even self-copy information storage medium which would be still readable after thousands of years? It is not just a dreamy illusion anymore because of the appearance of DNA storage technology.

DNA, one of the most miraculous masterpiece created by Nature as the stable genetic material, holds a great promise for high-density, long-term and massive information storage. For example, human genome, just 3 billion base pairs, encodes all of the complex biology information of human being, including appearance, metabolism, growth, development, production and many other delicate functions (Fig.2).


Fig.2. Genome controls the growth and development of human being.

Why do not we utilize DNA to store information? Researches indicate that it is extremely dense, and spectacularly high-capacity with a raw storage density limit of 1 exabyte/mm3(109GB/mm3)[2]. In other words, every gram of DNA is equivalent to 14 thousand 50 GB blue-ray discs or 233 x 3 TB hard-disks which weighs more than 151 kg. Meanwhile, compared with current routinely used information storage media, DNA can be stored up to centuries (Fig.3)[2]. Grass et al developed an innovative preservation of digital information on DNA which presumably can last 1 million years in silica[3]. In conclusion, a great number of virtues make DNA an ideal archival material for information storage.


Fig.3.DNA storage as the bottom level of the storage hierarchy[2].

Biotechnology Availability

You may wonder how a piece of arbitrary information is stored into DNA molecules. On the physical reading side, it depends on the DNA sequencing technology (Fig.4) In 1977, Frederick Sanger adopted primer-extension strategy to develop the DNA sequencing method "DNA sequencing with chain-terminating inhibitors"[4], and it directly facilitated human genome project(HGP). But Sanger method is too expensive and takes too long time, such that HGP project spent 3 billion dollars over fifteen years [5]. Until next-generation sequencing (NGS), also named high-throughput sequencing, technologies generated, the cost of vast sequencing became comprehensive and really acceptable. 454 pyrosequencing [6], illumina (Solexa) sequencing [7], SOLiD sequencing [8] are three mainly applied popular methods. Meanwhile the third generation nanopore DNA sequencing has appeared [9]. Owing to technology development, sequencing cost per genome exponential decrease from 1 million dollars in 2008 to just 1 thousand now [10]. Encouragingly, the cost will keep dropping, and sequencing would become as simple as reading information from a hard disk or compact optical disk.


Fig.4.The history of DNA sequencing technology progress.

On the other hand, writing information physically into DNA molecules depends on the chemical synthesis of artificial DNA sequences. Currently, oligonucleotide synthesis is used for preparing primers, gene probes, etc. The process is implemented as solid-phase synthesis using phosphoramidite method [11] and phosphoramidite building blocks derived from protected 2'-deoxynucleosides (dA, dC, dG, and T), ribonucleosides (A, C, G, and U), or chemically modified nucleosides. (Fig.5) Unfortunately, we still cannot synthesize any DNA molecules as we wish, for example long DNA segments or some possessing special second structures or high GC-content now are still hard or impossible to synthesize chemically. Although oligonucleotide synthesis has been proposed as early as in 1955 [12], the technique is still very expensive, comparing the synthesis cost 0.04 US cent per base (add reference here.) with just 1 cent in reading 1 million bases. Therefore, it is not surprising to find the writing of one megabyte of information cost $12,400 while the reading only costs $220 in a recent DNA storage experiment [14].


Fig.5.Synthetic cycle for preparation of oligonucleotides by phosphoramidite method[13].

However, we believe as the rapid development of biotechnology the DNA synthesis will become cheap enough for the information storage usage. What remains to be solved is how to encode an arbitrary computer file into DNA sequences and decode the DNA sequencing information. There lacks a bridge between today’s electronic based information world and the future biotechnology based information world, and what we want to do is to connect them.

References


  • [1] Goldman N, Bertone P, Chen S, et al. Towards practical, high-capacity, low-maintenance information storage in synthesized DNA[J]. Nature, 2013, 494(7435): 77-80.
  • [2] Bornholt J, Lopez R, Carmean D M, et al. A DNA-based archival storage system[C]//Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems. ACM, 2016: 637-649.
  • [3] Grass R N, Heckel R, Puddu M, et al. Robust Chemical Preservation of Digital Information on DNA in Silica with Error‐Correcting Codes[J]. Angewandte Chemie International Edition, 2015, 54(8): 2552-2555.
  • [4] Sanger F, Nicklen S, Coulson A R. DNA sequencing with chain-terminating inhibitors[J]. Proceedings of the National Academy of Sciences, 1977, 74(12): 5463-5467.
  • [5] Lander E S, Linton L M, Birren B, et al. Initial sequencing and analysis of the human genome[J]. Nature, 2001, 409(6822): 860-921.
  • [6] Margulies M, Egholm M, Altman W E, et al. Genome sequencing in microfabricated high-density picolitre reactors[J]. Nature, 2005, 437(7057): 376-380.
  • [7] Bentley D R, Balasubramanian S, Swerdlow H P, et al. Accurate whole human genome sequencing using reversible terminator chemistry[J]. nature, 2008, 456(7218): 53-59.
  • [8] Valouev A, Ichikawa J, Tonthat T, et al. A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning[J]. Genome research, 2008, 18(7): 1051-1063.
  • [9] Clarke J, Wu H C, Jayasinghe L, et al. Continuous base identification for single-molecule nanopore DNA sequencing[J]. Nature nanotechnology, 2009, 4(4): 265-270.
  • [10] Kedes L, Liu E T. The Archon Genomics X PRIZE for whole human genome sequencing[J]. Nature genetics, 2010, 42(11): 917-918.
  • [11] Reese C B. Oligo-and poly-nucleotides: 50 years of chemical synthesis[J]. Organic & biomolecular chemistry, 2005, 3(21): 3851-3868.
  • [12] Michelson A M, Todd A R. Nucleotides part XXXII. Synthesis of a dithymidine dinucleotide containing a 3′: 5′-internucleotidic linkage[J]. Journal of the Chemical Society (Resumed), 1955: 2632-2638.
  • [13] https://commons.wikimedia.org/wiki/File%3AOligocycle1.png
  • [14] Goldman N, Bertone P, Chen S, et al. Towards practical, high-capacity, low-maintenance information storage in synthesized DNA[J]. Nature, 2013, 494(7435): 77-80.
CATALOGUE