DNA Computing: Mathematics with Molecules Russell Deaton Professor Comp. Sci. & Engr. The University of Arkansas Fayetteville, AR 72701

Slides:



Advertisements
Similar presentations
Ashish Gupta Ashish Gupta Unremarkable Problem, Remarkable Technique Operations in a DNA Computer DNA : A Unique Data Structure ! Pros.
Advertisements

DNA Computing COMP308 I believe things like DNA computing will eventually lead the way to a “molecular revolution,” which ultimately will have a very dramatic.
Replication. N N H R O CH3 O T N N R H N H O C R N N N N H H N A G R N N N O H U.
13-2 Manipulating DNA.
1 DNA Computation: The Secret of Life as Non-Living Technology Russell Deaton Professor Comp. Science & Engineering The University of Arkansas Fayetteville,
1 DNA Computing: Concept and Design Ruoya Wang April 21, 2008 MATH 8803 Final presentation.
Montek Singh COMP Nov 15,  Two different technologies ◦ TODAY: DNA as biochemical computer  DNA molecules encode data  enzymes, probes.
Additional Powerful Molecular Techniques Synthesis of cDNA (complimentary DNA) Polymerase Chain Reaction (PCR) Microarray analysis Link to Gene Therapy.
What makes DNA Computing possible? Great advances in molecular biology –PCR (Polymerase Chain Reaction) –DNA Selection by affinity –DNA Filtering –DNA.
3 September, 2004 Chapter 20 Methods: Nucleic Acids.
6 The Chemical Structure, Replication, and Manipulation of DNA.
Genomic DNA purification
DNA and Quantum Computers Russell Deaton Associate Professor Computer Science and Engineering University of Arkansas.
TOOLS OF GENETIC ENGINEERING
Advanced Molecular Biological Techniques. Polymerase Chain Reaction animation.
DNA is the carrier of genetic information in all living species The double-helix structure consists of two strands of DNA wound around each other -Each.
Molecular Biology basics. Restriction enzymes Natural enzymes made by bacteria to protect against viral and other infections Each restriction enzyme recognizes.
6.3 Advanced Molecular Biological Techniques 1. Polymerase chain reaction (PCR) 2. Restriction fragment length polymorphism (RFLP) 3. DNA sequencing.
DNA Computing on Surfaces
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
1 Genetics Faculty of Agriculture and Veterinary Medicine Instructor: Dr. Jihad Abdallah Topic 15:Recombinant DNA Technology.
1 Genetics Faculty of Agriculture Instructor: Dr. Jihad Abdallah Topic 13:Recombinant DNA Technology.
Beyond Silicon: Tackling the Unsolvable with DNA.
1 Computing with DNA L. Adelman, Scientific American, pp (Aug 1998) Note: This ppt file is based on a student presentation given in October, 1999.
Basic methods in genetics PCR; Polymerase Chain Reaction Restriction enzyme digestions Gel electrophoresis.
Manipulating DNA.
DNA Technology. Overview DNA technology makes it possible to clone genes for basic research and commercial applications DNA technology is a powerful set.
Polymerase Chain Reaction (PCR) Developed in 1983 by Kary Mullis Major breakthrough in Molecular Biology Allows for the amplification of specific DNA fragments.
Fast parallel molecular solution to the Hitting-set problem Speaker Nung-Yue Shi.
Biotechnology Chapter 17.
Introduction to DNA Computing Russell Deaton Electrical Engineering The University of Memphis Memphis, TN Stephen A. Karl Department.
Cloning and Expression of Genes
What is DNA Computing? Shin, Soo-Yong Artificial Intelligence Lab.
Success criteria - PCR By the end of this lesson we will be able to: 1. The polymerase chain reaction (PCR) is a technique for the amplification ( making.
Molecular Testing and Clinical Diagnosis
Polymerase Chain Reaction (PCR)
A PCR-based Protocol for In Vitro Selection of Non-Crosshybridizing Oligonucleotides R. Deaton, J. Chen, H. Bi, M. Garzon, H. Rubin and D. H. Wood.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
6.3 Advanced Molecular Biological Techniques 1. Polymerase chain reaction (PCR) 2. Restriction fragment length polymorphism (RFLP) 3. DNA sequencing.
Chapter 10: Genetic Engineering- A Revolution in Molecular Biology.
Advantages of STR Analysis
Chapter 20: DNA Technology and Genomics - Lots of different techniques - Many used in combination with each other - Uses information from every chapter.
A Software Tool for Generating Non-Crosshybridizing libraries of DNA Oligonucleotides Russell Deaton, junghuei Chen, hong Bi, and John A. Rose Summerized.
Introduction to DNA Computing Russell Deaton Elec. & Comp. Engr. The University of Memphis Memphis, TN Junghuei Chen Department.
DNA computing on a chip Mitsunori Ogihara and Animesh Ray Nature, 2000 발표자 : 임예니.
1 Biological Computing – DNA solution Presented by Wooyoung Kim 4/8/09 CSc 8530 Parallel Algorithms, Spring 2009 Dr. Sushil K. Prasad.
Molecular Genetic Technologies Gel Electrophoresis PCR Restriction & ligation Enzymes Recombinant plasmids and transformation DNA microarrays DNA profiling.
Chapter 20 DNA Technology and Genomics. Biotechnology is the manipulation of organisms or their components to make useful products. Recombinant DNA is.
Semiconservative DNA replication Each strand of DNA acts as a template for synthesis of a new strand Daughter DNA contains one parental and one newly synthesized.
DNA Sequencing Hunter Jones, Mitchell Gage. What’s the point? In a process similar to PCR, DNA sequencing uses a mixture of temperature changes, enzymes.
Molecular pathology Lecture 4. Definition The study of biochemical and biophysical cellular mechanisms as the basic factors in disease. anatomic pathology,
Cloning of PCR Fragment into T- Vector Jung-Min Choi Department of Biochemistry, College of Life Science and Biotechnology, Mouse Genetics and Laboratory.
Polymerase Chain Reaction
Success criteria - PCR By the end of this lesson we will be know:
DNA Technologies (Introduction)
PCR TECHNIQUE
Today’s Title: CW: DNA manipulation – separating and probing
Summarized by In-Hee Lee
COURSE OF MICROBIOLOGY
Chapter 20: DNA Technology and Genomics
PCR and RLFP’s.
Polymerase Chain Reaction & DNA Profiling
DNA Computing and Molecular Programming
Chapter 14 Bioinformatics—the study of a genome
Polymerase Chain Reaction (PCR) technique
Chapter 21 Nucleic Acids and Protein Synthesis
Recombinant DNA Unit 12 Lesson 2.
Introduction to Bioinformatics II
Russell Deaton, junghuei Chen, hong Bi, and John A. Rose
Chapter 20: DNA Technology and Genomics
Presentation transcript:

DNA Computing: Mathematics with Molecules Russell Deaton Professor Comp. Sci. & Engr. The University of Arkansas Fayetteville, AR

What is DNA Computing (DNAC) ? The use of biological molecules, primarily DNA, DNA analogs, and RNA, for computational purposes.

Why Nucleic Acids? Density (Adleman, Baum): –DNA: 1 bit per nm 3, molecules –Video: 1 bit per nm 3 Efficiency (Adleman) –DNA: ops / J –Supercomputer: 10 9 ops / J Speed (Adleman): –DNA: ops per s –Supercomputer: ops per s

What makes DNAC possible? Great advances in molecular biology –PCR (Polymerase Chain Reaction) –DNA Microarrays –New enzymes and proteins –Better understanding of biological molecules Ability to produce massive numbers of DNA molecules with specified sequence and size DNA molecules interact through template matching reactions

What is a the typical methodology? Encoding: Map problem instance onto set of biological molecules and molecular biology protocols Molecular Operations: Let molecules react to form potential solutions Extraction/Detection: Use protocols to extract result in molecular form

P HYSICAL S TRUCTURE OF DNA Nitrogenous Base 34 Å Major Groove Minor Groove Central Axis Sugar-Phosphate Backbone 20 Å 5’ C 3’ OH 3’ 0H C 5’ 5’ 3’ 5’

What is an example? “Molecular Computation of Solutions to Combinatorial Problems” Adleman, Science, v. 266, p

Algorithm Generate Random Paths through the graph. Keep only those paths that begin with v in and end with v out. If graph has n vertices, then keep only those paths that enter exactly n vertices. Keep only those paths that enter all the vertices at least once. In any paths remain, say “Yes”; otherwise, say “No”

I NTER-STRAND H YDROGEN B ONDING AdenineThymine to Sugar-Phosphate Backbone to Sugar-Phosphate Backbone (+)(-) (+)(-) Hydrogen Bond GuanineCytosine to Sugar-Phosphate Backbone to Sugar-Phosphate Backbone (-) (+) (-) (+) (-)

S TRAND H YBRIDIZATION A B a b A B a b b B a A HEAT COOL b a A B OR 100° C

DNA L IGATION  ’’ ’’ ’’ ’’ Ligase Joins 5' phosphate to 3' hydroxyl ’’ ’’  

Encoding ‘GCATGGCC ‘AGCTTAGG ‘ATGGCATG CCGGTCGA’ CCGGTACC’ ‘GCATGGCCAGCTTAGG CCGGTCGA’ ‘GCATGGCCATGGCATG CCGGTACC’ 0021

Massively Parallel Search V1V1 E 0->1 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 1->2 E 2->3 E 3->4 E 4->5 E 5->6 V6V6 E 0->6 V0V0 V3V3 E 0->3 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 3->2 E 2->3 E 3->4 E 4->5 E 5->6 V5V5 E 4->5 V4V4 V1V1 V2V2 E 5->1 E 1->2

Algorithm Generate Random Paths through the graph. Keep only those paths that begin with v in and end with v out. If graph has n vertices, then keep only those paths that enter exactly n vertices. Keep only those paths that enter all the vertices at least once. In any paths remain, say “Yes”; otherwise, say “No”

DNA Polymerase

POLYMERASE CHAIN REACTION

Start = V0, Stop = V6 V1V1 E 0->1 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 1->2 E 2->3 E 3->4 E 4->5 E 5->6 V6V6 E 0->6 V0V0 V3V3 E 0->3 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 3->2 E 2->3 E 3->4 E 4->5 E 5->6 V5V5 E 4->5 V4V4 V1V1 V2V2 E 5->1 E 1->2

Algorithm Generate Random Paths through the graph. Keep only those paths that begin with v in and end with v out. If graph has n vertices, then keep only those paths that enter exactly n vertices. Keep only those paths that enter all the vertices at least once. In any paths remain, say “Yes”; otherwise, say “No”

G EL E LECTROPHORESIS - SIZE SORTING Buffer Gel Electrode Samples Faster Slower

Right Length V1V1 E 0->1 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 1->2 E 2->3 E 3->4 E 4->5 E 5->6 V6V6 E 0->6 V0V0 V3V3 E 0->3 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 3->2 E 2->3 E 3->4 E 4->5 E 5->6

Algorithm Generate Random Paths through the graph. Keep only those paths that begin with v in and end with v out. If graph has n vertices, then keep only those paths that enter exactly n vertices. Keep only those paths that enter all the vertices at least once. In any paths remain, say “Yes”; otherwise, say “No”

A NTIBODY A FFINITY CACCATGTGAC GTGGTACACTG B PMP + Anneal CACCATGTGAC GTGGTACACTG B + CACCATGTGAC GTGGTACACTG B PMP Bind Add oligo with Biotin label Heat and cool Add Paramagnetic-Streptavidin Particles Isolate with Magnet N S

Every Vertex V1V1 E 0->1 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 1->2 E 2->3 E 3->4 E 4->5 E 5->6 V3V3 E 0->3 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 3->2 E 2->3 E 3->4 E 4->5 E 5->6

Algorithm Generate Random Paths through the graph. Keep only those paths that begin with v in and end with v out. If graph has n vertices, then keep only those paths that enter exactly n vertices. Keep only those paths that enter all the vertices at least once. In any paths remain, say “Yes”; otherwise, say “No”

Hamiltonian Path V1V1 E 0->1 V0V0 V2V2 V3V3 V4V4 V5V5 V6V6 E 1->2 E 2->3 E 3->4 E 4->5 E 5->6

Mismatches

DNA Word Design Importance of Template-Matching Hybridization Reactions in DNA Computing (DNAC) Sequence design should implement DNAC architecture. –Planned Hybridizations –Problem Size –Subsequent Processing Reactions Designed sequences should minimize unplanned “cross-hybridizations.” Consequences of Bad Designs: Errors and Poor Efficiency

DNA Word Design Design problem is hard. As number of sequences required to represent the problem increases, this constraints increasingly conflicts with the requirement of non-crosshybridization. How much of DNA sequence space is available for computation?

Why In Vitro? In Vitro Selection and Evolution PCR as tool for selection Ability to synthesis huge, random starting populations Mutagenesis Oligos manufactured under conditions for use Use massive parallelism of DNAC to solve word design problem

Protocol Outline Start with huge population of random sequences with attached primers. Anneal rapidly to quench oligos in mismatched configurations. Using temperature as a control, melt most mismatched pairs. Amplify and purify Repeat

Experimental Results

Latest Results

DNA Memories

Overview Input DNAs (Unknown Seq.) Sequences Comple- mentary to Input DNAs Memory DNA Strands (With the 3’ end Comple- mentary to the Input DNAs) New Unknown Input DNAs Learning Recall Output Tag 1 Random Probe Separates Memory DNA Strands that Match or Partially Match the New Inputs from Those That Don’t Match Labeled Tag Sequence Complements

Learning Learning: Information acquired from examples rather than programmed Protocol to store input DNAs (possibly of unknown sequence) Higher level representation of the input sequences Not individual sequence memories but whole populations Clustering of input sequences in vitro Massively random and parallel copying or sampling depending on number of inputs and probes

Base-by-Base Amplification Input DNA Tag ProbeExtension

Sampling Input DNA Tag ProbeExtension

Energy Surface Manipulation through Learning Energy Input Sequence Energy Input Sequence Before LearningAfter Learning

Tags Non-Crosshybridizing Sequences Convenient for Input/Output in absence of input sequence information Manipulate memory without input sequences Implement DNA 2 DNA Computations (Landweber and Lipton, DNA 3)

Recall Hybridization to retrieve memories Similar sequences patterns matched Pattern matching done against whole memory Single memory associated with single tags Memory composite of output on multiple tags

Experiments Test learning and recall with plasmid Test of sensitivity in concentration Test coverage of input sequence space with: –Plasmids (5k bp) –E. Coli (5M bp) Test sequence resolution of protocols

Learning Input 1 is a 3 kb linear DNA (pBluescript) Input 2 is a 5 kb linear DNA (  x 174) I1 I2 I1I2 Input DNAs M1 M2 Memory DNAs

Input 1 Input 2 Stained Gel Blot with Memory 1 (pBluescript) Blot with Memory 2 (  x 174) Plasmid inputs learned, similar sequences recalled, and dissimilar not matched. Recall

Concentration Sensitivity 1.Plasmids digested with Hpa II 2.1  g pBluescript 3.10ng - 800ng  x Blotted with  x 174 memory 5.1%  x 174 detected in background of pBluescript

Input Space Coverage Randomly digested input Learning on both inputs Blots nearly identical

E. coli 1.E. coli digested 2.219bp fragment of  x 174 added 3.Learning with and without fragment 4.Fragment distinguished when learned

Application

Team Russell Deaton, University of Arkansas, Computer Science and Engineering Junghuei Chen, University of Delaware, Chemistry and Biochemistry Hong Bi, University of Delaware, Chemistry and Biochemistry Max Garzon, University of Memphis, Computer Science Harvey Rubin, University of Pennsyvania, School of Medicine David Wood, University of Delaware, Computer and Information Science

Acknowledgement This work was supported by the NSF QuBIC program, award number EIA