Model for Evaluation of DNA Synthesis Created by: Ori Kaplan Gilad Myerson Supervised by: Gregory Linshiz, Weizmann institute Prof. Udi Shapiro, Weizmann.

Slides:



Advertisements
Similar presentations
Replication, Transcription, and Translation Before a cell can divide, the DNA in the nucleus of the cell must be duplicated. Since the DNA molecule consists.
Advertisements

PCR way of copying specific DNA fragments from small sample DNA material "molecular photocopying" It’s fast, inexpensive and simple Polymerase Chain Reaction.
Labelling probes and primers In the cases of Northern and Southern blots probes are pieces of single stranded DNA that are complimentary to the single.
Genomic DNA extraction from whole blood
Introduction to Excel 2007 Part 2: Bar Graphs and Histograms February 5, 2008.
13-2 Manipulating DNA.
DNA Synthesis. Last time: n DNA is the chemical substance that serves as the genetic material (exception: RNA in some viruses) n Today: In order to pass.
11 DNA and Its Role in Heredity. 11 The Structure of DNA DNA is a polymer of nucleotides. The four nucleotides that make up DNA differ only in their nitrogenous.
DNA. DNA is… DNA is… –Your genetic code –What tells your cells which proteins to make and when to make them –The code that makes up your genes –Located.
Nucleic Acid Design Applications Polymerase Chain Reaction (PCR) Calculating Melting Temperature (Tm) PCR Primers Design.
Genomic DNA purification
The polymerase chain reaction (PCR) rapidly
ZmqqRPISg0g&feature=player_detail page The polymerase chain reaction (PCR)
Lecture 3 Lecture 2 catch up Vector structure Copy number control
DNA Sequencing Today, laboratories routinely sequence the order of nucleotides in DNA. DNA sequencing is done to: Confirm the identity of genes isolated.
1.) DNA Extraction Follow Kit Grind sample Mix with solution and spin Bind, Wash, Elute.
Reading the Blueprint of Life
Identical twins are two individuals that are genetically identical. What does this mean? How can a sheep that is 12 years old have an identical twin who.
DNA Technology- Cloning, Libraries, and PCR 17 November, 2003 Text Chapter 20.
From Gene To Protein Chapter 17. The Connection Between Genes and Proteins Proteins - link between genotype (what DNA says) and phenotype (physical expression)
Chapter 17 Notes From Gene to Protein.
Molecular Genetics DNA Structure  Nucleotides  Consist of a five-carbon sugar, a phosphate group, and a nitrogenous base 12.1 DNA: The Genetic Material.
RNA and Protein Synthesis
RNA AND PROTEIN SYNTHESIS RNA vs DNA RNADNA 1. 5 – Carbon sugar (ribose) 5 – Carbon sugar (deoxyribose) 2. Phosphate group Phosphate group 3. Nitrogenous.
13-1 Changing the Living World
HOW TO MAKE A TIMETABLE USING GENETIC ALGORITHMS Introduction with an example.
Tools of Human Molecular Genetics. ANALYSIS OF INDIVIDUAL DNA AND RNA SEQUENCES Two fundamental obstacles to carrying out their investigations of the.
1 Chapter 2: DNA replication and applications DNA replication in the cell Polymerase chain reaction (PCR) Sequence analysis of DNA.
Tina Doss Applied Biosystems
Warm-Up #33 Answer questions #1-5 on Text page 321, Section Assessment.
19.1 Techniques of Molecular Genetics Have Revolutionized Biology
Polymerase Chain Reaction (PCR) Developed in 1983 by Kary Mullis Major breakthrough in Molecular Biology Allows for the amplification of specific DNA fragments.
DNA REPLICATION TOPIC 3.4 & 7.2. Assessment Statements Explain DNA replication in terms of unwinding the double helix and separation of the strands.
Molecular Testing and Clinical Diagnosis
March 23 & 28, Csci 2111: Data and File Structures Week 10, Lectures 1 & 2 Hashing.
March 23 & 28, Hashing. 2 What is Hashing? A Hash function is a function h(K) which transforms a key K into an address. Hashing is like indexing.
Replication (not part of transcription/translation) Before a cell can divide, the DNA in the nucleus of the cell must be duplicated. Since the DNA molecule.
Chapter 10: Genetic Engineering- A Revolution in Molecular Biology.
Some basic molecular biology Summaries of: Replication, Transcription; Translation, Hybridization, PCR Material adapted from Lodish et al, Molecular Cell.
Polymerase Chain Reaction A process used to artificially multiply a chosen piece of genetic material. May also be known as DNA amplification. One strand.
Solution of Satisfiability Problem on a Gel-Based DNA computer Ji Yoon Park Dept. of Biochem Hanyang University.
Ch. 11 DNA Structure. Chromosomes Structure  Two Components:  DNA  Protein.
Semiconservative DNA replication Each strand of DNA acts as a template for synthesis of a new strand Daughter DNA contains one parental and one newly synthesized.
RNA processing and Translation. Eukaryotic cells modify RNA after transcription (RNA processing) During RNA processing, both ends of the primary transcript.
Cloning of PCR Fragment into T- Vector Jung-Min Choi Department of Biochemistry, College of Life Science and Biotechnology, Mouse Genetics and Laboratory.
9.2 Copying DNA KEY CONCEPT The polymerase chain reaction rapidly copies segments of DNA.
DNA Replication the big event during S phase. The Animation hill.com/sites/ /student_view0/chapter14/animations.html#
Biotechnology.
Sequencing Introduction
Chapter 5 Chemical Synthesis, Sequencing, and Amplification of DNA
PCR uses polymerases to copy DNA segments.
DNA Technology.
Chapter 14 Bioinformatics—the study of a genome
Polymerase Chain Reaction (PCR) technique
The student is expected to: (6H) describe how techniques such as DNA fingerprinting, genetic modifications, and chromosomal analysis are used to study.
Replication, Transcription, and Translation
PCR uses polymerases to copy DNA segments.
PCR uses polymerases to copy DNA segments.
PCR Polymerase chain reaction (PCR)
DNA REPLICATION.
PCR uses polymerases to copy DNA segments.
Dr. Israa ayoub alwan Lec -12-
Transcription Protein Synthesis.
DNA Replication.
PCR uses polymerases to copy DNA segments.
PCR uses polymerases to copy DNA segments.
Model for Evaluation of DNA Synthesis
Using the DNA Sequence Knowing the sequence of an organism’s DNA allows researchers to study specific genes, to compare them with the genes of other organisms,
PCR uses polymerases to copy DNA segments.
Presentation transcript:

Model for Evaluation of DNA Synthesis Created by: Ori Kaplan Gilad Myerson Supervised by: Gregory Linshiz, Weizmann institute Prof. Udi Shapiro, Weizmann institute

Synthesizing DNA Currently, there are few successful ways of synthesizing DNA. Most common - Assembly PCR. Methods are costly and take much time (±3 weeks from order to delivery of a DNA strand). ABI 3900 Mer-Made6

New Approach Prof. Udi Shapiro / Gregory Linshiz: New confidential method of in-vitro DNA molecule synthesis. Goal – synthesize DNA quicker, easier and cheaper. Part of this method, involves elongation of oligonucleotides. Elongation success rate (until now) ≈ 80-90%.

New Approach Elongation of DNA includes….. Since the elongation of oligonucleotides in-vitro is done on the pattern of synthetic DNA strands, we will give a brief explanation of synthetic oligonucleotide synthesis. Oligonucleotide synthesis is a remarkably simple process that has far reaching implications. Oligonucleotide synthesis is extremely useful in laboratory procedures. It is used to make primers crucial in methods such as PCR replication. Making a custom oligonucleotide is additionally useful because they will only bind to the region of DNA that is complementary to your custom oligonucleotide sequence. This allows specific segments of DNA to be amplified. In addition, custom oligonucleotide synthesis allows other sequences, such as restriction sites, to be added on to the desired oligonucleotide. Custom oligonucleotides are generally 50 bases in length which can limit how many additional sequences can be added on to the desired primer sequence. Oligonucleotides are synthesized by using DNA Phosphoramidite Monomer Bases as building blocks. The monomer bases active sites are all chemically blocked in such a way that they can be unblocked at will by use of unblocking solutions. The oligonucleotide synthesis involves 4 stages: Stage 1: De blockingThe first base, which is attached to the solid support, is at first inactive because all the active sites have been blockedor protected. To add the next base, the DMT group protecting the 5'-hydroxyl group must be removed. This is done by adding a base. The 5’-hydroxyl group is now the only reactive group on the base monomer. This ensures that the addition of the next base will only bind to that site. Stage 2: Base condensationThe next base monomer cannot be added until it has been activated. This is achieved by adding tetrazole to the base. The active 5’-hydroxyl group of the preceding base and the newly activated phosphorus bind to loosely oin the two bases together. Stage 3: CappingThe unbound, active 5’-hydroxyl group is capped with a protective group which subsequently prohibits that strand from growing again. This is done by adding acetic anhydride and N-methylimidazole to the reaction column.Stage 4: Oxidation In order to stabilize the phosphate linkage, a solution of dilute iodine in water, pyridine, and tetrahydrofuran is added to the reaction column, oxidizing and strengthening it. Top Secret

Sequencing After the DNA synthesis procedure,sequencing the new molecules will indicate if the right molecule was synthesized. A chromatogram of DNA synthesis:

Chromatogram What does a chromatogram portray? “Clean” chromatogram – all molecules are identical “Noisy” chromatogram – inexplicit All A Some A Some T

The problem Lets assume this is the sequencing result: I.Is the experiment successful??? II.What needs to be changed in order to improve method? pH, temp, polymerase, dNTP’s, concentrations… Noise

The problem contd.. Which result is better…?

Conventional Analysis CLONE TO UNDERSTAND THE SEQUENCING Isolation cloning: Isolate single molecules  read exact sequence. Cloning several oligos gives an insight to the methods' degree of success. Theoretically, clone all in order to see if experiment was successful.

Weizmann’s request Cloning – very long, hard and expensive. Please try figure out a way to asses the degree of success “visually” using the chromatogram…

OK… אם נחייך יחשבו שאנחנו מבינים??? ננסה בכל מקרה

OK… יש לי יש לי יש לי...

A Solution ??? Lets treat the graph like LEGO © and see what we can do with the pieces…

Perfect Sequencing A C T G C A C T G A C A C G C T T A C T G C C G 10 molecules

Mutations occur “Dirty” chromatogram deletion insertion substitution

Two ways to try understand graph Sequence every single oligonucleotide (isolation cloning) Impossible Sequence every single oligonucleotide (isolation cloning) Impossible Sample sequencing and assessment of result Statistically inaccurate Sample sequencing and assessment of result Statistically inaccurate

Another Option Mathematically “Build” oligonucleotide molecules in such a way that the accumulated graph of those molecules will be identical to the chromatogram

Graph  Table A G C T If I had nucleotide long molecules – how many bases of each kind do I have in each “place”?

Table  Molecules A G C T Random procedure

Molecules  Graph

New Problem How do we choose the 100 molecules that build graph? Linear – too many options to check O(4 n )! Choose 100 from 4 n. If oligo is 100 nucleotides long  n = 100. Choose 100 molecules from 1.6*10 60 nknk 1.6* = ≈

OK… תחייך – אולי יתנו לנו 100 ננסה...

OK… יש לי יש לי יש לי...

The problem Don’t choose from all possibilities, assume that each molecule has only one mutation – Edit Distance 1 Reduced molecules:4 n  8n Select 100 molecules from 800 (instead of 1.6*10 60) OR

Still a problem How do we choose 100 molecules from 800? Linear: n! k!(n-k)! nknk 1.6* == = 3* possibilities

Genetic Algorithm

Genetic algorithm Define initial mutation rates: deletions, insertions (?), substitutions (?) Normalize graph and convert graph to matrix (4 x n). Build a molecule bank of “Edit Distance 1”.

Deletions Deletions are very easy to see and calculate… Excel – graph of deletions: y = x

Population There is a population of 100 – each entity in population represents a single result. Each result consists of 100 molecules (from the ED1 bank) that build up a graph. The population is initialized using the mutation rate. 100 One result

Evaluation function The current Evaluation function is: F(e) = ∑|M ij – R ij | In the future the function will take amount of substitutions into consideration. experiment result

Generation Generation Policy (current): Replication – Always replicate best 10. Crossover – Biased choice of entities for crossover. Mutations – i: mutate best 10. ii: randomly mutate the whole pop. Local Minimum Policy: 20 generations without improvement – shake pop.

File Handling Sequence data is initially in *.ab1 files In order to utilize data: Retranslate *ab1 file – Sequencing Analysis Convert *.ab1  *.txt – Bioedit Manage *.txt – Excel (also calculate del rate) Genetic Algorithm

No mutations - before 1

No mutations - after 1

10*del at 1, 10*del at 9 1

1

1

1

15 scattered subs - before 1

15 scattered subs 1

Setbacks ED1 – Result will never be 100% correct. Genetic Algorithm setbacks: heuristic, different final results, local min, evaluation function… No indication if results are correct. Algorithm deals with successful experiments. Data input – noise interpretation, normalized data.

Advantages New method of sequencing analysis. Potentially save many hours of isolation cloning. Mathematically – result is correct. Development potential for different areas of research.

Personal View Thrown into deep water  swam. Idea will (hopefully) be practical and useful. Learned a great deal – new programs, languages, methods. Mathematical analysis of chromatogram sequencing – ever done before???

Thank you