Thanks to: DARPA & DOE-GtL Agencourt, Ambergen, Atactic, BeyondGenomics, Caliper, Genomatica, Genovoxx, Helicos, MJR, NEN, Nimblegen, Xeotron/Invitrogen.

Slides:



Advertisements
Similar presentations
Recombinant DNA technology
Advertisements

Thanks to: Broad Inst., DARPA-BioComp, DOE-GTL, EU-MolTools, NGHRI-CEGS, NHLBI-PGA, NIGMS-CECBSR, PhRMA, Lipper Foundation Agencourt, Ambergen, Atactic,
DARPA BAA 01-26: BIO-COMP Technical challenges and risks: “DNA computing” so far focused on computing.
Low-cost, high accuracy, long-DNA synthesis technology George Church, Joe Jacobsen et al. Harvard & MIT 0. Killer Applications 1. Chip synthesis, fluidics.
Harvard MIT DOE GtL Center Collaborating PIs: Chisholm, Polz, Church, Kolter, Ausubel, Lory arep.med.harvard.edu C.Ting 7-Feb :10-4:40 PM 2-20 μm.
Reference books: Molecular Biology of the Cell, 4th edition, by B. Alberts et al., Molecular Cell Biology, 5th edition, by H. Lodish et al., 2004.
Harvard Med, WashU, MIT NIH-CEGS, DARPA, PhRMA, DOE-GTL Sequencing: Helicos, Ambergen, Caliper, BC-Agencourt-GTC, Synthesis: Nimblegen, Atactic/Invitrogen,
Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome Jay Shendure, Gregory J. Porreca, Nikos B. Reppas, Xiaoxia Lin, John P. McCutcheon.
Thanks to: DOE-HGP DARPA (* Currently unfunded* ) 28-Jan-2004 DOE Synthesis of useful replicating biosystems.
Molecular Genomic Imaging Center (CEGS) Harvard / Wash U George Church, Rob Mitra Greg Porreca, Jay Shendure Sequencing by Ligation on Polony Beads with.
Thanks to: DARPA BioComp DNA&RNA Polonies: Mitra, Shendure, Zhu Protein MS: Jaffe, Leptos Metabolism/Proliferation models : Segre, Vitkup, Badarinarayana.
1 Library Screening, Characterization, and Amplification Screening of libraries Amplification of DNA (PCR) Analysis of DNA (Sequencing) Chemical Synthesis.
Characterization, Amplification, Expression
Thanks to: Washington U, Harvard-MIT Broad Inst., DARPA-BioSpice, DOE-GTL, EU-MolTools, NGHRI-CEGS, NHLBI-PGA, NIGMS-SysBio, PhRMA, Lipper Foundation Agencourt,
Thanks to: Science & Technology Expert Partnership (STEP) Program Broad Inst., DARPA, DOE-GTL, EU-MolTools, NHLBI, NGHRI-CEGS, NIGMS-SysBio, PhRMA, Lipper.
1 Characterization, Amplification, Expression Screening of libraries Amplification of DNA (PCR) Analysis of DNA (Sequencing) Chemical Synthesis of DNA.
Thanks to: Agencourt, Ambergen, Atactic, BeyondGenomics, Caliper, Genomatica, Genovoxx, Helicos, MJR, NEN, Nimblegen, ThermoFinnigan, Xeotron/Invitrogen.
HST Advisory Council Thursday 16-Nov :00 to 2:20 PM Personal Genomes & Medicine Thanks to: Broad Inst., DARPA-BioComp, DOE-GTL, EU-MolTools, NGHRI-CEGS,
Low-cost, high accuracy, long-DNA synthesis technology George Church, Joe Jacobsen et al. Harvard & MIT 0. Killer Applications 1. Chip synthesis, fluidics.
Thanks to: Washington U, Harvard-MIT Broad Inst., DARPA-BioSpice, DOE-GTL, EU-MolTools, NGHRI-CEGS, NHLBI-PGA, NIGMS-SysBio, PhRMA, Lipper Foundation Agencourt,
CS 6293 Advanced Topics: Current Bioinformatics
Error-Corrected DNA Synthesis Peter Carr MIT Media Laboratory.
Mutation  Is a change in the genetic material.  Structural change in genomic DNA which can be transmitted from cell to it is daughter cell.  Structural.
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
AP Biology: Chapter 14 DNA Technologies
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
Data Type 1: Microarrays
DNA Cloning and PCR.
Restriction Nucleases Cut at specific recognition sequence Fragments with same cohesive ends can be joined.
Chapter 4 Molecular Cloning Methods. Gene Cloning The Role of Restriction Endonuclease.
MIT Molecular Machines (Jacobson) Group - Next Generation DNA Synthesis HTGAA.
Recombinant Technololgy
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
Literature reviews revised is due4/11 (Friday) turn in together: revised paper (with bibliography) and peer review and 1st draft.
Biotechnology.
MAPPING GENOMES – genetic, physical & cytological maps Genetic distance (in cM) 1 centimorgan = 1 map unit, corresponding to recombination frequency of.
19.1 Techniques of Molecular Genetics Have Revolutionized Biology
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
-Know that we can manipulate genomes by inserting or deleting certain genes. -What about synthesizing an entirely novel genome using sequencing technology?
Microfluidics for Gene Fabrication Peter Carr & David Kong MIT Media Laboratory.
PHYSICAL MAPPING AND POSITIONAL CLONING. Linkage mapping – Flanking markers identified – 1cM, for example Probably ~ 1 MB or more in humans Need very.
Gene expression. The information encoded in a gene is converted into a protein  The genetic information is made available to the cell Phases of gene.
Copyright © 2009 Pearson Education, Inc. Chapter 14 The Genetic Code and Transcription Copyright © 2009 Pearson Education, Inc.
KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
Chapter 10: Genetic Engineering- A Revolution in Molecular Biology.
Proteome and Gene Expression Analysis Chapter 15 & 16.
Chapter 20: DNA Technology and Genomics - Lots of different techniques - Many used in combination with each other - Uses information from every chapter.
From Megabytes to Megabases Building a Fab to Enable Synthetic Biology
Some basic molecular biology Summaries of: Replication, Transcription; Translation, Hybridization, PCR Material adapted from Lodish et al, Molecular Cell.
目录 The Principle and Application of Common Used Techniques in Molecular Biology chapter 18.
Rest of Chapter 11 Chapter 12 Genomics, Proteomics, and Transgenics Jones and Bartlett Publishers © 2005.
Recombinant DNA Reverse genetics Synthesis of DNA probes Restriction enzymes, plasmids and recombinant DNA Genomic and cDNA libraries Applications.
Chapter 14 GENETIC TECHNOLOGY. A. Manipulation and Modification of DNA 1. Restriction Enzymes Recognize specific sequences of DNA (usually palindromes)
DNA Technology & Genomics CHAPTER 20. Restriction Enzymes enzymes that cut DNA at specific locations (restriction sites) yielding restriction fragments.
Next-generation sequencing technology
Next generation sequencing
Lecture 8 A toolbox for mechanistic biologists (continued)
Next-generation sequencing technology
Chapter 4 Recombinant DNA Technology
Chapter 20: DNA Technology and Genomics
GENETIC ENGINEERING Akinniyi A. Osuntoki,Ph.D. 13/07/20181.
Relationship between Genotype and Phenotype
Recombinant DNA Technology
Recombinant DNA Technology
Molecular Biology lecture -Putnoky
Digital Gene Expression – Tag Profiling Sample Preparation
Putting Synthesis into Biology: A Viral View of Genetic Engineering through De Novo Gene and Genome Synthesis  Steffen Mueller, J. Robert Coleman, Eckard.
Putting Synthesis into Biology: A Viral View of Genetic Engineering through De Novo Gene and Genome Synthesis  Steffen Mueller, J. Robert Coleman, Eckard.
Chapter 20: DNA Technology and Genomics
Presentation transcript:

Thanks to: DARPA & DOE-GtL Agencourt, Ambergen, Atactic, BeyondGenomics, Caliper, Genomatica, Genovoxx, Helicos, MJR, NEN, Nimblegen, Xeotron/Invitrogen For more info see: arep.med.harvard.edu 1-Feb :15-10 MITRE Low-Cost/High-Accuracy Microbial Genome Synthesis and Monitoring

Synthetic - homologous recombination testing of DNA motifs (1.3 in  argR) RNA Ratio (motif- to wild type) for each flanking gene Bulyk, McGuire,Masuda,Church Genome Res. 14:201–208

Synthetic Genomes & Proteomes. Why? Test or engineer cis-DNA/RNA-elements Access to any protein (complex) including post-transcriptional modifications Affinity agents for the above. Protein design, vaccines, solubility screens Utility of molecular biology DNA -- RNA -- Protein in vitro "kits" (e.g. PCR -- T7 -- Roche) Toward these goals design a chassis: 115 kbp genome. 150 genes. Nearly all 3D structures known. Comprehensive functional data.

(PURE) translation utility Removing tRNA-synthetases, translational release-factors, RNases & proteases Selection of scFvs[antibodies] specific for HBV DNA polymerase using ribosome display. Lee et al J Immunol Methods. 284:147 Programming peptidomimetic syntheses by translating genetic codes designed de novo. Forster et al PNAS 100:6353 High level cell-free expression & specific labeling of integral membrane proteins. Klammt et al Eur J Biochem 271:568 Cell-free translation reconstituted with purified components. Shimizu et al Nat Biotechnol. 19:751-5.

in vitro genetic codes 5' mS yU eU UGG UUG CAG AAC... GUU A 3' GAAACCAUG fMTNVE | | | 5' Second base 3' U A C C U mS yU eU A C U G A Forster, et al. (2003) PNAS 100: % average yield per unnatural coupling. eU = 2-amino-4-pentenoic acid yU = 2-amino-4-pentynoic acid mS = O-methylserine gS = O-GlcNAc–serine bK = biotinyl-lysine

Forster & Church Oligos for 150 & 776 synthetic genes (for E.coli minigenome & M.mobile whole genome respectively)

Up to 760K Oligos/Chip 18 Mbp for $700 raw (6-18K genes) <1K Oxamer Electrolytic acid/base 8K Atactic/Xeotron/Invitrogen Photo-Generated Acid Sheng, Zhou, Gulari, Gao (U.Houston) 24K Agilent Ink-jet standard reagents 48K Febit 100K Metrigen 380K Nimblegen Photolabile 5'protection Nuwaysir, Smith, Albert Tian, Gong, Church

Improve DNA Synthesis Cost Synthesis on chips in pools is 5000X less expensive per oligonucleotide, but amounts are low (1e6 molecules rather than usual 1e12) & bimolecular kinetics slow with square of concentration decrease!) Solution: Amplify the oligos then release them => ss-70-mer (chip) 20-mer PCR primers with restriction sites at the 50mer junctions Tian, Gong, Sheng, Zhou, Gulari, Gao, Church => ds-90-mer => ds-50-mer

Improve DNA Synthesis Accuracy via mismatch selection Tian & Church Other mismatch methods: MutS (&H,L)

Genome assembly Moving forward: 1. Tandem, inverted and dispersed repeats (hierarchical assembly, size-selection and/or scaffolding) 2. Reduce mutations (goal <1e-6 errors) to reduce # of intermediates 3. 15kb to 5Mb by homologous recombination (Nick Reppas) 4. Phage integrase site-specific recombination, also for counters. Stemmer et al Gene 164:49-53;Mullis 1986 CSHSQB … 100*2^(n-1)

All 30S-Ribosomal-protein DNAs (codon re-optimized) Tian, Gong, Sheng, Zhou, Gulari, Gao, Church 1.7 kb 0.3 kb s19 0.3kb Nimblegen 95K chip Atactic <4K chip

Improving synthesis accuracy Method Bp/error Chip assembly only 160 Hybridization-selection 1,400 MutS-gel-shift 10,000 MutHLS cleavage 30,000 (10X better than PCR) Tian & Church 2004 Carr & Jacobson 2004 Smith & Modrich 1997

Extreme mRNA makeover for protein expression in vitro RS-2,4,5,6,9,10,12,13,15,16,17,and 21 detectable initially. RS-1, 3, 7, 8, 11, 14, 18, 19, 20 initially weak or undetectable. Solution: Iteratively resynthesize all mRNAs with less mRNA structure. Tian & Church Western blot based on His-tags

Safety Proposals Church, G.M. A synthetic biohazard non-proliferation proposal. (2004) ( Monitor oligo synthesis via expansion of Controlled substances, Select Agents, &/or Recombinant DNA 2. Computational tools for the above 3. System modeling checks for synthetic biology projects 4. Multi-auxotroph, novel genetic code for the host genome, prevents functional transfer of DNA to other cells.

Why sequence? Synthetic biology & laboratory selections Pathogen "weather map", biowarfare sensors Cancer: mutation sets for individual clones, loss-of-heterozygosity RNA splicing & chromatin modification patterns. Antibodies or "aptamers" for any protein B & T-cell receptor diversity: Temporal profiling, clinical Preventative medicine & genotype–phenotype associations Cell-lineage during development Phylogenetic footprinting, biodiversity Shendure et al Nature Rev Gen 5, 335.

Personal genomics & cancer therapy Mutations G719S, L858R, Del746ELREA in red. EGFR Mutations in lung cancer: correlation with clinical response to gefitinib [Iressa] therapy. Paez, … Meyerson (Apr 2004) Science 304: 1497 Lynch … Haber, N Engl J Med. (Apr 2004) 350:2129. Pao.. Mardis,Wilson,Varmus H, PNAS (Aug 2004) 101: Dulbecco R. (1986) A turning point in cancer research: sequencing the human genome. Science 231:

Why 'single molecule' sequencing? (1) Single-cells: Preimplantation (PGD), uncultivatable (2) Co-occurrence on a molecule, complex, cell RNA splice-forms & DNA haplotypes (3) Cost: $1K-100K "personal genomes" (4) Precision: Counting 10 9 RNA tags (to reduce variance) (~5e5 RNAs per human cell) Fixed 5e3 5e4 5e6 5e9 (goal) costs EST SAGE MPSS Polony-FISSeq (polymerase colony)

CD44 Exon Combinatorics (Zhu & Shendure) Alternatively Spliced Cell Adhesion Molecule Specific variable exons are up-or-down-regulated in various cancers (>2000 papers) v6 & v7 enable direct binding to chondroitin sulfate, heparin… Zhu,J, et al. Science. 301:836-8.

Zhu J, Shendure J, Mitra RD, Church GM. Science 301: Single molecule profiling of alternative pre-mRNA splicing. Eph4 = murine mammary epithelial cell line Eph4bDD = stable transfection of Eph4 with MEK-1 (tumorigenic) CD44 RNA isoforms

Chromosome-wide haplotyping IL : A 60-Mb CD : A /T Human Chr. 7 A..A A..T Mb

Convergence on non-electrophorectic tag-sequencing methods? Tag > bp (2-ends) EST SAGE MPSS 454 Polony-Seq Ronaghi Single-molecule vs. amplified single molecule. Array vs. bead packing vs. random Rapid scans vs. long scans (chemically limited, 454) Number of immobilized primers: 0: Chetverin'97 "Molecular Colonies" 1: Mitra'99 > Agencourt "Bead Polonies" 2: Kawashima'88, Adams'97 > Lynx/Solexa: "Clusters"

Bead Polony Sequencing Pipeline In vitro libraries via paired tag manipulation Bead polonies via emulsion PCR [Dre03] Monolayered immobilization in acrylamide Enrichment of amplified beads SOFTWARE Images → Tag Sequences Tag Sequences → Genome FISSEQ or “wobble” sequencing Epifluorescence Scope with Integrated Flow Cell

Polony Fluorescent In Situ Sequencing Libraries Greg Porreca Abraham Rosenbaum 1 to 100kb Genomic M L R M PCR bead Sequencing primers Selector bead 2x20bp after MmeI ( BceAI, AcuI) Dressman et al PNAS 2003 emulsion

Cleavable dNTP-Fluorophore (& terminators) Mitra,RD, Shendure,J, Olejnik,J, Olejnik,EK, and Church,GM (2003) Fluorescent in situ Sequencing on Polymerase Colonies. Analyt. Biochem. 320:55-65 Reduce or photo- cleave

Polony- FISSeq : up to 2 billion beads/slide Cy5 primer (570nm) ; Cy3 dNTP (666nm) Jay Shendure Self Organizing Monolayer

# of bases sequenced (total)23,703,953 # bases sequenced (unique)73 Avg fold coverage324,711 X Pixels used per bead (analysis)~3.6 Read Length per primer14-15 bp Insertions 0.5% Deletions 0.7% Substitutions (raw) 4e-5 Throughput:360,000 bp/min Polony FISSeq Stats Current capillary sequencing 1400 bp/min (600X speed/cost ratio, ~$5K/1X) (This may omit: PCR, homopolymer, context errors) Shendure

High accuracy special case: homopolymers (e.g. AAA, CC, etc.) Use "compressed" tags, ACG = ACCG=ACCCG Quantitate incorporation Reversible terminators FRET between adjacent 3' bases Wobble sequencing All five of these work. Maintenance of amplification fidelity using linear amplification from initial genomic fragment

Degenerate (aka “wobble) sequencing “single tipped” vs “double tipped” length of anchoring sequence natural vs. universal nucleotides (i.e. deoxyinosine) single fluor vs. four-color fluor mixtures of dNTPs for extensions Sequenase vs Klenow vs BST Exonuclease stripping vs heat stripping CTAGCGAGCTAGNNNNNNNNA CTAGCGAGCTAGNNNNNNNNG CTAGCGAGCTAGNNNNNNNNC CTAGCGAGCTAGNNNNNNNNT anchordegenerate “tip”

Wobble vs Simple base-extension 1/4 vs 2.5/4 base/cycle >8 vs base reads 3e-3 vs 4e-5 non-homopolymer errors 3e-3 vs 1e-1 homopolymer errors 40' per cycle, 60 hr per 20 cycles

Sequencing single molecules Ecosystem studies need single-cell amplification because of multiple chromosomes (& RNAs) per cell. Many cells are hard to grow. Microbes exchange genome subsets. (Even an 80% genome coverage is better than 100 kb BACs) Many input molecules required to sequence one molecule. vs. one molecule sufficient to sequence via many copies of it.

Single cell sequencing  29 real-time amplification No template control Affymetrix quantitation of independent amplifications

.