Thanks to: Broad Inst., DARPA-BioComp, DOE-GTL, EU-MolTools, NGHRI-CEGS, NHLBI-PGA, NIGMS-CECBSR, PhRMA, Lipper Foundation Agencourt, Ambergen, Atactic,

Slides:



Advertisements
Similar presentations
DNA strands can be separated under conditions which break H-bonds
Advertisements

Systematic Evolution of Ligands by Exponential Enrichment: RNA Ligands to Bacteriophage T4 DNA Polymerase CRAIG TUERK AND LARRY GOLD.
LECTURE 17: RNA TRANSCRIPTION, PROCESSING, TURNOVER Levels of specific messenger RNAs can differ in different types of cells and at different times in.
Lecture 4: DNA transcription
Genetic Expression Genotype => Phenotype. DNA Functions Information Storage –sequence of bases Information Transmission –replication Information Expression.
Lecture 10 DNA Translation and Control
DARPA BAA 01-26: BIO-COMP Technical challenges and risks: “DNA computing” so far focused on computing.
Harvard MIT DOE GtL Center Collaborating PIs: Chisholm, Polz, Church, Kolter, Ausubel, Lory arep.med.harvard.edu C.Ting 7-Feb :10-4:40 PM 2-20 μm.
Reference books: Molecular Biology of the Cell, 4th edition, by B. Alberts et al., Molecular Cell Biology, 5th edition, by H. Lodish et al., 2004.
Greg Phillips Veterinary Microbiology
Harvard Med, WashU, MIT NIH-CEGS, DARPA, PhRMA, DOE-GTL Sequencing: Helicos, Ambergen, Caliper, BC-Agencourt-GTC, Synthesis: Nimblegen, Atactic/Invitrogen,
Thanks to: DARPA & DOE-GtL Agencourt, Ambergen, Atactic, BeyondGenomics, Caliper, Genomatica, Genovoxx, Helicos, MJR, NEN, Nimblegen, Xeotron/Invitrogen.
Thanks to: DOE-HGP DARPA (* Currently unfunded* ) 28-Jan-2004 DOE Synthesis of useful replicating biosystems.
Molecular Genomic Imaging Center (CEGS) Harvard / Wash U George Church, Rob Mitra Greg Porreca, Jay Shendure Sequencing by Ligation on Polony Beads with.
Thanks to: DOE GtL DARPA BioComp PhRMA NHLBI 17-Sep-2003 Virtual Conference on Genomics & Bioinformatics BioSystems Synthesis: New optima demand new technologies.
Thanks to: DARPA BioComp DNA&RNA Polonies: Mitra, Shendure, Zhu Protein MS: Jaffe, Leptos Metabolism/Proliferation models : Segre, Vitkup, Badarinarayana.
DNA Sequencing (& synthesis) Microbe-weather-maps: single cell $ 0.02/kb 30 bp microbial species then drug resistance (~1000 cells/$) Personal.
1 Library Screening, Characterization, and Amplification Screening of libraries Amplification of DNA (PCR) Analysis of DNA (Sequencing) Chemical Synthesis.
10 Genomics, Proteomics and Genetic Engineering. 2 Genomics and Proteomics The field of genomics deals with the DNA sequence, organization, function,
3 September, 2004 Chapter 20 Methods: Nucleic Acids.
Thanks to: Washington U, Harvard-MIT Broad Inst., DARPA-BioSpice, DOE-GTL, EU-MolTools, NGHRI-CEGS, NHLBI-PGA, NIGMS-SysBio, PhRMA, Lipper Foundation Agencourt,
David Goodsell. GtL Workshop B: Experimental Technology Development and Integration Tue at 2 PM Co-Chairs – George Church, Harvard Medical School Ham.
Thanks to: Science & Technology Expert Partnership (STEP) Program Broad Inst., DARPA, DOE-GTL, EU-MolTools, NHLBI, NGHRI-CEGS, NIGMS-SysBio, PhRMA, Lipper.
1 Characterization, Amplification, Expression Screening of libraries Amplification of DNA (PCR) Analysis of DNA (Sequencing) Chemical Synthesis of DNA.
Thanks to: Agencourt, Ambergen, Atactic, BeyondGenomics, Caliper, Genomatica, Genovoxx, Helicos, MJR, NEN, Nimblegen, ThermoFinnigan, Xeotron/Invitrogen.
Genetic Technologies By: Brenda, Dale, John, and Brady.
HST Advisory Council Thursday 16-Nov :00 to 2:20 PM Personal Genomes & Medicine Thanks to: Broad Inst., DARPA-BioComp, DOE-GTL, EU-MolTools, NGHRI-CEGS,
Thanks to: Washington U, Harvard-MIT Broad Inst., DARPA-BioSpice, DOE-GTL, EU-MolTools, NGHRI-CEGS, NHLBI-PGA, NIGMS-SysBio, PhRMA, Lipper Foundation Agencourt,
Manipulating the Genome: DNA Cloning and Analysis 20.1 – 20.3 Lesson 4.8.
This class: Regulation of protein activities (1) What is a protein activity? (2) How to change the rate of a specific cellular activity? (3) Rapid vs slower.
Error-Corrected DNA Synthesis Peter Carr MIT Media Laboratory.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Synthetic biology Genome engineering Chris Yellman, U. Texas CSSB.
Central Dogma & PCR B Wang Yu-Hsin.
Fine Structure and Analysis of Eukaryotic Genes
Unit 7 RNA, Protein Synthesis & Gene Expression Chapter 10-2, 10-3
Manufacture of Human Interleukin 13 Protein Using a Prokaryotic Expression System Ryan Rupp, York College of Pennsylvania, Department of Biological Sciences.
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
歐亞書局 PRINCIPLES OF BIOCHEMISTRY Chapter 9 DNA-Based Information Technologies.
Transfection. What is transfection? Broadly defined, transfection is the process of artificially introducing nucleic acids (DNA or RNA) into cells, utilizing.
DNA Cloning and PCR.
Chapter 4 Molecular Cloning Methods. Gene Cloning The Role of Restriction Endonuclease.
MIT Molecular Machines (Jacobson) Group - Next Generation DNA Synthesis HTGAA.
Recombinant Technololgy
Steady-state flux optima AB RARA x1x1 x2x2 RBRB D C Feasible flux distributions x1x1 x2x2 Max Z=3 at (x 2 =1, x 1 =0) RCRC RDRD Flux Balance Constraints:
Today Building a genome –Nucleotides, GC content and isochores –Gene structure and expression; introns –Evolution of noncoding RNAs Evolution of transcription.
19.1 Techniques of Molecular Genetics Have Revolutionized Biology
Chapter 8 Microbial Genetics part A. Life in term of Biology –Growth of organisms Metabolism is the sum of all chemical reactions that occur in living.
-Know that we can manipulate genomes by inserting or deleting certain genes. -What about synthesizing an entirely novel genome using sequencing technology?
Microfluidics for Gene Fabrication Peter Carr & David Kong MIT Media Laboratory.
Chap. 1 basic concepts of Molecular Biology Introduction to Computational Molecular Biology Chapter 1.
Chapter 12 Lecture Outline Molecular Techniques and Biotechnology.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Copyright © 2005 Pearson Education, Inc. Publishing as Benjamin Cummings THE FLOW OF GENETIC INFORMATION FROM DNA TO RNA TO PROTEIN The DNA genotype is.
1 Protein synthesis How a nucleotide sequence is translated into amino acids.
10 AM Tue 20-Feb Genomics, Computing, Economics Harvard Biophysics 101 (MIT-OCW Health Sciences & Technology 508)MIT-OCW Health Sciences & Technology 508.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
From Megabytes to Megabases Building a Fab to Enable Synthetic Biology
Some basic molecular biology Summaries of: Replication, Transcription; Translation, Hybridization, PCR Material adapted from Lodish et al, Molecular Cell.
Molecular Tools. Recombinant DNA Restriction enzymes Vectors Ligase and other enzymes.
Recombinant DNA Reverse genetics Synthesis of DNA probes Restriction enzymes, plasmids and recombinant DNA Genomic and cDNA libraries Applications.
Genomics Lecture 3 By Ms. Shumaila Azam. Proteins Proteins: large molecules composed of one or more chains of amino acids, polypeptides. Proteins are.
Methods in Cell Biology Cont. Sept. 24, Science Bomb 2 Unc-22: encodes a myofilament in C. elegans.
Next generation sequencing
Chapter 4 Recombinant DNA Technology
Relationship between Genotype and Phenotype
Recombinant DNA Technology
Nutrigenomics/pharmacogenomics
Recombinant DNA Technology
Digital Gene Expression – Tag Profiling Sample Preparation
Presentation transcript:

Thanks to: Broad Inst., DARPA-BioComp, DOE-GTL, EU-MolTools, NGHRI-CEGS, NHLBI-PGA, NIGMS-CECBSR, PhRMA, Lipper Foundation Agencourt, Ambergen, Atactic, BeyondGenomics, Caliper, Genomatica, Genovoxx, Helicos, MJR, NEN, Nimblegen, ThermoFinnigan, Xeotron/Invitrogen For more info see: arep.med.harvard.edu BU BME retreat 23-Jun :45-10:30 Seacrest, N. Falmouth, MA Optimal Combinatorial Biology & Genome Engineering

Exponential technologies Shendure J, Mitra R, Varma C, Church GM (May 2004) Advanced Sequencing Technologies: Methods & Goals. Nature Reviews of Genetics 5, ABI

Programming cells with DNA vs. Digital computers simulating cells Cells simulating digital computers Drugs & devices simulating human systems

Engineering complex systems (comparative genomics) Stedman et al. (2004) [Masticatory] Myosin gene mutation correlates with anatomical changes in the human lineage Nature 428,

DNA RNA Proteins Metabolites Replication rate Environment Biosystems Engineering Integrating Measures & Models Microbes Cancer & stem cells Darwinian optima In vitro replication Small multicellular organisms RNAi Insertions SNPs interactions

Now that we have 200 genomes, why sequence? Once per organism Phylogenetic footprinting, biodiversity RNA splicing & chromatin modification patterns. Cell-lineage during development NA "aptamers" & Ab for any protein Once per person Preventative medicine & genotype–phenotype associations Frequently Cancer: mutation sets for individual clones, loss-of-heterozygosity B & T-cell receptor diversity: Temporal profiling, clinical New & old pathogen "weather map", biowarfare sensors DNA computing & lab selections Shendure et al Nature Rev Gen 5, 335.

Why 'single molecule' sequencing? (1) Single-cell analyses, e.g. Preimplantation (PGD) (2) Co-occurrence on a molecule, complex, cell e.g. RNA splice-forms (3) Cost: $1K-100K "personal genomes" (4) Precision: Counting 10 9 RNA tags (to reduce variance) (~5e5 RNAs per human cell) Fixed 5e3 5e4 5e6 5e9 (goal) Costs EST SAGE MPSS Polony-FISSeq (polymerase colony)

Polony Fluorescent In Situ Sequencing Libraries Greg Porreca Abraham Rosenbaum 1 to 100kb Genomic L R M L R PCR bead Sequencing primers Selector bead 2x20bp after MmeI Dressman et al PNAS 2003 emulsion

Cleavable dNTP-Fluorophore (& terminators) Mitra,RD, Shendure,J, Olejnik,J, Olejnik,EK, and Church,GM (2003) Fluorescent in situ Sequencing on Polymerase Colonies. Analyt. Biochem. 320:55-65 Reduce or photo- cleave

Polony- FISSeq : up to 2 billion beads/slide White= Fe-core pixels, Cy5 primer (570nm) ; Cy3 dNTP (666nm) Jay Shendure

# of bases sequenced (total)23,703,953 # bases sequenced (unique)73 Avg fold coverage324,711 X Pixels used per bead (analysis)~3.6 Read Length per primer14-15 bp Insertions 0.5% Deletions 0.7% Substitutions (raw) 4e-5 Throughput:360,000 bp/min Polony FISSeq Stats Current capillary sequencing 1400 bp/min (600X speed/cost ratio, ~$5K/1X) (This may omit: PCR, homopolymer, context errors) Shendure

CD44 Exon Combinatorics (Zhu & Shendure) Alternatively Spliced Cell Adhesion Molecule Specific variable exons are up-or-down-regulated in various cancers (>2000 papers) v6 & v7 enable direct binding to chondroitin sulfate, heparin… Zhu,J, et al. Science. 301:836-8.

V1 V2 V3 V4 V5 V6 V7 V8 V9 V10 RNA exon examples auto- regridded & quan- titated Zhu,J, Shendure,J, Mitra, RD, Church, GM (2003) Science. 301: Single Molecule Profiling of Alternative Pre-mRNA Splicing.

Zhu J, Shendure J, Mitra RD, Church GM. Science 301: Single molecule profiling of alternative pre-mRNA splicing. Eph4 = murine mammary epithelial cell line Eph4bDD = stable transfection of Eph4 with MEK-1 (tumorigenic) CD44 RNA isoforms

DNA RNA Proteins Metabolites Replication rate Environment Biosystems Engineering Integrating Measures & Models Escherichia Darwinian optima Prochlorococcus mutant suboptimality Homo RNAi Insertions SNPs interactions

Integer Stochiometric matrix (Roche/ExPASy) Metabolic Pathways Cellular Processes

XiXi Membrane V transport V syn V deg V growth Growth: c 1 X i + c 2 X c m X m Biomass Flux ratios at each branch point yields optimal polymer composition for replication X i =const.   v j =0

AcCoA CoA ATP FAD NADH Xi = metabolites Ci = coeff. in growth reaction Biomass composition Edwards & Palsson, PNAS 2000, BMC Bioinf Optimize flow from input C,N,P to Biomass GTP Trp Leu Ala Arg Gly Cys Ser Asn Asp His CTP UTP SucCoA Val Glu Gln Phe Pro Ile Lys Met Tyr Thr dACGT

Minimization of Metabolic Adjustment (MoMA) Linear Programming (LP) to find optima, Quadratic (QP) to find closest points x,y are two of the 100s of flux dimensions Wild-type optimum Mutant optimum Mutant initially (closest point) Mutant Wild type (feasible flux polyhedra) Objective function = growth flux hyperplanes Segre, Vitkup, & Church PNAS 99:

12 C 13 C MS/NMR Flux Ratio Data

Experimental Fluxes Predicted Fluxes  pyk (LP) WT (LP) Experimental Fluxes Predicted Fluxes Experimental Fluxes Predicted Fluxes  pyk (QP)  =0.91 p=8e-8  =-0.06 p=6e-1  =0.56 p=7e-3 Flux Data C009- limited

Reproducibility of mass competition Correlation between two selection experiments Badarinarayana, et al. Nature Biotech.19: 1060

Competitive growth data  2 p-values 4x x10 -5 Position effects Novel redundancies On minimal media negative small selection effect Hypothesis: next optima are achieved by regulation of activities. LP QP

Motif Co-occurrence, comparative genomics, RNA clusters, and/or ChIP 2 -location data P= to Genome Res. 14:201–208 Bulyk, McGuire,Masuda,Church

Synthetic testing of DNA motif combinations (1.3 in  argR) RNA Ratio (motif- to wild type) for each flanking gene Bulyk, McGuire,Masuda,Church Genome Res. 14:201–208

Systems Biology Loop Synthesis / Perturbation Model Experimental design (Systematic) Data Proteasome targeting Genome Engineering

Engineering BioSystems Perturbations Action Specificity %KO "Design" Small molecules (drugs) Fast Varies Varies Hard Antibodies Fast Varies Varies Hard RNAi Slow Varies Medium OK Insertion "traps" Slow Yes Varies Random Proteasome targeting Fast Excellent Medium Easy Homologous recombination Slow Perfect Complete Easy

Programming proteasome targeting Janse, DM, Crosas,B Finley,D & Church, GM (2004) Localization to the Proteasome is Sufficient for Degradation.

Synthetic Genomes & Proteomes. Why? Test or engineer cis-DNA/RNA-elements Access to any protein (complex) including post-transcriptional modifications Affinity agents for the above. Mass spectrometry standards, protein design Utility of molecular biology DNA-RNA-Protein in vitro "kits" (e.g. PCR, SP6, Roche) Toward these goals design a chassis: 115 kbp genome. 150 genes. Nearly all 3D structures known. Comprehensive functional data.

PURE translation utility (yet room for improvement) Removing tRNA-synthetases, RNases & proteases makes feasible: Optimal mRNA structure & codon usage Lee et al J Immunol Methods. 284: Selection of scFvs specific for HBV DNA polymerase using ribosome display. Forster et al. 2003Programming peptidomimetic syntheses by translating genetic codes designed de novo. PNAS 100: Klammt et al Eur J Biochem. 271: High level cell-free expression & specific labeling of integral membrane proteins. Shimizu et al Nat Biotechnol. 19: Cell-free translation reconstituted with purified components.

in vitro genetic codes 5' mS yU eU UGG UUG CAG AAC... GUU A 3' GAAACCAUG fMTNVE | | | 5' Second base 3' U A C C U mS yU eU A C U G A Forster, et al. (2003) PNAS 100: % average yield per unnatural coupling. bK = biotinyllysine, mS = Omethylserine eU=2-amino-4-pentenoic acid yU = 2-amino-4-pentynoic acid

Mirror world : resistant to enzymes, parasites, predators L-amino acids & D-ribose (rNTPs, dNTPs) Transition: EF-Tu, peptidyl transferase, DNA-ligase D-amino acids & L-ribose (rNTPs, dNTPs) Dedkova, et al. (2003) Enhanced D-amino acid incorporation into protein by modified ribosomes. J Am Chem Soc 125,

Forster & Church Oligos for 150 & 776 synthetic genes (for E.coli minigenome & M.mobile whole genome respectively)

Up to 760K Oligos/Chip 18 Mbp for $700 raw (6-18K genes) <1K Oxamer Electrolytic acid/base 8K Atactic/Xeotron/Invitrogen Photo-Generated Acid Sheng, Zhou, Gulari, Gao (U.Houston) 24K Agilent Ink-jet standard reagents 48K Febit 100K Metrigen 380K Nimblegen Photolabile 5'protection Nuwaysir, Smith, Albert Tian, Gong, Church

Improve DNA Synthesis Cost Synthesis on chips in pools is 5000X less expensive per oligonucleotide, but amounts are low (1e6 molecules rather than usual 1e12) & bimolecular kinetics slow with square of concentration decrease!) Solution: Amplify the oligos then release them => ss-70-mer (chip) 20-mer PCR primers with restriction sites at the 50mer junctions Tian, Gong, Sheng, Zhou, Gulari, Gao, Church => ds-90-mer => ds-50-mer

Improve DNA Synthesis Accuracy via mismatch selection Tian & Church

Genome assembly Challenges: 1. Tandem, inverted and dispersed repeats (hierarchical assembly, size-selection and/or scaffolding) 2. Reduce mutations (goal <1e-6 errors) to reduce # of intermediates 3. >30 kbp homologous recombination (Nick Reppas) Stemmer et al Gene 164: Single-step assembly of a gene and entire plasmid from large numbers of oligodeoxyribonucleotides … 100*2^(n-1)

M DNA Templates RNA Transcripts All 30S-Ribosomal-protein DNAs & mRNAs synthesized in vitro s19 0.5kb 0.3kb Nimblegen Xeotron/Atactic Wild-type DNA Templates Tian, Gong, Sheng, Zhou, Gulari, Gao, Church

Improving synthesis accuracy 9-fold Method Total bp # Clones Trans- ition Trans- versionDeletionAddition Bp/error Hyb selection, PCR Gel selection, PCR No selection, ligation +PCR No selection, PCR Tian & Church

Extreme mRNA makeover for protein expression in vitro RS-2,4,5,6,9,10,12,13,15,16,17,and 21 detectable initially. RS-1, 3, 7, 8, 11, 14, 18, 19, 20 initially weak or undetectable. Solution: Iteratively resynthesize all mRNAs with less mRNA structure. Tian & Church Western blot based on His-tags

Enabling technologies Multi-Gene Assembly Protein, peptidomimetic synthesis CAD/CAM & Design for manufacturing Automated homologous recombination for E.coli & embryonic stem cells Fidelity enhancements Sequencing 10 7 bp/$ ($1K/human)

Thanks to: DOE-GTL, DARPA-BioComp, NIGMS-CECBSR, NGHRI-CEGS, PhRMA, EU-MolTools, NHLBI-PGA, Broad Inst., Lipper Foundation Agencourt, Ambergen, Atactic, BeyondGenomics, Caliper, Genomatica, Genovoxx, MJR, NEN, Nimblegen, ThermoFinnigan, Xeotron/Invitrogen For more info see: arep.med.harvard.edu BU BME retreat 23-Jun :45-10:30 Seacrest, N. Falmouth, MA Optimal Combinatorial Biology & Genome Engineering

.

Improve DNA Synthesis accuracy Synthesis on a chip pools of "construction" ~50-mers and two complementary "selection" ~26-mers (Left & Right) => ss-70-mer (chip) Tian, Gong, Sheng, Zhou, Gulari, Gao, Church => ds/ss-50-mer (amplif/restrict) => ss-56-mer (chip) 20-mer PCR primers (one biotinylated) Biotin => ss-76-mer (amplif/avidin)

Improve DNA Synthesis Accuracy via D-HPLC or MutS Smith & Modrich (1997) PNAS 94: 6847–50. Removal of polymerase-produced mutant sequences from PCR products. MutHLS Cleaves at GATC near mismatches. Lowers error rate from 6e-6 to 6e-7. Bellanne-Chantelot et al. (1997) Mutat Res. 382: Search for DNA sequence variations using a MutS-based technology. Mulligan & Tabone (2002) US Patent 6,664,112. Methods for improving the sequence fidelity of synthetic doublestranded- oligonucleotides.