HST Advisory Council Thursday 16-Nov-2004 2:00 to 2:20 PM Personal Genomes & Medicine Thanks to: Broad Inst., DARPA-BioComp, DOE-GTL, EU-MolTools, NGHRI-CEGS,

Slides:



Advertisements
Similar presentations
Chap. 6 Problem 2 Protein coding genes are grouped into the classes known as solitary (single) genes, and duplicated or diverged genes in gene families.
Advertisements

PCR Polymerase Chain Reaction Mariam Cortes Tormo Miami Children’s Hospital Research institute 2013.
Module 12 Human DNA Fingerprinting and Population Genetics p 2 + 2pq + q 2 = 1.
Thanks to: Broad Inst., DARPA-BioComp, DOE-GTL, EU-MolTools, NGHRI-CEGS, NHLBI-PGA, NIGMS-CECBSR, PhRMA, Lipper Foundation Agencourt, Ambergen, Atactic,
Signatures of Selection
Reference books: Molecular Biology of the Cell, 4th edition, by B. Alberts et al., Molecular Cell Biology, 5th edition, by H. Lodish et al., 2004.
Gene Expression Chapter 9.
Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome Jay Shendure, Gregory J. Porreca, Nikos B. Reppas, Xiaoxia Lin, John P. McCutcheon.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Thanks to: DARPA & DOE-GtL Agencourt, Ambergen, Atactic, BeyondGenomics, Caliper, Genomatica, Genovoxx, Helicos, MJR, NEN, Nimblegen, Xeotron/Invitrogen.
Molecular Genomic Imaging Center (CEGS) Harvard / Wash U George Church, Rob Mitra Greg Porreca, Jay Shendure Sequencing by Ligation on Polony Beads with.
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
The Human Genome Project and ~ 100 other genome projects:
Thanks to: DARPA BioComp DNA&RNA Polonies: Mitra, Shendure, Zhu Protein MS: Jaffe, Leptos Metabolism/Proliferation models : Segre, Vitkup, Badarinarayana.
Positional Cloning LOD Sib pairs Chromosome Region Association Study Genetics Genomics Physical Mapping/ Sequencing Candidate Gene Selection/ Polymorphism.
Thanks to: Washington U, Harvard-MIT Broad Inst., DARPA-BioSpice, DOE-GTL, EU-MolTools, NGHRI-CEGS, NHLBI-PGA, NIGMS-SysBio, PhRMA, Lipper Foundation Agencourt,
A dynamic program algorithm for haplotype block partitioning Zhang, et. al. (2002) PNAS. 99, 7335.
Thanks to: Science & Technology Expert Partnership (STEP) Program Broad Inst., DARPA, DOE-GTL, EU-MolTools, NHLBI, NGHRI-CEGS, NIGMS-SysBio, PhRMA, Lipper.
Arrays: Narrower terms include bead arrays, bead based arrays, bioarrays, bioelectronic arrays, cDNA arrays, cell arrays, DNA arrays, gene arrays, gene.
Thanks to: George Church (Harvard GTL & CEGS Centers) 5-Jan-2006 HPCGG Landsdowne 2 PM Personal Genomics meets Quantitative Proteomics NHGRI Seq Tech 2004:
Bioinformatics page 12, part of ch. 21 Cell and Mol Biol Lab.
Thanks to: Agencourt, Ambergen, Atactic, BeyondGenomics, Caliper, Genomatica, Genovoxx, Helicos, MJR, NEN, Nimblegen, ThermoFinnigan, Xeotron/Invitrogen.
Thanks to: Washington U, Harvard-MIT Broad Inst., DARPA-BioSpice, DOE-GTL, EU-MolTools, NGHRI-CEGS, NHLBI-PGA, NIGMS-SysBio, PhRMA, Lipper Foundation Agencourt,
CS 6293 Advanced Topics: Current Bioinformatics
Next Generation DNA Sequencing Platforms: Evolving Tools for
Selecting TagSNPs in Candidate Genes for Genetic Association Studies Shehnaz K. Hussain, PhD, ScM Assistant Professor Department of Epidemiology, UCLA.
Genome Variations & GWAS
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
Genetic and Molecular Epidemiology Lecture III: Molecular and Genetic Measures Jan 19, 2009 Joe Wiemels HD 274 (Mission Bay)
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
Analyzing DNA Differences PHAR 308 March 2009 Dr. Tim Bloom.
CO 10.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
Targeted next generation sequencing for population genomics and phylogenomics in Ambystomatid salamanders Eric M. O’Neill David W. Weisrock Photograph.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
CS177 Lecture 10 SNPs and Human Genetic Variation
Microfluidics for Gene Fabrication Peter Carr & David Kong MIT Media Laboratory.
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
François Ancien Sascha Kretzschmann Olivier Suplis Genotyping Errors Causes, Consequences and Solutions Genotyping Errors.
SEQUENCING – THE BENCHTOPS. Roche 454 Junior Same technology as 454 FLX Read length: 400 bases Paired-end 100,000 reads 12 hours (instrument time) Output.
ABC for the AEA Basic biological concepts for genetic epidemiology Martin Kennedy Department of Pathology Christchurch School of Medicine.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Highlights of DNA Technology. Cloning technology has many applications: Many copies of the gene are made Protein products can be produced.
GENE SEQUENCING. INTRODUCTION CELL The cells contain the nucleus. The chromosomes are present within the nucleus.
KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
Ultra-High Throughput DNA Sequencing on the 454/Roche GS-FLX
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
Chapter 10: Genetic Engineering- A Revolution in Molecular Biology.
Molecular Markers CRITFC Genetics Workshop December 8, 2015.
Simple-Sequence Length Polymorphisms SSLPs Short tandemly repeated DNA sequences that are present in variable copy numbers at a given locus. Scattered.
MCT = Molecular Colony Technique Alexander Chetverin Institute of Protein Research of the Russian Academy of Sciences References: NAR(10)2349 from 1993.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Admixture Mapping Controlled Crosses Are Often Used to Determine the Genetic Basis of Differences Between Populations. When controlled crosses are not.
Analyzing DNA using Microarray and Next Generation Sequencing (1) Background SNP Array Basic design Applications: CNV, LOH, GWAS Deep sequencing Alignment.
INTERPRETING GENETIC MUTATIONAL DATA FOR CLINICAL ONCOLOGY Ben Ho Park, M.D., Ph.D. Associate Professor of Oncology Johns Hopkins University May 2014.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Simple-Sequence Length Polymorphisms
Next-generation sequencing technology
Next generation sequencing
What is Haplotyping? T C A G
Next-generation sequencing technology
Microarray Technology and Applications
Genome Projects Maps Human Genome Mapping Human Genome Sequencing
“TaqMan genotyping Assay’’
Presentation transcript:

HST Advisory Council Thursday 16-Nov :00 to 2:20 PM Personal Genomes & Medicine Thanks to: Broad Inst., DARPA-BioComp, DOE-GTL, EU-MolTools, NGHRI-CEGS, NHLBI-PGA, NIGMS-CECBSR, PhRMA, Lipper Foundation Agencourt, Ambergen, Atactic, BeyondGenomics, Caliper, Genomatica, Genovoxx, Helicos, MJR, NEN, Nimblegen, ThermoFinnigan, Xeotron/Invitrogen For more info see: arep.med.harvard.edu

Why sequence? Cancer: mutation sets for individual clones, loss-of-heterozygosity Pathogen "weather map", biowarfare sensors RNA splicing & chromatin modification patterns. Synthetic biology & lab selections Antibodies or "aptamers" for any protein B & T-cell receptor diversity: Temporal profiling, clinical Preventative medicine & genotype–phenotype associations Cell-lineage during development Phylogenetic footprinting, biodiversity Shendure et al Nature Rev Gen 5, 335.

The idea of Common SNPs for Common Diseases has been hugely oversold. Do association studies need the added baggage of "linkage" assumptions? Should we determine genotype (haplotype) directly (at low cost) rather than infer it from population trends?

Rare Alleles / Common Diseases Even "dispensable" regions of the genome can harbor neomorphic alleles. Each of us has about 10 4 mutations since the last major population bottleneck. "-463GA, has been associated with incidence or severity of inflammatory diseases, including atherosclerosis and Alzheimer's disease, and some cancers. The polymorphism is within an Alu element " Kumar AP, et al. (2004) J Biol Chem. 279: Variable breakpoints in Burkitt lymphoma cells with chromosomal t(8;14) translocation separate c-myc and the IgH locus up to several hundred kb. Joos S, et al. (1992) Hum Mol Genet. 1: Multiple rare alleles contribute to low plasma levels of HDL cholesterol. Cohen JC et al. (2004) Science. 305:

Personal genomics & cancer therapy Mutations G719S, L858R, Del746ELREA in red. EGFR Mutations in lung cancer: correlation with clinical response to gefitinib [Iressa] therapy. Paez, … Meyerson (2004) Science 304: 1497 Dulbecco R. (1986) A turning point in cancer research: sequencing the human genome. Science 231:

Why 'single molecule' sequencing? (1) Single-cells: Preimplantation (PGD), uncultivatable (2) Co-occurrence on a molecule, complex, cell RNA splice-forms & DNA haplotypes (3) Cost: $1K-100K "personal genomes" (4) Precision: Counting 10 9 RNA tags (to reduce variance) (~5e5 RNAs per human cell) Fixed 5e3 5e4 5e6 5e9 (goal) costs EST SAGE MPSS Polony-FISSeq (polymerase colony)

CD44 Exon Combinatorics (Zhu & Shendure) Alternatively Spliced Cell Adhesion Molecule Specific variable exons are up-or-down-regulated in various cancers (>2000 papers) v6 & v7 enable direct binding to chondroitin sulfate, heparin… Zhu,J, et al. Science. 301:836-8.

Zhu J, Shendure J, Mitra RD, Church GM. Science 301: Single molecule profiling of alternative pre-mRNA splicing. Eph4 = murine mammary epithelial cell line Eph4bDD = stable transfection of Eph4 with MEK-1 (tumorigenic) CD44 RNA isoforms

Multi-locus haplotyping on pooled samples Kun Zhang Throughput = (# loci × # samples) / time

Multi-locus haplotyping NOS3 C/TC/TG/AG/AG/TG/TG/AG/AT/AT/AC/TC/TC/TC/T ~24-Kb Chr 7

Chromosome-wide haplotyping IL : A/C ~60-Mb CD : T/A Human Chr. 7 A..T A..A

Convergence on non-electrophorectic tag-sequencing methods? Tag > bp (2-ends) EST SAGE MPSS 454 Polony-Seq Ronaghi Single-molecule vs. amplified single molecule. Array vs. bead packing vs. random Rapid scans vs. long scans (chemically limited, 454) Number of immobilized primers: 0: Chetverin'97 "Molecular Colonies" 1: Mitra'99 > Agencourt "Bead Polonies" 2: Kawashima'88, Adams'97 > Lynx/Solexa: "Clusters"

Bead Polony Sequencing Pipeline In vitro libraries via paired tag manipulation Bead polonies via emulsion PCR [Dre03] Monolayered immobilization in acrylamide Enrichment of amplified beads SOFTWARE Images → Tag Sequences Tag Sequences → Genome FISSEQ or “wobble” sequencing Epifluorescence Scope with Integrated Flow Cell

Polony Fluorescent In Situ Sequencing Libraries Greg Porreca Abraham Rosenbaum 1 to 100kb Genomic M L R M PCR bead Sequencing primers Selector bead 2x20bp after MmeI ( BceAI, AcuI) Dressman et al PNAS 2003 emulsion

Cleavable dNTP-Fluorophore (& terminators) Mitra,RD, Shendure,J, Olejnik,J, Olejnik,EK, and Church,GM (2003) Fluorescent in situ Sequencing on Polymerase Colonies. Analyt. Biochem. 320:55-65 Reduce or photo- cleave

Polony- FISSeq : up to 2 billion beads/slide Cy5 primer (570nm) ; Cy3 dNTP (666nm) Jay Shendure Self Organizing Monolayer

# of bases sequenced (total)23,703,953 # bases sequenced (unique)73 Avg fold coverage324,711 X Pixels used per bead (analysis)~3.6 Read Length per primer14-15 bp Insertions 0.5% Deletions 0.7% Substitutions (raw) 4e-5 Throughput:360,000 bp/min Polony FISSeq Stats Current capillary sequencing 1400 bp/min (600X speed/cost ratio, ~$5K/1X) (This may omit: PCR, homopolymer, context errors) Shendure

Anonymity, privacy, identity Required disclosure > optional > required privacy

Non-anonymous healthy genotype-phenotype studies Are information-rich resources (e.g. facial imaging & genome sequence) really anonymous? What are the risks and benefits of "open-source"? What level of training is needed to give informed consent on open-ended studies? Harvard Medical School IRB Human Subjects protocol submitted 16-Sep-2004

.