Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies - 1980 to present day DNA SEQUENCING & ASSEMBLY.

Slides:



Advertisements
Similar presentations
MCB 317 Genetics and Genomics Topic 11, part 2 Genomics.
Advertisements

This presentation was originally prepared by C. William Birky, Jr. Department of Ecology and Evolutionary Biology The University of Arizona It may be used.
SEQUENCING-related topics 1. chain-termination sequencing 2. the polymerase chain reaction (PCR) 3. cycle sequencing 4. large scale sequencing stefanie.hartmann.
Recombinant DNA Introduction to Recombinant DNA technology
DNA Sequencing – “Plus and Minus” Plus –Incubate with T4 DNA Polymerase and single dNTP –T4 Polymerase degrades 3’ ends in absence of dNTP –Fractionated.
3 September, 2004 Chapter 20 Methods: Nucleic Acids.
Making, screening and analyzing cDNA clones Genomic DNA clones
Restriction Enzymes.
Emily Buckhouse. Nitrogenous Bases Nucleosides  Base linked to a 2-deoxy-D-ribose at 1’ carbon Nucleotides Nucleosides with a phosphate at 5’ carbon.
7.1 cont’d: Sanger Sequencing SBI4UP MRS. FRANKLIN.
CS 6293 Advanced Topics: Current Bioinformatics
DNA Sequencing Today, laboratories routinely sequence the order of nucleotides in DNA. DNA sequencing is done to: Confirm the identity of genes isolated.
MCB 7200: Molecular Biology
DNA Technology- Cloning, Libraries, and PCR 17 November, 2003 Text Chapter 20.
Fig 11-1 Chapter 11: recombinant DNA and related techniques.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
Analyzing your clone 1) FISH 2) “Restriction mapping” 3) Southern analysis : DNA 4) Northern analysis: RNA tells size tells which tissues or conditions.
6.3 Advanced Molecular Biological Techniques 1. Polymerase chain reaction (PCR) 2. Restriction fragment length polymorphism (RFLP) 3. DNA sequencing.
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
Recombinant DNA Technology………..
High Throughput Sequencing Methods and Concepts
Recombinant DNA Technology……….. BTEC3301. DNA Libraries How do you identify the gene of interest and clone only the DNA sequence you are interested? Read.
Technological Solutions. In 1977 Sanger et al. were able to work out the complete nucleotide sequence in a virus – (Phage 0X174) This breakthrough allowed.
DNA Cloning and PCR.
Restriction Nucleases Cut at specific recognition sequence Fragments with same cohesive ends can be joined.
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey Chapter 3 Fundamentals of Mapping and Sequencing Basic principles.
Pyromania USAFSAM “EPI Lab” MSgt Stephen Christian, USAF, MSgt, MT (ASCP) Lucinda Sinclair, GS-11 1 Distribution Statement A: Approved for Public release;
Genome sequencing Haixu Tang School of Informatics.
Bioinformatics and Sequencing Relevant to SolCAP
High Throughput Sequencing Methods and Concepts Cedric Notredame adapted from S.M Brown.
CHAPTER 7 DNA SEQUENCING - INTRODUCTION - SANGER DIDEOXY METHOD - AUTOMATED SEQUENCING - NEXT GENERATION OF SEQUENCING METHODS MISS NUR SHALENA SOFIAN.
PHYSICAL MAPPING AND POSITIONAL CLONING. Linkage mapping – Flanking markers identified – 1cM, for example Probably ~ 1 MB or more in humans Need very.
Molecular Testing and Clinical Diagnosis
Human Genome.
GENE SEQUENCING. INTRODUCTION CELL The cells contain the nucleus. The chromosomes are present within the nucleus.
6.3 Advanced Molecular Biological Techniques 1. Polymerase chain reaction (PCR) 2. Restriction fragment length polymorphism (RFLP) 3. DNA sequencing.
Chapter 10: Genetic Engineering- A Revolution in Molecular Biology.
Chemical Synthesis, Amplification, and Sequencing of DNA (Part II)
Chapter 20 DNA Technology and Genomics. Biotechnology is the manipulation of organisms or their components to make useful products. Recombinant DNA is.
Biotechnology and Genetic Engineering PBIO 450/550 Characterization of DNA clones including: Restriction Enzyme (RE) mapping Subcloning Southerns Northerns*
Semiconservative DNA replication Each strand of DNA acts as a template for synthesis of a new strand Daughter DNA contains one parental and one newly synthesized.
Chapter 5 Sequence Assembly: Assembling the Human Genome.
DNA Fingerprinting Maryam Ahmed Khan February 14, 2001.
Chapter 14 GENETIC TECHNOLOGY. A. Manipulation and Modification of DNA 1. Restriction Enzymes Recognize specific sequences of DNA (usually palindromes)
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
Topic Cloning and analyzing oxalate degrading enzymes to see if they dissolve kidney stones with Dr. VanWert.
Cse587A/Bio 5747: L2 1/19/06 1 DNA sequencing: Basic idea Background: test tube DNA synthesis DNA polymerase (a natural enzyme) extends 2-stranded DNA.
DNA Sequencing First generation techniques
Next-generation sequencing technology
DNA Sequencing Second generation techniques
Polymerase Chain Reaction
Microbial Genomes and techniques for studying them.
DNA Sequencing -sayed Mohammad Amin Nourion -A’Kia Buford
Chapter 7 Recombinant DNA Technology and Genomics
Genomics Sequencing genomes.
Next-generation sequencing technology
DNA Sequencing Techniques
6. Sequencing Genomes.
Chapter 4 Recombinant DNA Technology
Genetic Research and Biotechnology
Cloning Overview DNA can be cloned into bacterial plasmids for research or commercial applications. The recombinant plasmids can be used as a source of.
Sequencing Technologies
AMPLIFYING AND ANALYZING DNA.
SOLEXA aka: Sequencing by Synthesis
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
ULTRASEQUENCING. Next Generation Sequencing: methods and applications.
DNA and the Genome Key Area 8a Genomic Sequencing.
Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine
A Sequenciação em Análises Clínicas
SBI4U0 Biotechnology.
Presentation transcript:

Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY

“The X Prize Foundation of Playa Vista, California, is offering a $10-million prize to the first team to accurately sequence the genomes of 100 people aged 100 or older, for $1,000 or less apiece and within 30 days [beginning September 5, 2013].” see Nature 487:417, July 26, 2012 $$$ Motivation to “spur DNA sequencing technologies, boost accuracy and drive down costs” …with an accuracy of <1 error per 1 million bases

… but it stops when dideoxynucleotide is incorporated 4 parallel sets of reactions: ddATP + 4 dNTPs ddCTP + 4 dNTPs etc. Fig. 4.2 Sanger chain termination method (Fred Sanger, 1977) - enzymatic synthesis of DNA strand complementary to “template” of interest Nobel Prizes: Sanger 1958 (protein structure) 1980 (DNA sequencing)

Fig. 4.2 Ratio of ddATP:dATP important to get appropriate size range of products - set of products each terminating with ddA - their sizes reflect positions of T in template DNA

Products (each differing in length by 1 nt) resolved on denaturing polyacrylamide gels... Automated sequencing profile Autoradiograph Fig. 4.1 Fig. 4.3 or by capillary electrophoresis …

Fig. 4.5 PRIMERS FOR SEQUENCING 1. “Universal” - forward & reverse 2. Custom-designed “internal” - use new sequence info to design primer to sequence next stretch If insert is too long to completely sequence using “universal primers”, can use this strategy to close a “sequencing gap”

… or can find another clone in library that has overlap, and sequence it using “universal” primers Fig. 3.35

if particular region of genome is not represented in clone library - can use a different vector to prepare a second clone library - then use probes (eg. oligomers) mapping to ends of contigs from first library to screen second library (maybe region was unstable in first vector) What if there is a “physical gap”? Fig. 4.17

Fig Which contigs are adjacent? … or by PCR You have 9 contigs & design oligomers mapping close to their ends (#1-18) Example of closing a “physical gap” screening by hybridization

What if “physical gap” is very short? - then sequence the PCR product directly - could use oligomers mapping to ends of contigs in PCR reactions with uncloned DNA template 3’… … 5’ 5’… … 3’ < 10 kb or so - this slide also illustrates a method for finding overlapping clones

Fig repeat to “walk along” genome ASSEMBLING INFO FROM CLONES INTO CONTIGS 1. CHROMOSOME WALKING by hybridization - sequence from one clone is used as probe to screen library of clones to find overlapping one

But what if probe contains repeated sequences?  Problem avoided if use short unique-sequence probe (eg oligomer) mapping close to end of clone - so hybridizes to multiple clones … or if pre-hybridize with repeat sequence Fig. 3.34

2. CHROMOSOME WALKING by PCR Fig reactions can be carried out as pools for more rapid screening - design primer pairs based on sequence at end of clone - use other clones in library for template DNA - will get PCR amplicon for any new clones with that sequence (combinatorial screening) Fig. 4.14

Fig. 4.15A 3. CLONE FINGERPRINTING Restriction profile fingerprint To identify overlapping clones: by finding features that they share or clones having STS in common (Fig.4.15D)

Fig Haemophilus genome project 1995 (1.8 Mbp) 1. DNA sonicated, fragments (1.6 – 2 kb) cloned in plasmid vectors 2. Shotgun sequencing of insert ends ~ 20,000 clones analyzed, 11 Mbp of sequence, scaffolds with sequencing gaps & physical gaps 4. Screened for overlapping clones – reduced to 42 contigs 3. Assembled into 140 contigs 5. Assumed gaps represented genome regions unstable in plasmid vector - switched to lambda vector 6. Probed library with oligomers from contig ends or used PCR with primer pairs from contig ends

“Cost per Megabase of DNA Sequence (or Why biologists panic about computing)” “Next generation” sequencing technologies National Human Genome Research Institute - major challenge to correctly assemble the massive amount of sequence data generated… and to interpret it !

Genome Res 11:3, one dNTP is added at a time + enzyme (apyrase) that degrades dNTP if not incorporated into new strand, then next dNTP added - incorporation detected by chemiluminescence of pyrophosphate (PP i ) Fig Pyrosequencing C

Medini Nat Rev Microbiol. 6:419, DNA sheared, adaptors ligated, attached to bead & PCR amplified - beads captured in wells & pyrosequencing carried out in parallel on each DNA fragment Enzymes on beads and primer Sample preparationPyrosequencing PCR Polymerase PPi Light Genomic DNA - average read of ~ 700 (?) bp “Massively-parallel” pyrosequencing (on beads or chips) 454 technology... but “up to 1.6 million reactions can be carried out in parallel on a 6.4 cm 2 slide” “expect ~ 500 million nucleotides of sequence data per 10 hour run” (July 2010)

2. Illumina sequencing (parallel microchip) Sample preparation Sequencing by synthesis - average read of ~ bp (short-read) - add adaptors to sheared DNA, attach to chip, then PCR “bridge amplification” - denature clusters of ~ 1000 copies of DNA molecules & sequential sequencing using four fluorophore-labelled nts SOLEXA technology Medini Nat Rev Microbiol. 6:419, Output (2 × 100 bp)600 Gb300 Gb Run Time (2 × 100 bp) ~11 days~8.5 days Paired-end Reads6 Billion3 Billion Single Reads3 Billion1.5 Billion Maximum Read Length** 2 × 100 bp Bases Above Q30*** > 85% (2 x 50 bp) > 80% (2 x 100 bp) HiSeq 2000 HiSeq 1000 (Illumina website Sept. 2012)

3. Single molecule real-time sequencing (Helicos, Pacific Biosciences) Metzker Nature Reviews Genetics 11:31, continuous monitoring of nt incorporation (rather than termination as in Sanger method…) and no amplification - formation of phosphodiester bond releases fluorophore - nanoscale wells on chip so ~ one DNA polymerase molecule per well (Helicos website Sept. 2012) - read length 25 to 55 bases, Gigabases per run

Chin et al. New Eng J Med 364:33, 2011 Press release, Dec 9,2010: “PacBio & Harvard Use Fast Gene Sequencer to Crack DNA Code of Haitian Cholera Strain” H1 and H2 strains were sequenced in < 24 hr with enough “reads” to cover the genomes 60 and 32 times, respectively.