1 DNA Sequencing Achim Tresch UoC / MPIPZ Cologne treschgroup.de/OmicsModule1415.html

Slides:



Advertisements
Similar presentations
Next-Generation Sequencing: Methodology and Application
Advertisements

Next–generation DNA sequencing technologies – theory & practice
High-Throughput Sequencing Technologies
Next-generation sequencing
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
What Is Genomics? Genomics is the study of how the entire genome of a species functions as a unit and evolves over time. It is the study of life’s blueprint,
Greg Phillips Veterinary Microbiology
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Polymerase chain reaction: Starting with VERY SMALL AMOUNTS OF DNA (sometimes a few molecules), one can amplify the DNA enough to detect it by electrophoresis.
Chem 395 Bioanalytical Chemistry
CS 6293 Advanced Topics: Current Bioinformatics
Next Generation DNA Sequencing Platforms: Evolving Tools for
GENOME SEQUENCING. I. Genome sequencing The Sanger Method (1977) Denaturation +priming Polymerization.
Update on Next-Generation Sequencing
Next Now-Generation Genomics: methods and applications for modern disease research Aaron J. Mackey, Ph.D. Center for Public Health.
Automated DNA Sequencing LECTURE 7: Biotechnology; 3 Credit hours Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences and.
DNA Sequencing Today, laboratories routinely sequence the order of nucleotides in DNA. DNA sequencing is done to: Confirm the identity of genes isolated.
Sequencing Technologies and Applications at JGI
Analyzing your clone 1) FISH 2) “Restriction mapping” 3) Southern analysis : DNA 4) Northern analysis: RNA tells size tells which tissues or conditions.
High Throughput Sequencing Methods and Concepts
Announcements Lab notebooks due Monday by 5 No Ch. 9 Part 2 homework
Pyromania USAFSAM “EPI Lab” MSgt Stephen Christian, USAF, MSgt, MT (ASCP) Lucinda Sinclair, GS-11 1 Distribution Statement A: Approved for Public release;
Bioinformatics and Sequencing Relevant to SolCAP
High Throughput Sequencing Methods and Concepts Cedric Notredame adapted from S.M Brown.
CHAPTER 7 DNA SEQUENCING - INTRODUCTION - SANGER DIDEOXY METHOD - AUTOMATED SEQUENCING - NEXT GENERATION OF SEQUENCING METHODS MISS NUR SHALENA SOFIAN.
Chapter 5: Exploring Genes and Genomes Copyright © 2007 by W. H. Freeman and Company Berg Tymoczko Stryer Biochemistry Sixth Edition.
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
Next-Generation Sequencing of Microbial Genomes and Metagenomes
Molecular Biology Dr. Chaim Wachtel May 28, 2015.
Molecular Testing and Clinical Diagnosis
SEQUENCING – THE BENCHTOPS. Roche 454 Junior Same technology as 454 FLX Read length: 400 bases Paired-end 100,000 reads 12 hours (instrument time) Output.
Bioinformatics & Biotechnology Lecture 1 Sequencing BLAST PCR Gel Electrophoresis.
Ultra-High Throughput DNA Sequencing on the 454/Roche GS-FLX
6.3 Advanced Molecular Biological Techniques 1. Polymerase chain reaction (PCR) 2. Restriction fragment length polymorphism (RFLP) 3. DNA sequencing.
Sequencing tutorial Peter HANTZ EMBL Heidelberg.
Chemical Synthesis, Amplification, and Sequencing of DNA (Part II)
Sanger or Dideoxy DNA Sequencing
- DNA sequencing in the last century - Current technologies (Illumina, Ion Torrent) - New developments (PacBio, Nanopore) Topics.
핵산 염기서열 분석(DNA SEQUENCING)
Topic Cloning and analyzing oxalate degrading enzymes to see if they dissolve kidney stones with Dr. VanWert.
Introduction to Illumina Sequencing
DNA Sequencing First generation techniques
Next-generation sequencing technology
DNA sequencing DNA sequencing is the process of determining the precise order of nucleotides within a DNA molecule. It includes any method or technology.
Research Techniques Made Simple: Next-Generation Sequencing:
6.3 – Manipulating genomes
DNA Sequencing Second generation techniques
Next Generation Sequencing
Sequencing technologies
DNA Sequencing -sayed Mohammad Amin Nourion -A’Kia Buford
Next-generation sequencing technology
DNA Sequencing Techniques
NGS technologies.
Sequencing technology and assembly
Sequencing Technologies
The Human Genome Project
SOLEXA aka: Sequencing by Synthesis
B3- Olympic High School Bioinformatics
DNA Sequence Determination (Sanger)
Polymerase Chain Reaction (PCR)
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
ULTRASEQUENCING. Next Generation Sequencing: methods and applications.
Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine
Molecular Biology lecture -Putnoky
High-Throughput Sequencing Technologies
High-Throughput Sequencing Technologies
Next-generation DNA sequencing
BF nd (Next) Generation Sequencing
Standard (Sanger) sequencing
SBI4U0 Biotechnology.
Presentation transcript:

1 DNA Sequencing Achim Tresch UoC / MPIPZ Cologne treschgroup.de/OmicsModule1415.html

- DNA sequencing in the last century - Current technologies (Illumina, Ion Torrent) - New developments (PacBio, Nanopore) Topics

T Sanger sequencing - Random incorporation of blocked nucleotides  at any position, reaction stops in a small fraction of the reads TTGCACTTGAGTCGT AACGTGAACTCAGCATAGGCTCAGATAGAT A-Reaction: add dATP (elongation) and ddATP (block) Analogous: C-, G-, T-Reaction ddATP - Developed by Fred Sanger in the 70ies ( , 2*Nobel laureate: 1958 – protein structure of insulin, 1980 – sequencing of nucleic acids) - Sequencing by synthesis: DNA polymerase is synthesizing a complementray strand by adding single nucleotides TTGCACTGAGTCG AACGTGACTCAGCATAGGCTCAGATAGAT

TTGCACTTGAGTCG AACGTGAACTCAGCATAGGCTCAGATAGAT A-Reaction: TTGCA TTGCACTTGA C-Reaction: TTGC TTGCAC TTGCACTTGAGTC G-Reaction: TTG TTGCACTTG TTGCACTTGAG TTGCACTTGAGTCG T-Reaction: TT TTGCACT TTGCACTT TTGCACTTGAGT ddNTP Sanger sequencing ladder of DNA fragments  electrophoresis  sequence T G C A

GATTGATAGTTGC CTAACTATCAACGTATAGGCTCAGATAGAT G GA GAT GATT GATTG GATTGA GATTGAT GATTGATA GATTGATAG GATTGATAGT GATTGATAGTT GATTGATAGTTG GATTGATAGTTGC - labeled ddNTPS, capillary sequencing A Sanger sequencing

Pyrosequencing - immobilize DNA on beads, pyrosequencing in microreactors dTTP TTGCACTGAGTCGT AACGTGACTCAGCATAGGCTCAGATAGAT PPi ATP Oxyluciferin + light 454 technology

DNA-loaded beads + primer + polymerase + sulfurylase + luciferase flowgram TTGCACTGAGTCGT AACGTGACTCAGCAAGTCTATTCACCCAC technology Problem: homopolymers difficult to detect

increase throughput: - DNA gel electrophoresis, single genes in few days - capillary electrophoresis, 96 capillaries per machine, human genome in a few years - sequencing on microbeads: 454 technology Parallelisation & Miniaturisation

Illumina sequencing: - sequencing by synthesis - massive parallelisation and miniaturisation by self-organising DNA microarrays on a glass surface - several hundred Gb, >10 9 reads per run Illumina technology

- generate libraries - grow clusters on a flowcell - sequence by addition and imaging of blocked & fluorescence-labeled nucleotides Illumina technology

library preparation: DNA fragments Blunting by Fill-in and exonuclease Phosphorylation Addition of A-overhang Ligation to adapters Illumina technology

cluster generation: 1. flowcell Illumina technology

cluster generation: 1. flowcell 2. hybridize template Illumina technology

cluster generation: 1. flowcell 2. hybridize template 3. immobilize template Illumina technology

cluster generation: 1. flowcell 2. hybridize template 3. immobilize template 4. bridge amplification Illumina technology

cluster generation: 1. flowcell 2. hybridize template 3. immobilize template 4. bridge amplification Illumina technology

cluster generation: 1. flowcell 2. hybridize template 3. immobilize template 4. bridge amplification Illumina technology

cluster generation: 1. flowcell 2. hybridize template 3. immobilize template 4. bridge amplification Illumina technology

cluster generation: 1. flowcell 2. hybridize template 3. immobilize template 4. bridge amplification Illumina technology

cluster generation: 1. flowcell 2. hybridize template 3. immobilize template 4. bridge amplification 5. linearisation Illumina technology

cluster generation: 1. flowcell 2. hybridize template 3. immobilize template 4. bridge amplification 5. linearisation 6. cleave reverse strand Illumina technology

cluster generation: 1. flowcell 2. hybridize template 3. immobilize template 4. bridge amplification 5. linearisation 6. cleave reverse strand 7. block 3‘-ends Illumina technology

cluster generation: 1. flowcell 2. hybridize template 3. immobilize template 4. bridge amplification 5. linearisation 6. cleave reverse strand 7. block 3‘-ends 8. hybridize primer Illumina technology

Imaging & Sequencing: Illumina technology Nucleotide + fluorescent dye + terminator

reversible terminators: Illumina technology

fluorescently labelled clusters: Illumina technology

data output: Hiseq: - ca. 250 Mio reads * 8 lanes - 2*100 bp paired end-> 400 Gb / 8 days Hiseq rapid run: - ca. 200 Mio reads * 2 lanes - 2*150 bp paired end-> 120 Gb / 40 hours - (2*250 bp paired end)-> 200 Gb / 60 hours) Miseq: - ca. 25 Mio reads * 1 lane - 2*300 bp paired end-> 15 Gb / 65 hours Illumina technology

Fastq quality scores good quality quality drops towards the end 0.1 % error 1 % error Data quality of short reads

Amplification Artifacts Duplicate reads

Ion torrent: semiconductor sequencing - detect H+ release upon nucleotide incorporation by DNA polymerase Ion torrent

work flow: Ion torrent

data output: Ion Proton: - up to 80 mio reads - up to 10 Gb (200 base read length) - 4 hours runtime Ion Torrent PGM: - up to 5 mio reads - up to 2 Gb (400 base read length) - 8 hours runtime Ion torrent

homopolymer problem? Ion torrent - nonlinear increase of signal

what can we do with short reads? RNA-seq, identify transcripts, count reads per transcript  assessment of differential expression problem: reads are too short to establish connectivity of all exons, difficult/impossible to quantify multiple isoforms of a gene Sequencing Applications

Stefan Krebs, Single end: ambiguous mapping Paired end sequencing: read fragment from both ends -> resolve ambiguities Improvements: Paired end Reads

further improvements long jumping mate-pair libraries: circularize large fragment and reads junctions (2-10 kb) resolve large repeats in genome assembly Improvements: Circularization

Third generation Sequencing

- single molecule detection -several kilobases read length -moderate output ( wells) -expensive instrument and high cost per base Pacific Biosciences

Read length distribution

Pacific Biosciences Read quality

Pacific Biosciences

- DNA polymerase coupled to pore releases tags when incorpotating labeled nucleotides -tags passing through nanopore change ion current - read length = length of DNA fragment Oxford Nanopore

everything that can be converted to a DNA strand can be sequenced - even long-term data storage by encoding in synthetic DNA is possible BIOLOGICAL APPLICATIONS: sequencing of genomes, transcriptomes, population diversity, composition of microbial communities, ChIPseq, methyl-Seq, translating RNA from ribosomes,... MEDICAL APPLICATIONS: whole genome sequencing, exome sequencing, tumor diagnostics, sequencing of T-cell receptor diversity, identification of pathogens,... FORENSICS, FOOD SAFETY, ARCHEOLOGY, … Applications

Other Approaches

Summary third generation Sequencing

Acknowledgements Stefan Krebs Gene Center LMU Munich