1 Next Generation Sequencing Itai Sharon November 11th, 2009 Introduction to Bioinformatics.

Slides:



Advertisements
Similar presentations
In Silico Primer Design and Simulation for Targeted High Throughput Sequencing I519 – FALL 2010 Adam Thomas, Kanishka Jain, Tulip Nandu.
Advertisements

Next–generation DNA sequencing technologies – theory & practice
Next-generation sequencing
Next Generation Sequencing, Assembly, and Alignment Methods
Canadian Bioinformatics Workshops
What Is Genomics? Genomics is the study of how the entire genome of a species functions as a unit and evolves over time. It is the study of life’s blueprint,
Greg Phillips Veterinary Microbiology
Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome Jay Shendure, Gregory J. Porreca, Nikos B. Reppas, Xiaoxia Lin, John P. McCutcheon.
CS273a Lecture 9, Aut08, Batzoglou CS273a Lecture 9, Fall 2008 Quality of assemblies—mouse N50 contig length Terminology: N50 contig length If we sort.
Sequencing and Assembly Cont’d. CS273a Lecture 5, Aut08, Batzoglou Steps to Assemble a Genome 1. Find overlapping reads 4. Derive consensus sequence..ACGATTACAATAGGTT..
$399 Personal Genome Service $2,500 Health Compass service $985 deCODEme (November 2007) (April 2008) $350,000 Whole-genome sequencing (November 2007)
Emily Buckhouse. Nitrogenous Bases Nucleosides  Base linked to a 2-deoxy-D-ribose at 1’ carbon Nucleotides Nucleosides with a phosphate at 5’ carbon.
Delon Toh. Pitfalls of 2 nd Gen Amplification of cDNA – Artifacts – Biased coverage Short reads – Medium ~100bp for Illumina – 700bp for 454.
Department of Bioinformatics and Computational Biology
Next Generation Sequencing
CS 6293 Advanced Topics: Current Bioinformatics
GENOME SEQUENCING. I. Genome sequencing The Sanger Method (1977) Denaturation +priming Polymerization.
Update on Next-Generation Sequencing
Next generation sequencing platforms Applications
Next Now-Generation Genomics: methods and applications for modern disease research Aaron J. Mackey, Ph.D. Center for Public Health.
High-Throughput Sequencing Technologies
Molecular Biology Dr. Chaim Wachtel April 4, 2013.
DNA Sequencing Today, laboratories routinely sequence the order of nucleotides in DNA. DNA sequencing is done to: Confirm the identity of genes isolated.
Analyzing your clone 1) FISH 2) “Restriction mapping” 3) Southern analysis : DNA 4) Northern analysis: RNA tells size tells which tissues or conditions.
High Throughput Sequencing Methods and Concepts
Next Generation DNA Sequencing
Technological Solutions. In 1977 Sanger et al. were able to work out the complete nucleotide sequence in a virus – (Phage 0X174) This breakthrough allowed.
Bioinformatics Algorithms Sequence Assembly 2 Copy Right Notice Most slides in this presentation are adopted from slides of text book and various sources.
Sequence assembly using paired- end short tags Pramila Ariyaratne Genome Institute of Singapore SOC-FOS-SICS Joint Workshop on Computational Analysis of.
Ion Torrent and Minion Relatively low cost ‘next generation’ sequencing Wendy Smith School of Computing Science, Alan Ward Newcastle University, UK.
Bioinformatics and Sequencing Relevant to SolCAP
High Throughput Sequencing Methods and Concepts Cedric Notredame adapted from S.M Brown.
A Sequenciação em Análises Clínicas Polymerase Chain Reaction.
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
CHAPTER 7 DNA SEQUENCING - INTRODUCTION - SANGER DIDEOXY METHOD - AUTOMATED SEQUENCING - NEXT GENERATION OF SEQUENCING METHODS MISS NUR SHALENA SOFIAN.
Robert Arthur Kevin Lee Xing Liu Pushkar Pande Gena Tang Racchit Thapliyal Tianjun Ye.
De Novo Genome Assembly - Introduction Henrik Lantz - BILS/SciLife/Uppsala University.
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
Genome Characterization DNA sequence-ULTIMATE Map DNA sequencing-methods Assembly/sequencing BIO520 BioinformaticsJim Lund Assigned reading: Service 2006.
Problems of Genome Assembly James Yorke and Aleksey Zimin University of Maryland, College Park 1.
Taqman Technology and Its Application to Epidemiology Yuko You, M.S., Ph.D. EPI 243, May 15 th, 2008.
Researchers use genetic engineering to manipulate DNA. Section 2: DNA Technology K What I Know W What I Want to Find Out L What I Learned.
GENE SEQUENCING. INTRODUCTION CELL The cells contain the nucleus. The chromosomes are present within the nucleus.
Anna Shcherbina Bioinformatics Challenge Day 01/10/2013 De novo assembly from clinical sample This work is sponsored by the Defense Threat Reduction Agency.
De Novo Genome Assembly - Introduction
Third Generation Sequencing. Today Illumina – Solexa sequencing technology 454 Life sciences – 454 sequencer Applied Biosystem – SOLiD system Tomorrow.
Higher Human Biology Unit 1 Human Cells KEY AREA 5: Human Genomics.
핵산 염기서열 분석(DNA SEQUENCING)
Bioinformatics Algorithms Sequence Assembly. Copy Right Notice Most slides in this presentation are adopted from slides of text book and various sources.
Topic Cloning and analyzing oxalate degrading enzymes to see if they dissolve kidney stones with Dr. VanWert.
Next-generation sequencing technology
Next Generation Sequencing
Next generation sequencing
Sequencing Introduction
DNA Sequencing -sayed Mohammad Amin Nourion -A’Kia Buford
Genomics Sequencing genomes.
21.8 Recombinant DNA DNA can be used in
Next-generation sequencing technology
Very important to know the difference between the trees!
Sequencing Technologies
AMPLIFYING AND ANALYZING DNA.
SOLEXA aka: Sequencing by Synthesis
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
2nd (Next) Generation Sequencing
ULTRASEQUENCING. Next Generation Sequencing: methods and applications.
Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine
A Sequenciação em Análises Clínicas
Next-generation DNA sequencing
Standard (Sanger) sequencing
Using the DNA Sequence Knowing the sequence of an organism’s DNA allows researchers to study specific genes, to compare them with the genes of other organisms,
Presentation transcript:

1 Next Generation Sequencing Itai Sharon November 11th, 2009 Introduction to Bioinformatics

2 2010: 5K$, a few days? 2009: Illumina, Helicos 40-50K$ Sequencing the Human Genome Year Log 10 (price) : 100$, <24 hrs? 2008: ABI SOLiD 60K$, 2 weeks 2007: 454 1M$, 3 months 2001: Celera 100M$, 3 years 2001: Human Genome Project 2.7G$, 11 years

3 In this Talk: Sequencing 1.0: Sanger Assembly Next generation sequencing (NGS) NGS applications Future directions

Genome Sequencing Goal  figuring the order of nucleotides across a genome Problem  Current DNA sequencing methods can handle only short stretches of DNA at once (<1-2Kbp) Solution  Sequence and then use computers to assemble the small pieces 4

Genome Sequencing 5 5 ACGTGGTAA CGTATACAC TAGGCCATA GTAATGGCG CACCCTTAG TGGCGTATA CATA… ACGTGGTAATGGCGTATACACCCTTAGGCCATA Short fragments of DNA AC..GC TT..TC CG..CA AC..GC TG..GTTC..CC GA..GC TG..AC CT..TG GT..GC AC..GC AT..AT TT..CC AA..GC Short DNA sequences ACGTGACCGGTACTGGTAACGTACA CCTACGTGACCGGTACTGGTAACGT ACGCCTACGTGACCGGTACTGGTAA CGTATACACGTGACCGGTACTGGTA ACGTACACCTACGTGACCGGTACTG GTAACGTACGCCTACGTGACCGGTA CTGGTAACGTATACCTCT... Sequenced genome Genome

Sanger Sequencing Mix DNA with dNTPs and ddNTPs Amplify Run in Gel – Fragments migrate distance that is proportional to their size 6

Sanger Sequencing 7

Advantages  Long reads (~900bps)  Suitable for small projects Disadvantages  Low throughput  Expensive 8

Assembly 9 9 Cut DNA to larger pieces (2Kbp, 15Kbp) and sequence both ends of each piece (Fleischmann et al., 1994) contig 1 contig 2 15Kbp mates 2Kbp mates ~(length―1,000) ~500 bp resolving repeats Better assembly of contigs, gap lengths estimation

 many pieces to assemble High coverage: Assembly: How Much DNA? 10 Low coverage: A few pieces to assemble a few contigs, a few gaps many contigs, many gaps  Input Output Lander and Waterman, 1988

Sanger Sequencing : lambda virus DNA stretches up to 30-40Kbp (Sanger et al.) 1994: H. Influenzae 1.8 Mbp (Fleischmann et al.) 2001: H. Sapiens, D. Melanogaster 3 Gbp (Venter et al.) 2007: Global Ocean Sampling Expedition ~3,000 organisms, 7Gbp (Venter et al.)

12 Next Generation Sequencing: Why Now? Motivation: HGP and its derivatives, personalized medicine Short reads applications: (re-)sequencing, other methods (e.g. gene expression) Advancements in technology

13 High Parallelism is Achieved in Polony Sequencing PolonySanger

14 Generation of Polony array: DNA Beads (454, SOLiD) DNA Beads are generated using Emulsion PCR

15 Generation of Polony array: DNA Beads (454, SOLiD) DNA Beads are placed in wells

16 Generation of Polony array: Bridge-PCR (Solexa) DNA fragments are attached to array and used as PCR templates

17 Sequencing: Pyrosequencing (454) Complementary strand elongation: DNA Polymerase

18 Sequencing: Fluorescently labeled Nucleotides (Solexa) Complementary strand elongation: DNA Polymerase

19 Sequencing: Fluorescently Labeled Nucleotides (ABI SOLiD) Complementary strand elongation: DNA Ligase

20 Sequencing: Fluorescently Labeled Nucleotides (ABI SOLiD) 5 reading frames, each position is read twice

21 Single Molecule Sequencing: HeliScope Direct sequencing of DNA molecules: no amplification stage DNA fragments are attached to array Potential benefits: higher throughput, less errors

22 Technology Summary Read lengthSequencing Technology Throughput (per run) Cost (1mbp)* Sanger~800bpSanger400kbp500$ 454~400bpPolony500Mbp60$ Solexa75bpPolony20Gbp2$ SOLiD75bpPolony60Gbp2$ Helicos30-35bpSingle molecule 25Gbp1$ *Source: Shendure & Ji, Nat Biotech, 2008

23 What, When and Why Sanger: Small projects (less than 1Mbp) 454: De-novo sequencing, metagenomics Solexa, SOLiD, Heliscope: – Gene expression, protein-DNA interactions – Resequencing

24 Applications

25 Applications

26 Where Do We Go from Here? Higher throughput, longer reads (Pacific BioSciences) Computational bottleneck Shift to sequencing-based technologies Will it help to cure cancer?