NEXT – GEN SEQUENCING TECHNIQUES

Slides:



Advertisements
Similar presentations
Research Techniques Made Simple: Polymerase Chain Reaction
Advertisements

Next–generation DNA sequencing technologies – theory & practice
Recombinant DNA technology
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
What Is Genomics? Genomics is the study of how the entire genome of a species functions as a unit and evolves over time. It is the study of life’s blueprint,
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Gene Cloning Techniques for gene cloning enable scientists to prepare multiple identical copies of gene-sized pieces of DNA. Most methods for cloning pieces.
CS 6293 Advanced Topics: Current Bioinformatics
Update on Next-Generation Sequencing
Next generation sequencing platforms Applications
The impact of next-generation sequencing technology of genetics Elaine R. Mardis – 11 February Washington School of Medicine, Genome Sequencing Center.
Biotechnology and Recombinant DNA
Objective 2: TSWBAT describe the basic process of genetic engineering and the applications of it.
Finishing the Human Genome
 It is the methods scientist use to study and manipulate DNA.  It made it possible for researchers to genetically alter organisms to give them more.
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
歐亞書局 PRINCIPLES OF BIOCHEMISTRY Chapter 9 DNA-Based Information Technologies.
High Throughput Sequencing Methods and Concepts
Restriction Nucleases Cut at specific recognition sequence Fragments with same cohesive ends can be joined.
 It is the methods scientist use to study and manipulate DNA.  It made it possible for researchers to genetically alter organisms to give them more.
Section 2 Genetics and Biotechnology DNA Technology
High Throughput Sequencing Methods and Concepts Cedric Notredame adapted from S.M Brown.
NIS - BIOLOGY Lecture 57 – Lecture 58 DNA Technology Ozgur Unal 1.
Tools of Human Molecular Genetics. ANALYSIS OF INDIVIDUAL DNA AND RNA SEQUENCES Two fundamental obstacles to carrying out their investigations of the.
1 Chapter 2: DNA replication and applications DNA replication in the cell Polymerase chain reaction (PCR) Sequence analysis of DNA.
Genetics 6: Techniques for Producing and Analyzing DNA.
19.1 Techniques of Molecular Genetics Have Revolutionized Biology
CHAPTER 7 DNA SEQUENCING - INTRODUCTION - SANGER DIDEOXY METHOD - AUTOMATED SEQUENCING - NEXT GENERATION OF SEQUENCING METHODS MISS NUR SHALENA SOFIAN.
Polymerase Chain Reaction (PCR) Developed in 1983 by Kary Mullis Major breakthrough in Molecular Biology Allows for the amplification of specific DNA fragments.
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
How will new sequencing technologies enable the HMP? Elaine Mardis, Ph.D. Associate Professor of Genetics Co-Director, Genome Sequencing Center Washington.
Error model for massively parallel (454) DNA sequencing Sriram Raghuraman (working with Haixu Tang and Justin Choi)
PHARMACOBIOTECHNOLOGY.  Recombinant DNA (rDNA) is constructed outside the living cell using enzymes called “restriction enzymes” to cut DNA at specific.
Cloning and Expression of Genes
PHYSICAL MAPPING AND POSITIONAL CLONING. Linkage mapping – Flanking markers identified – 1cM, for example Probably ~ 1 MB or more in humans Need very.
Molecular Testing and Clinical Diagnosis
GENE SEQUENCING. INTRODUCTION CELL The cells contain the nucleus. The chromosomes are present within the nucleus.
KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
Ultra-High Throughput DNA Sequencing on the 454/Roche GS-FLX
The polymerase chain reaction
Chapter 10: Genetic Engineering- A Revolution in Molecular Biology.
Chemical Synthesis, Amplification, and Sequencing of DNA (Part II)
Recombinant DNA Technology. DNA replication refers to the scientific process in which a specific sequence of DNA is replicated in vitro, to produce multiple.
PCR With PCR it is possible to amplify a single piece of DNA, or a very small number of pieces of DNA, over many cycles, generating millions of copies.
Green with envy?? Jelly fish “GFP” Transformed vertebrates.
CAMPBELL BIOLOGY Reece Urry Cain Wasserman Minorsky Jackson © 2014 Pearson Education, Inc. TENTH EDITION CAMPBELL BIOLOGY Reece Urry Cain Wasserman Minorsky.
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
Introduction to Illumina Sequencing
Cse587A/Bio 5747: L2 1/19/06 1 DNA sequencing: Basic idea Background: test tube DNA synthesis DNA polymerase (a natural enzyme) extends 2-stranded DNA.
DNA Sequencing First generation techniques
Next-generation sequencing technology
Research Techniques Made Simple: Polymerase Chain Reaction
DNA Sequencing Second generation techniques
Part 3 Gene Technology & Medicine
Next generation sequencing
Gene Cloning Techniques for gene cloning enable scientists to prepare multiple identical copies of gene-sized pieces of DNA. Most methods for cloning pieces.
DNA Sequencing -sayed Mohammad Amin Nourion -A’Kia Buford
DNA Technologies (Introduction)
Next-generation sequencing technology
copying & sequencing DNA
COURSE OF MICROBIOLOGY
Sequencing Technologies
SOLEXA aka: Sequencing by Synthesis
Chapter 14 Bioinformatics—the study of a genome
CISC 667 Intro to Bioinformatics (Spring 2007) Molecular Biology Tools
Recombinant DNA Technology
ULTRASEQUENCING. Next Generation Sequencing: methods and applications.
The impact of next-generation sequencing technology on genetics
Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine
Standard (Sanger) sequencing
Presentation transcript:

NEXT – GEN SEQUENCING TECHNIQUES

INTRODUCTION An astounding potential exists for these technologies to bring enormous change in genetic and biological research These technologies are not only changing our genome sequencing approaches and the associated timelines and costs, but also accelerating and altering a wide variety of types of biological inquiry that have historically used a sequencing-based readout, or effecting a transition to this type of read- out. Furthermore, next-generation platforms are helping to open entirely new areas of biological inquiry, including the investigation of ancient genomes, the characterization of ecological diversity, and the identification of unknown etiologic agents.

The most commonly used technology until a few years ago – BAC each BAC clone is amplified in bacterial culture, isolated in large quantities, sheared to produce size- selected pieces of approximately 2−3 kb. These pieces are subcloned into plasmid vectors, amplified in bacterial culture the DNA is selectively isolated prior to sequencing. By generating approximately eightfold coverage of each BAC clone in plasmid subclone equivalents, computer-aided assembly can largely recreate the BAC insert sequence in contiguous stretches of assembled sequence reads. Refinement, including gap closure and sequence quality improvement

WHOLE GENOME SEQUENCING Genomic DNA sheared directly into several distinct size classes Placed into plasmid and subclones. Oversampling the ends of these subclones generates paired-end sequencing reads Provides the necessary linking information

ADVANTAGES OF WGS Rapid Readily Highly polymorphic or highly repetitive genomes remain quite fragmented after assembly.

Utility of next – gen sequence reads The length of a sequence read from all current next- generation platforms is much shorter than that from a capillary sequencer Each next- generation read type has a unique error model different from that already established for capillary sequence reads. Both differences affect how the reads are utilized in bioinformatic analyses, depending upon the application.

IMPACT ON OTHER FIELDS The massively parallel scale of sequencing implies a similarly massive scale of computational analyses that include image analysis, signal processing, background subtraction, base calling, and quality assessment to produce the final sequence reads for each run. In every case, these analyses place significant demands on the information technology (IT), computational, data storage, and laboratory information management system (LIMS) This aspect of next-generation sequencing is at present complicated by the dearth of current sequence analysis tools suited to shorter sequence read data; existing data analysis pipelines and algorithms must be modified to accommodate these shorter reads. In many cases, entirely new algorithms and data visualization interfaces are being devised and tested to meet this new demand.

The next-generation platforms are effecting a complete paradigm shift, not only in the organization of large-scale data production, but also in the downstream bioinformatics, IT, and LIMS support required for high data utility and correct interpretation. This paradigm shift promises to radically alter the path of biological inquiry.

NEXT – GEN TECHNOLOGIES Roche/454 FLX Illumina/ Solexa Genome Analyzer Applied Biosystems SOLiDTM Helicos HeliscopeTM Pacific Biosciences SMRT By different approaches, each technology seeks to amplify single strands of a fragment library and perform sequencing reactions on the amplified strands. The fragment libraries are obtained by annealing platform-specific linkers to blunt-ended fragments generated directly from a genome or DNA source of interest. selectively amplified by PCR - No bacterial cloning step is required to amplify the genomic fragment in a bacterial Both the Helicos and Pacific Biosystems instruments are so-called “single molecule” sequencers and do not require any amplification of DNA fragments prior to sequencing.

Roche/454 FLX Pyrosequencer Incorporation of nucleotide by DNA polymerase Release of pyrophosphate Initiates series of downstream reactions Produce light by firefly enzyme luciferase Directly proportional

Library preparation The genome is fragmented by nebulization. The library fragments are mixed with a population of agarose beads whose surfaces carry oligonucleotides Each bead is associated with a single fragment. There is no cloning and hence no colony picking. The library is created with adaptors. Selection – A/B fragments are isolated using avidin – biotin purification. Time – 4.5 hours

Emulsion PCR Each of these fragment:bead complexes is isolated into individual oil:water micelles that also contain PCR reactants Clonal amplification occurs inside microreactors. Break microreactors and enrich DNA – positive beads. Thermal cycling of the micelles produces approximately one million copies of each DNA fragment on the surface of each bead. These amplified single molecules are then sequenced.

Sequencing First the beads are arrayed into a picotiter plate that holds a single bead in each of several hundred thousand single wells, This provides a fixed location at which each sequencing reaction can be monitored. Enzyme- containing beads that catalyze the downstream pyrosequencing reaction steps are then added to the PTP and the mixture is centrifuged to surround the agarose beads. On instrument, the PTP acts as a flow cell into which each pure nucleotide solution is introduced in a step- wise fashion, with an imaging step after each nucleotide incorporation step. The PTP is seated opposite a CCD camera that records the light emitted at each bead. The first four nucleotides (TCGA) on the adapter fragment adjacent to the sequencing primer added in library construction correspond to the sequential flow of nucleotides into the flow cell. Problem - However, the calibrated base calling cannot properly interpret long stretches (>6) of the same nucleotide so these areas are prone to base insertion and deletion errors during base calling. Advantage – Because each incorporation step is nucleotide specific, substitution errors are rarely encountered in Roche/454 sequence reads.

FLX reads are of sufficient length to assemble small genomes such as bacterial and viral genomes to high quality and contiguity. The lack of a bacterial cloning step in the Roche/454 process means that sequences not typically sampled in a WGS approach owing to cloning bias will be more likely represented in a FLX data set, which contributes to more comprehensive genome coverage.

Illumina Genome Analyzer Process starts with Illumina-specific adapter library  takes place on the oligo-derivatized surface of a flow cell,  performed by an automated device called a Cluster Station. The flow cell is an 8-channel sealed glass microfabricated device that allows bridge amplification (Allows the generation of in situ copies of a specific DNA molecule on an oligo-decorated solid support) of fragments on its surface, and uses DNA polymerase to produce multiple DNA copies, or clusters, that each represent the single molecule that initiated the cluster amplification. A separate library can be added to each of the eight channels, or the same library can be used in all eight, or combinations thereof. Each cluster contains approximately one million copies of the original fragment, which is sufficient for reporting incorporated bases at the required signal intensity for detection during sequencing.

Sequencing- by-synthesis approach All four nucleotides are added simultaneously to the flow cell channels, with DNA polymerase Incorporate into oligoprimed cluster fragments. Specifically, the nucleotides carry a base-unique fluorescent label and the 3’-OH group is chemically blocked Imaging Step – Each flow cell lane is imaged in three 100-tile segments by the instrument optics. the 3’ - OH blocking group is chemically removed to prepare each strand for the next incorporation by DNA polymerase. Continues for no. of cycles. Advantage - A base-calling algorithm assigns sequences and associated quality values to each read and a quality checking pipeline evaluates the Illumina data from each run, removing poor-quality sequences.

Illumina Steps Prepare genomic DNA sample - Randomly fragment genomic DNA and ligate adapters to both ends of the fragments. Attach DNA to surface - Bind single-stranded fragments randomly to the inside surface of the flow cell channels. Bridge amplification- Add unlabeled nucleotides and enzyme to initiate solid- phase bridge amplification. Denature the double stranded molecules

First chemistry cycle : determine first base - To initiate the first sequencing cycle, add all four labeled reversible terminators, primers, and DNA polymerase enzyme to the flow cell. Image of first chemistry cycle - After laser excitation, capture the image of emitted fluorescence from each cluster on the flow cell. Record the identity of the first base for each cluster. Before initiating the next chemistry cycle - The blocked 3' terminus and the fluorophore from each incorporated base are removed. Sequence read over multiple chemistry cycles - Repeat cycles of sequencing to determine the sequence of bases in a given fragment a single base at a time.

Applied Biosystems SOLiDTM Sequencer The SOLiD platform uses an adapter-ligated fragment library Emulsion PCR approach with small magnetic beads to amplify the fragments for sequencing. SOLiD uses DNA ligase and a unique approach to sequence the amplified fragments. Two flow cells are processed per instrument run, each of which can be divided to contain different libraries in up to four quadrants. Read lengths for SOLiD are user defined between 25–35 bp, and each sequencing run yields between 2–4 Gb of DNA sequence data. The reads are aligned to a reference genome to enable a second tier of quality evaluation called two-base encoding.

Steps Prime and ligate Image Cap unextended strands Cleave off flour Repeat steps to extend sequence Primer reset Repeat steps with new primer Repeat Reset with n-2, n-3, n-4 primers

Data Collection and Image Analysis Collect color image Identify beads Identify bead color Because each fluorescent group on a ligated polymer identifies a two-base combination, the resulting sequence reads can be screened for base-calling errors versus true polymorphisms versus single base deletions by aligning the individual reads to a known high-quality reference sequence.

FUTURE POSSIBLITIES UNIVERSAL READ OUT To characterize strains or isolates relative to high-quality reference genomes EPIGENOMIC VARIATION CORRELATIVE ANALYSES DNA sequencing will become a more universal readout for an increasingly wide variety of front-end assays. ascertainment of genome-wide patterns of methylation and how these patterns change through the course of an organism’s development, in the context of disease, and under various other influences.

CHARACTERIZE STRAINS To characterize strains or isolates relative to high-quality reference genomes – Studies of this type will identify and catalog genomic variation on a wide scale. Ultimately, resequencing studies will help to better characterize, for example, the range of normal variation in complex genomes such as the human genome, and aid in our ability to comprehensively view the range of genome variation in clinical isolates of pathogenic microbes, viruses, etc.

CORRELATIVE ANALYSES Rapidly read out experimental results is the enhanced potential to combine the results of different experiments—correlative analyses of genome-wide methylation, histone binding patterns, and gene expression, for example— owing to the similar data type produced. The power in these correlative analyses is the power to begin unlocking the secrets of the cell.