Canadian Bioinformatics Workshops

Slides:



Advertisements
Similar presentations
Next–generation DNA sequencing technologies – theory & practice
Advertisements

High-Throughput Sequencing Technologies
Next-generation sequencing
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
What Is Genomics? Genomics is the study of how the entire genome of a species functions as a unit and evolves over time. It is the study of life’s blueprint,
Greg Phillips Veterinary Microbiology
Bioinformatics for high-throughput DNA sequencing Gabor Marth Boston College Biology New grad student orientation Boston College September 8, 2009.
Cycle Sequencing. Broad and Long Term Objective To characterize a single clone from an Emiliania huxleyi cDNA library using sequence analysis To characterize.
The Human Genome Project and ~ 100 other genome projects:
The SOLiD System: Next-Generation Sequencing Overview of the SOLiD System –  Scalable  Accurate Ultra High Throughput  Flexible  Mate Pairs.
1 Next Generation Sequencing Itai Sharon November 11th, 2009 Introduction to Bioinformatics.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Lecture 1 Introduction to high throughput sequencing
A Contract Research and Services Organization. Ideas to Life! A Contract Research and Services Organization  Xcelris is a Specialty Contract Research.
CS 6293 Advanced Topics: Current Bioinformatics
Next Generation DNA Sequencing Platforms: Evolving Tools for
NEXT GENERATION SEQUENCING Technologies on Biomedical Research
DNA Sequencing LECTURE 6: Biotechnology; 3 Credit hours Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences and Technology.
Update on Next-Generation Sequencing
Next Now-Generation Genomics: methods and applications for modern disease research Aaron J. Mackey, Ph.D. Center for Public Health.
High-Throughput Sequencing Technologies
Automated DNA Sequencing LECTURE 7: Biotechnology; 3 Credit hours Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences and.
Next generation sequencing Xusheng Wang 4/29/2010.
Analyzing your clone 1) FISH 2) “Restriction mapping” 3) Southern analysis : DNA 4) Northern analysis: RNA tells size tells which tissues or conditions.
High Throughput Sequencing Methods and Concepts
AP Biology A Lot More Advanced Biotechnology Tools Sequencing.
Technological Solutions. In 1977 Sanger et al. were able to work out the complete nucleotide sequence in a virus – (Phage 0X174) This breakthrough allowed.
Collecting and Storing Sequences In the laboratory Heather Helm UPR Sequencing Facilities Manager.
NEXT – GEN SEQUENCING TECHNIQUES
Genome sequencing Haixu Tang School of Informatics.
Bioinformatics and Sequencing Relevant to SolCAP
A Lot More Advanced Biotechnology Tools (Part 1) Sequencing.
High Throughput Sequencing Methods and Concepts Cedric Notredame adapted from S.M Brown.
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
CO341: Introduction to Bioinformatics Prof. Yi-Ke Guo
CHAPTER 7 DNA SEQUENCING - INTRODUCTION - SANGER DIDEOXY METHOD - AUTOMATED SEQUENCING - NEXT GENERATION OF SEQUENCING METHODS MISS NUR SHALENA SOFIAN.
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
How will new sequencing technologies enable the HMP? Elaine Mardis, Ph.D. Associate Professor of Genetics Co-Director, Genome Sequencing Center Washington.
Sequencing and Assembly GEN875, Genomics and Proteomics, Fall 2010.
Molecular Biology Dr. Chaim Wachtel May 28, 2015.
PHYSICAL MAPPING AND POSITIONAL CLONING. Linkage mapping – Flanking markers identified – 1cM, for example Probably ~ 1 MB or more in humans Need very.
Polymerase Chain Reaction (PCR)
Sequencing DNA 1. Maxam & Gilbert's method (chemical cleavage) 2. Fred Sanger's method (dideoxy method) 3. AUTOMATED sequencing (dideoxy, using fluorescent.
Biochemistry 412 Overview of Genomics & Proteomics 18 January 2005.
Locating and sequencing genes
IB Cloning and Sequencing - Jan 23
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Introduction to next-gen sequencing bioinformatics.ca Canadian Bioinformatics Workshops
Topic Cloning and analyzing oxalate degrading enzymes to see if they dissolve kidney stones with Dr. VanWert.
Introduction to Illumina Sequencing
Cse587A/Bio 5747: L2 1/19/06 1 DNA sequencing: Basic idea Background: test tube DNA synthesis DNA polymerase (a natural enzyme) extends 2-stranded DNA.
DNA Sequencing First generation techniques
Next-generation sequencing technology
DNA Sequencing Second generation techniques
Next generation sequencing
Sequencing technologies
Introduction to next generation sequencing
DNA Sequencing -sayed Mohammad Amin Nourion -A’Kia Buford
Next-generation sequencing technology
Joseph E. Conley, Alex J. Meisel, and James J
Sequencing Technologies
SOLEXA aka: Sequencing by Synthesis
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
ULTRASEQUENCING. Next Generation Sequencing: methods and applications.
The impact of next-generation sequencing technology on genetics
Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine
Plant Biotechnology Lecture 2
Next-generation DNA sequencing
Standard (Sanger) sequencing
A Lot More Advanced Biotechnology Tools
Presentation transcript:

Canadian Bioinformatics Workshops www.bioinformatics.ca

Introduction to next-gen sequencing Informatics on High Throughput Sequencing Data Introduction to next-gen sequencing Francis Ouellette francis@oicr.on.ca July 25th 2008

Outline Sequencing DNA Next Generation Technologies Solexa SOLiD 454 Helicos AB’s color space What next, & things to keep in mind!

Adapted from John McPherson, OICR Biological Research

History of DNA Sequencing Adapted from Eric Green, NIH; Adapted from Messing & Llaca, PNAS (1998) Avery: Proposes DNA as ‘Genetic Material’ Watson & Crick: Double Helix Structure of DNA Holley: Sequences Yeast tRNAAla 1870 1953 1940 1965 1970 1977 1980 1990 2002 Miescher: Discovers DNA Wu: Sequences  Cohesive End DNA Sanger: Dideoxy Chain Termination Gilbert: Chemical Degradation Messing: M13 Cloning Hood et al.: Partial Automation Cycle Sequencing Improved Sequencing Enzymes Improved Fluorescent Detection Schemes 1986 Next Generation Sequencing Improved enzymes and chemistry Improved image processing Efficiency (bp/person/year) 1 15 150 1,500 15,000 25,000 50,000 200,000 50,000,000 100,000,000,000 2008

Basics of the “old” technology Clone the DNA. Generate a ladder of labeled (colored) molecules that are different by 1 nucleotide. Separate mixture on some matrix. Detect fluorochrome by laser. Interpret peaks as string of DNA. Strings are 500 to 1,000 letters long 1 machine generates 57,000 nucleotides/run Assemble all strings into a genome.

Basics of the “new” technology Get DNA. Attach it to something. Extend and amplify signal with some color scheme. Detect fluorochrome by microscopy. Interpret series of spots as short strings of DNA. Strings are 30-300 letters long Multiple images are interpreted as 0.4 to 1.2 GB/run (1,200,000,000 letters/day). Map or align strings to one or many genome.

From Debbie Nickerson, Department of Genome Sciences, University of Washington, http://tinyurl.com/6zbzh4

Differences between the various platforms: Nanotechnology used. Resolution of the image analysis. Chemistry and enzymology. Signal to noise detection in the software Software/images/file size/pipeline Cost $$$

Next Generation DNA Sequencing Technologies Adapted from Richard Wilson, School of Medicine, Washington University, “Sequencing the Cancer Genome” http://tinyurl.com/5f3alk Next Generation DNA Sequencing Technologies 3 Gb ==

Solexa

Solexa-based Whole Genome Sequencing Adapted from Richard Wilson, School of Medicine, Washington University, “Sequencing the Cancer Genome” http://tinyurl.com/5f3alk

Solexa-based Whole Genome Sequencing Adapted from Richard Wilson, School of Medicine, Washington University, “Sequencing the Cancer Genome” http://tinyurl.com/5f3alk Solexa flow cell ~50M clusters are sequenced per flow cell.

Debbie Nickerson, Department of Genome Sciences, University of Washington, http://tinyurl.com/6zbzh4

454

Roche / 454 : GS FLX Real Time Sequencing by Synthesis Chemiluminescence detection in pico titer plates Amplification: emulsion PCR Pyrosequencing up to 400,000 reads / run on average 250 bases / read (and longer) up to 100 Mb / run

Roche / 454 : GS FLX Made for de novo sequencing. Too expensive for resequencing. For example, this platform will be used a lot by laboratories doing new bacterial genomes. Baylor Genome Center involved in Sea Urchin, Bee, Platypus genomes: They have a number of 454.

Helicos

Single Molecule Sequencing Adapted from: Barak Cohen, Washington University, Bio5488 http://tinyurl.com/6zttuq http://tinyurl.com/6k26nh Single Molecule Sequencing Microscope slide * * * Single DNA molecule Super-cooled TIRF microscope primer dNTP-Cy3 * Helicos Biosciences Corp.

Helicos Approximate Data Production per Run at Current Peak Throughput (1 strand/µ2) Single Pass Dual Pass 7 day run 14 day run Image Data: 35 TB 60 TB Diagnostic Images: 350 GB 600 GB Object Table: 3.5 TB 6 TB Sequence Data: 350 GB 600 GB Log Files: 350 GB 600 GB Total ~4.5 TB ~7.8 TB (w/o full image stack)

ABI SOLiD

File management

SOLiD color space

It’s more complicated! Get files with quality scores Get files with miss-matches Need to align them to a reference genome Multiple tools do this today … and there will be more later. What do you do? Do it all!

Things to keep in mind All people are learning, if you don’t know, ask, and they probably won’t know either, and you can figure it out together! The technology is changing – This workshop next year will be totally different! We can only do so much in two days – you will need to find things, find people who can help you, and you will need to teach your friends!

Other factors Changing technology Changing price structure New and disappearing companies? Changing price structure Cost of machine Cost of operation (reagents/people) Service from the company 1 machine vs (2 or 3 machines) vs 40 machines. Changing software and processing

Pacific Biosystems (PacBio)

Questions? Coffee break!

Day 1