Basic Techniques Project Design Process Improvements

Slides:



Advertisements
Similar presentations
PCR, Gel Electrophoresis, and Southern Blotting
Advertisements

SEQUENCING-related topics 1. chain-termination sequencing 2. the polymerase chain reaction (PCR) 3. cycle sequencing 4. large scale sequencing stefanie.hartmann.
DNA Sequencing.
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
SNP Discovery in the Human Genome C244/144 November 21, 2005.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
DNA Sequencing. DNA sequencing … ACGTGACTGAGGACCGTG CGACTGAGACTGACTGGGT CTAGCTAGACTACGTTTTA TATATATATACGTCGTCGT ACTGATGACTAGATTACAG ACTGATTTAGATACCTGAC.
“First generation" sequencing technologies and genome assembly
Chapter 14 Jizhong Zhou and Dorothea K. Thompson.
BioInformatics (2). Physical Mapping - I Low resolution  Megabase-scale High resolution  Kilobase-scale or better Methods for low resolution mapping.
Recombinant DNA Technology for the non- science major.
Analyzing your clone 1) FISH 2) “Restriction mapping” 3) Southern analysis : DNA 4) Northern analysis: RNA tells size tells which tissues or conditions.
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
High Throughput Sequencing Methods and Concepts
Genome Characterization Assembly/sequencing BIO520 BioinformaticsJim Lund Assigned reading: Ch 9.
Phred/Phrap/Consed Analysis A User’s View Arthur Gruber International Training Course on Bioinformatics Applied to Genomic Studies Rio de Janeiro 2001.
Manipulating DNA.
Module 1 Section 1.3 DNA Technology
Genome sequencing Haixu Tang School of Informatics.
Steps in a genome sequencing project Funding and sequencing strategy source of funding identified / community drive development of sequencing strategy.
A Lot More Advanced Biotechnology Tools (Part 1) Sequencing.
High Throughput Sequencing Methods and Concepts Cedric Notredame adapted from S.M Brown.
Biological Motivation for Fragment Assembly Rhys Price Jones Anne R. Haake.
A Sequenciação em Análises Clínicas Polymerase Chain Reaction.
SIZE SELECT SHEAR Shotgun DNA Sequencing (Technology) DNA target sample LIGATE & CLONE Vector End Reads (Mates) SEQUENCE Primer.
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
Genome Characterization DNA sequence-ULTIMATE Map DNA sequencing-methods Assembly/sequencing BIO520 BioinformaticsJim Lund Assigned reading: Service 2006.
PHYSICAL MAPPING AND POSITIONAL CLONING. Linkage mapping – Flanking markers identified – 1cM, for example Probably ~ 1 MB or more in humans Need very.
Wageningen, April 24-25, 2008 II Tomato Finishing Workshop Chromosome 12 Update ENEA, Rome University of Naples ‘Federico II’ CRIBI and Univ. of Padua.
Human Genome.
GENE SEQUENCING. INTRODUCTION CELL The cells contain the nucleus. The chromosomes are present within the nucleus.
Automatic DNA and Genome Sequencing
The Polymerase Chain Reaction (DNA Amplification)
Advantages of STR Analysis
Chemical Synthesis, Amplification, and Sequencing of DNA (Part II)
Genotyping and Genetic Maps Bas Heijmans Leiden University Medical Centre The Netherlands.
Molecular Tools. Recombinant DNA Restriction enzymes Vectors Ligase and other enzymes.
Genomics Part 1. Human Genome Project  G oal is to identify the DNA sequence of every gene in humans Genome  all the DNA in one cell of an organism.
PCR With PCR it is possible to amplify a single piece of DNA, or a very small number of pieces of DNA, over many cycles, generating millions of copies.
FOOTHILL HIGH SCHOOL SCIENCE DEPARTMENT Chapter 13 Genetic Engineering Section 13-2 Manipulating DNA.
Introduction to PCR Polymerase Chain Reaction
Chapter 5 Sequence Assembly: Assembling the Human Genome.
Higher Human Biology Unit 1 Human Cells KEY AREA 5: Human Genomics.
Genome sequencing and annotation Week 2 reading assignment - pages 63-78, 93-98, Boxes 2.1 and don’t worry about details of similarity scoring.
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
Topic Cloning and analyzing oxalate degrading enzymes to see if they dissolve kidney stones with Dr. VanWert.
Cse587A/Bio 5747: L2 1/19/06 1 DNA sequencing: Basic idea Background: test tube DNA synthesis DNA polymerase (a natural enzyme) extends 2-stranded DNA.
DNA Sequencing First generation techniques
Introduction to PCR Polymerase Chain Reaction
Virginia Commonwealth University
Next generation sequencing
DNA Sequencing -sayed Mohammad Amin Nourion -A’Kia Buford
Genomics Sequencing genomes.
DNA Sequencing Techniques
Gene Isolation and Manipulation
AMPLIFYING AND ANALYZING DNA.
Engineering magnetosomes to express novel proteins
Relationship between Genotype and Phenotype
Relationship between Genotype and Phenotype
Polymerase Chain Reaction (PCR)
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
DNA and the Genome Key Area 8a Genomic Sequencing.
A Sequenciação em Análises Clínicas
Plant Biotechnology Lecture 2
Provide a genuine experience in using cell and molecular biology to learn about a fundamental problem in biology. Rather than following a set series of.
Introduction to Sequencing
Sequence the 3 billion base pairs of human
Relationship between Genotype and Phenotype
A Lot More Advanced Biotechnology Tools
Presentation transcript:

Basic Techniques Project Design Process Improvements DNA Sequencing Basic Techniques Project Design Process Improvements 9/18/2018 Chuck Staben

Project Size/Type 500 bases 2500 bases 10 kbp 150 kbp 3 Mbp BIG simple repeats BIG 1 locus EST,STS whole cDNA/EST gene, virus BAC, big virus bacterial genome YAC-size HUMAN, etc. 9/18/2018 Chuck Staben

DNA Sequencing Methods Chain termination/Dideoxy/Sanger fluorescence paradigm, ABI, HOOD Sequencing by hybridization chips Affymetrix (Lander, et al) other formats Hyseq (Church, et al) Lark 9/18/2018 Chuck Staben

Dideoxy/Chain Terminator/Sanger Template Primer Extension Chemistry polymerase termination labeling Separation Detection 9/18/2018 Chuck Staben

Chain Terminator Basics Template-Primer Target TGCA ddA ddG ddC ddT Terminators Extend ddA A ddC dN : ddN 100 : 1 Ladder n, n+1... AC ddG ACG ddT 9/18/2018 Chuck Staben

Electrophoresis 9/18/2018 Chuck Staben

Template Preparation ssDNA vectors PCR dsDNA (+/- PCR) M13 pUC 9/18/2018 Chuck Staben

Primers Primer Label Dye Terminator Universal primers Custom primers cheap, reliable, easy, fast, parallel BULK sequencing Custom primers expensive, slow, one-at-a-time ADAPTABLE Primer Label Dye Terminator 9/18/2018 Chuck Staben

Extension Chemistry 100% termination Accurate Even signal Polymerase Sequenase Thermostable (Cycle Sequencing) Terminators Dye labels (“Big Dye”) spectrally different, high fluorescence (mass labels??) ddA,C,G,T with primer labels 9/18/2018 Chuck Staben

Separation migration ~1/log N Gel Electrophoresis Capillary Electrophoresis suited to automation rapid (2 hrs vs 12 hrs) re-usable simple temperature control 96 well format migration ~1/log N 9/18/2018 Chuck Staben

Paradigm Instrument Applied Biosystems ABI3700 (early 1999) 1500 samples/day! http://www2.perkin-elmer.com/ga/3700/features.html ABI377 (gel) and ABI310 (capillary) 9/18/2018 Chuck Staben

Alternate Instruments Molecular Dynamics, Beckman Coulter… ALF, LiCor infrared detection Not Complete List 9/18/2018 Chuck Staben

Sample Output 1 lane 9/18/2018 Chuck Staben

Trace Editing EditView Chromas Consed Mac WinNT UNIX 9/18/2018 Chuck Staben

Project Goals de novo sequence repetitive sequencing Chain terminators Sequencing by hybridization Chip technology, eg 9/18/2018 Chuck Staben

Sequencing Strategies Random Sequence Brute Force Ordered Divide and Conquer Sequencing Assembly Finishing Annotation Mix to Suit 9/18/2018 Chuck Staben

Random Method Assemble Contigs Shear DNA (nebulize) Produce template finish ends, ligate into vector Produce template Sequence to target coverage read length (500 typical) accuracy (99% good) Assemble Contigs 9/18/2018 Chuck Staben

Random T C No coverage DISAGREEMENT Only 1 strand 9/18/2018 Chuck Staben

Poisson Statistics P0=e-L(N)/G L=read length N=#reads G=genome size 9/18/2018 Chuck Staben

Poisson-2 Gap Length=P0G 9/18/2018 Chuck Staben

Gap Number=P0N (assume N=500 bases) Poisson-3 Gap Number=P0N (assume N=500 bases) 9/18/2018 Chuck Staben

4 Mbp Genome 55 instrument days on ABI3700 10x Coverage 80,000 reads at 500 bases/read 4 gaps 400 bases in gaps 55 instrument days on ABI3700 9/18/2018 Chuck Staben

3000 Mbp Genome HUMAN 50000 instrument days on ABI3700 300 machines, 3 years Plenty 9/18/2018 Chuck Staben

Automation QT 9/18/2018 Chuck Staben

Costs $0.50/base Raw cost ~$0.01/base “Semi-finished” $0.10 per base High-quality Genome Project $0.50/base 9/18/2018 Chuck Staben

Ordered Methods Primer Walking Nested Deletion 9/18/2018 Chuck Staben

Limitations Slow, Expensive Expertise Needed Repeat Problems especially nested deletion Repeat Problems especially primer walking 9/18/2018 Chuck Staben

Finishing GOALS Finish when random no longer productive (3-10 X range) >95% coverage on BOTH strands every base covered 3X resolve ambiguities Finish when random no longer productive (3-10 X range) 9/18/2018 Chuck Staben

Finish-How Identify gaps, ambiguities Extend from end of contigs specific primers subclones, etc. Resolve ambiguities consensus or resequence specific primers, different chemistry 9/18/2018 Chuck Staben

Assembly Methods Strip out vector Mask known repeats Trim off unreliable data Find Matches (500 x 500 x many!!) how long (and what ktuple) how perfect (reliability index) where to look? (ends only vs entire) 9/18/2018 Chuck Staben

Assembly Programs PHRAP FAMILY SeqWeb…. phrap, kangaroo, phrapo, GAP4, TIGRAssembler,... GCG gelstart, gelenter, gelmerge, gelassemble, geldisassemble thinly veiled vi editor SeqWeb…. 9/18/2018 Chuck Staben

Assembly Improvements Repeat Problems Multiple fragment sizes in 1 project Use length/distance info 9/18/2018 Chuck Staben

Project Management Editing and Assembly Databases RepeatMasker Phred/Phrap Consed Databases ACeDB A C. elegans database Oracle 9/18/2018 Chuck Staben

Annotation Submit to Genbank ...HTGS (level1,2,3) ...nr ORFs Repeats GRAIL, PowerBLAST Repeats Other Regions Submit to Genbank ...HTGS (level1,2,3) ...nr 9/18/2018 Chuck Staben

Sequencing by Hybridization Hybridize labeled query DNA CHIP OLIGOS (20-mers) ...gaactAatact... ...gaactCatact... ...gaactGatact... ...gaactTatact... A C G T site 1 ...gaactaAtact... ...gaactaCtact... ...gaactaGtact... ...gaactaTtact... site 2 GAACTATGTACT 9/18/2018 Chuck Staben

Modern Sequencing Challenges Heterozygous DNAs germline differences somatic variation Massive sequencing population studies genome scans Minimal sample preparation “Doctor’s Office” Chips, Quantitative Seq Automation Miniaturization 9/18/2018 Chuck Staben

Physical Mapping Genome Characterization Genome fragmentation and cloning vectors, etc. Physical map assembly hybridization fingerprinting 9/18/2018 Chuck Staben