Genetic Map and Forward Genetics Tools for C. briggsae Presented by Dan Koboldt Ray Miller’s Group.

Slides:



Advertisements
Similar presentations
Mo17 shotgun project Goal: sequence Mo17 gene space with inexpensive new technologies Datasets in progress: Four-phases of 454-FLX sequencing to max of.
Advertisements

Reference mapping and variant detection Peter Tsai Bioinformatics Institute, University of Auckland.
Polymorphisms: Clinical Implications By Amr S. Moustafa, M.D.; Ph.D. Assistant Prof. & Consultant, Medical Biochemistry Dept. College of Medicine, KSU.
Targeted Data Introduction  Many mapping, alignment and variant calling algorithms  Most of these have been developed for whole genome sequencing and.
Outline to SNP bioinformatics lecture
Next-generation sequencing – the informatics angle Gabor T. Marth Boston College Biology Department AGBT 2008 Marco Island, FL. February
9 Genomics and Beyond Brief Chapter Outline
Design Goals Crash Course: Reference-guided Assembly.
Bioinformatics for high-throughput DNA sequencing Gabor Marth Boston College Biology New grad student orientation Boston College September 8, 2009.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
General methods of SNP discovery: PolyBayes Gabor T. Marth Department of Biology Boston College Chestnut Hill, MA
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
The Extraction of Single Nucleotide Polymorphisms and the Use of Current Sequencing Tools Stephen Tetreault Department of Mathematics and Computer Science.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Bioinformatics for next-generation DNA sequencing Gabor T. Marth Boston College Biology Department BC Biology new graduate student orientation September.
16 and 20 February, 2004 Chapter 9 Genomics Mapping and characterizing whole genomes.
Informatics tools for next-generation sequence analysis Gabor T. Marth Boston College Biology Department University of Michigan October 20, 2008.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Human Genome Sequence and Variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary,
Genome sequencing and assembling
Polymorphism discovery informatics Gabor T. Marth Department of Biology Boston College Chestnut Hill, MA
Sequence Variation Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Informatics for next-generation sequence analysis – SNP calling Gabor T. Marth Boston College Biology Department PSB 2008 January
Informatics challenges and computer tools for sequencing 1000s of human genomes Gabor T. Marth Boston College Biology Department Cold Spring Harbor Laboratory.
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Restriction Fragment Length Polymorphisms (RFLPs) By Amr S. Moustafa, M.D.; Ph.D. Assistant Prof. & Consultant, Medical Biochemistry Dept. College of.
Plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic.
Next generation sequencing Xusheng Wang 4/29/2010.
Considerations for Analyzing Targeted NGS Data Introduction Tim Hague, CTO.
Copyright © 2011 Partek Incorporated. All rights reserved. Statistics Visualizations Annotations Start-to-Finish Analysis of Integrated Genomics.
RExPrimer Pongsakorn Wangkumhang, M.Sc. Biostatistics and Informatics Laboratory, Genome Institute, National Center for Genetic Engineering and Biotechnology.
GeVab: Genome Variation Analysis Browsing Server Korean BioInformation Center, KRIBB InCoB2009 KRIBB
Mouse Genome Sequencing
Genomics BIT 220 Chapter 21.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
How I learned to quit worrying Deanna M. Church Staff Scientist, Short Course in Medical Genetics 2013 And love multiple coordinate.
High throughput sequencing: informatics & software aspects Gabor T. Marth Boston College Biology Department BI543 Fall 2013 January 29, 2013.
By Zemin Ning & Adam Spargo Informatics Division The Wellcome Trust Sanger Institute The SSAHA2 Application Pack.
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
© 2010 by The Samuel Roberts Noble Foundation, Inc. 1 The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK, 73401, USA 2 National Center.
Genomics Method Seminar - BreakDancer January 21, 2015 Sora Kim Researcher Yonsei Biomedical Science Institute Yonsei University College.
Managing Next Generation Sequence Data with GMOD Dave Clements 1, Scott Cain 2, Paul Hohenlohe 3, Nicholas Stiffler 3, Paul Etter 3, Eric Johnson 3, William.
Linkage and Mapping. Figure 4-8 For linked genes, recombinant frequencies are less than 50 percent.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
The Genome Assemblies of Tasmanian Devil Zemin Ning The Wellcome Trust Sanger Institute.
Class 22 DNA Polymorphisms Based on Chapter 10 Recombinant DNA Technology Copyright © 2010 Pearson Education Inc.
A guided tour of Ensembl This quick tour will give you an outline view of what Ensembl is all about. You will learn: –Why we need Ensembl –What is in the.
GSVCaller – R-based computational framework for detection and annotation of short sequence variations in the human genome Vasily V. Grinev Associate Professor.
SNP Discovery in Whole-Genome Light-Shotgun 454 Pyrosequences Aaron Quinlan 1, Andrew Clark 2, Elaine Mardis 3, Gabor Marth 1 (1) Department of Biology,
Chapter 5 Sequence Assembly: Assembling the Human Genome.
A brief guide to sequencing Dr Gavin Band Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for Health.
Culturable Bacterial Communities Analyzer DIANA VANESSA SARRIA-ZUNIGA ELIANA TORRES-ZELADA April 29, 2016.
Canadian Bioinformatics Workshops
Virginia Commonwealth University
Precise Identification of Structural Variations in the Human Genome by Splitting Shotgun Reads Zemin Ning1, Anthony Cox1, David Adams1, Paul Flicek2, Charles.
SNP Detection Congtam Pham 2/24/04 Dr. Marth’s Class.
Pre-genomic era: finding your own clones
Ssaha_pileup - a SNP/indel detection pipeline from new sequencing data
DNA Marker Lecture 10 BY Ms. Shumaila Azam
Relationship between Genotype and Phenotype
Discovery tools for human genetic variations
Databases BI420 – Introduction to Bioinformatics Gabor T. Marth
Next-generation DNA sequencing
Databases BI420 – Introduction to Bioinformatics Gabor T. Marth
Polymorphism discovery in 09-CB1 × IPO323 versus 09-ASA-3apz × IPO94269 bulks. Polymorphism discovery in 09-CB1 × IPO323 versus 09-ASA-3apz × IPO94269.
Sequence the 3 billion base pairs of human
Research for medical discovery at the Computational Genomics Laboratory at Boston College Biology Gabor T. Marth Department of Biology, Boston College.
Volume 12, Issue 17, Pages (September 2002)
Relationship between Genotype and Phenotype
Presentation transcript:

Genetic Map and Forward Genetics Tools for C. briggsae Presented by Dan Koboldt Ray Miller’s Group

Outline I. Physical Map of C. briggsae II. Constructing the Initial Genetic Map III. Polymorphism Discovery IV. Assay Development V. The Genetic Map Web Site VI. Resources for C. briggsae as a model organism

The Physical Map (cb25) Shotgun sequencing of AF16 strain Shotgun sequencing of AF16 strain 5,341 supercontigs 5,341 supercontigs 578 fingerprint contigs 578 fingerprint contigs Assembly not organized by chromosomes Assembly not organized by chromosomes

The Initial SNP Map 1. S. Baird made 2 sets of ~100 Recombinant Inbred lines Cross 1: AF16 X HK104 Cross 2: AF16 X VT SNPs discovered by shotgun sequencing (GSC). 3. Selected 267 SNPs from the largest contigs in the physical map 4. Genotyped the RILs 5. Assembled the map using data from 248 SNPs.

The Draft Genetic Map Genetic Map of Cb4 (R. Miller) The Genetic Map and the Physical Map Coverage as of v3.0: 117 ultra-contigs (~71.5%) of the C. briggsae genome. Several changes made to cb25 genome assembly (R. Waterston and L. Hillier – pers. comm). Homology with C. elegans (Ibid) briggsaeelegans Cb1I Cb2II Cb3III Cb4IV Cb5V CbXX

SNP Discovery in HK104 In 13,632 HK104 sequence traces: 7,669 SNPs found by all methods 15,438 SNPs found by two methods Reasons for Disagreement Quality scoring & trimming Repetitive regions

HK104 SNPs

Improving SNP Discovery Data set may be used to test the next version of Polybayes (G. Marth, pers. comm). Data set may be used to test the next version of Polybayes (G. Marth, pers. comm). New software: Polyphred, Ssaha-SNP, novoSNP New software: Polyphred, Ssaha-SNP, novoSNP High-confidence SNPs will be submitted to Wormbase High-confidence SNPs will be submitted to Wormbase Quality scores and “hits”  quantify the SNP quality Quality scores and “hits”  quantify the SNP quality

Structural Polymorphisms SSAHA-DIP for Deletion-Insertion Polys SSAHA found 2,842 indels of >=2 bp 627 long (>= 7bp) HK104 indels identified Two-Step Algorithm for Structural Variants Blast step to identify break-point reads Pair-wise alignment to characterize variant(s).

Two-step Applied to 800 Cb reads Breakpoint ReadPredicted VariantCharacterized Variant(s) pjn91h04.g127 bp insertion pjn90e12.b128 bp insertion27 bp insertion pjn90h04.b144 bp insertion45 bp insertion pjn88f02.b170 bp insertion67 bp insertion pjn88e06.g188 bp insertion86 bp insertion pjn89d12.g1101 bp insertion93 bp insertion pjn89f10.b1151 bp insertion pjn91b04.g1247 bp insertion241 bp insertion pjn89c08.b1271 bp insertion pjn89f09.b145 bp deletion48 bp deletion pjn89h09.g151 bp deletion61 bp deletion and 12bp insertion pjn88d09.g1653 bp deletion651 bp deletion pjn91b04.b1953 bp deletion956 bp deletion pjn89d08.b11.3 kb deletion1.267 kb deletion pjn91d10.b1277 bp deletionpossible segmental duplication

Forward Genetics Tools SNPs and Indels FP-TDI genotyping assays FP-TDI genotyping assays RFLP assays (T. Harris) RFLP assays (T. Harris) PCR fragment length (PLP) assays PCR fragment length (PLP) assays Array-based technology (S. Baird) Array-based technology (S. Baird) Insertional Mutagenesis Mos1 insertions (M-A. Felix) Mos1 insertions (M-A. Felix)

Web Site at

Genetic Map Online

Browse Polymorphisms click!

Future Directions Improving/integrating the genetic map More genotyping to improve/resolve coverage More genotyping to improve/resolve coverage Additional HK104 SNP discovery (454 run?) Additional HK104 SNP discovery (454 run?) Developing and sharing resources Identification of snip-SNPs Identification of snip-SNPs Insertion-deletion validation / array design Insertion-deletion validation / array design Detecting structural variants in C. elegans Detecting structural variants in C. elegans

Take-Home Messages C. briggsae now has powerful tools to support it as a model organism. C. briggsae now has powerful tools to support it as a model organism. Comparative studies of C. elegans and C. briggsae are becoming feasible and should be considered. Comparative studies of C. elegans and C. briggsae are becoming feasible and should be considered.

Acknowledgements C. briggsae Genetic Map Advisory Committee Scott Baird, Helen Chamberlin, Bhagwati Gupta, Eric Haag, and Ray Miller Scott Baird, Helen Chamberlin, Bhagwati Gupta, Eric Haag, and Ray Miller Other Collaborators Marie-Anne Felix, Todd Harris, LaDeana Hillier, Gabor Marth, Bob Waterston Marie-Anne Felix, Todd Harris, LaDeana Hillier, Gabor Marth, Bob Waterston Funding Support NIH NIH

Thank You EMBO & I.G.C* Beautiful Country Free Food *Gulbenkian, not Gularenkian