Mouse Genome Sequencing

Slides:



Advertisements
Similar presentations
Accurate Assembly of Maize BACs Patrick S. Schnable Srinivas Aluru Iowa State University.
Advertisements

Sequencing a genome. Definition Determining the identity and order of nucleotides in the genetic material – usually DNA, sometimes RNA, of an organism.
Genomics Chapter 18.
Lecture 14 Genome sequencing projects
9 Genomics and Beyond Brief Chapter Outline
Sequencing a genome and Basic Sequence Alignment Lecture 10 1Global Sequence.
Alignment Problem (Optimal) pairwise alignment consists of considering all possible alignments of two sequences and choosing the optimal one. Sub-optimal.
Panzea Graphical Map Viewer Tutorial With the Panzea graphical map viewer you can construct custom graphical displays of the map positions.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
CISC667, F05, Lec4, Liao CISC 667 Intro to Bioinformatics (Fall 2005) Whole genome sequencing Mapping & Assembly.
Class 02: Whole genome sequencing. The seminal papers ``Is Whole Genome Sequencing Feasible?'' ``Whole-Genome DNA.
DNA Sequencing. The Walking Method 1.Build a very redundant library of BACs with sequenced clone- ends (cheap to build) 2.Sequence some “seed” clones.
DNA Sequencing and Assembly
The Human Genome Race. Collins vs. Venter Collins Venter.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Zebra Finch Seg Dup Analysis 1.Genome 2.Parameters for Pipeline 3.Analysis.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
CS273a Lecture 4, Autumn 08, Batzoglou Hierarchical Sequencing.
DNA Sequencing and Assembly. DNA sequencing How we obtain the sequence of nucleotides of a species …ACGTGACTGAGGACCGTG CGACTGAGACTGACTGGGT CTAGCTAGACTACGTTTTA.
Human Genome Project. Basic Strategy How to determine the sequence of the roughly 3 billion base pairs of the human genome. Started in Various side.
CS273a Lecture 2, Autumn 10, Batzoglou DNA Sequencing (cont.)
Genome sequencing and assembling
Compartmentalized Shotgun Assembly ? ? ? CSA Two stated motivations? ?
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Genome Analysis Determine locus & sequence of all the organism’s genes More than 100 genomes have been analysed including humans in the Human Genome Project.
Sequencing a genome and Basic Sequence Alignment Lecture 8 1Global Sequence.
Sequencing a genome and Basic Sequence Alignment
BioInformatics (2). Physical Mapping - I Low resolution  Megabase-scale High resolution  Kilobase-scale or better Methods for low resolution mapping.
Lecture 15 – Gene Cloning Based on Chapter 08 - Genomics: The Mapping and Sequencing of Genomes Copyright © 2010 Pearson Education Inc.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
Genomics Chapter 18.
HAPLOID GENOME SIZES (DNA PER HAPLOID CELL) Size rangeExample speciesEx. Size BACTERIA1-10 Mb E. coli: Mb FUNGI10-40 Mb S. cerevisiae 13 Mb INSECTS.
Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.
CUGI Pilot Sequencing/Assembly Projects Christopher Saski.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Tomato Chromosome 4: A Mapping & Sequencing Update 28 th September 2005 Christine Nicholson Mapping Core Group Welcome Trust Sanger Institute, UK.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Next generation sequence data and de novo assembly For human genetics By Jaap van der Heijden.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
Genome Sequencing in the Legumes Le et al Phylogeny Major sequencing efforts Minor sequencing efforts ~14 MY ~45 MY.
Steps in a genome sequencing project Funding and sequencing strategy source of funding identified / community drive development of sequencing strategy.
Biological Motivation for Fragment Assembly Rhys Price Jones Anne R. Haake.
Status report on gap closure of the human chromosome 5 BAC map Authentication of C5 BAC maps Map and sequence status Gap status and steps used to close.
Sequencing a genome and Basic Sequence Alignment
Advancing Science with DNA Sequence Metagenome definitions: a refresher course Natalia Ivanova MGM Workshop September 12, 2012.
Recombinant DNA Technology and Genomics A.Overview: B.Creating a DNA Library C.Recover the clone of interest D.Analyzing/characterizing the DNA - create.
Theobroma cacao Integrated Physical and Genetic Map 2 BAC Libraries 250 Genetic Markers.
Chromosome 2 Doil Choi, Sunghwan Jo KOREA. Cytological architecture of chromosome kb/µm DAPI (4’-6-diamidino-2-phenylindole) stained pachytene chromosome.
Linkage and Mapping. Figure 4-8 For linked genes, recombinant frequencies are less than 50 percent.
Wageningen, April 24-25, 2008 II Tomato Finishing Workshop Chromosome 12 Update ENEA, Rome University of Naples ‘Federico II’ CRIBI and Univ. of Padua.
Human Genome.
Mojavensis: Issues of Polymorphisms Chris Shaffer GEP 2009 Washington University.
Accessing and visualizing genomics data
Genome Analysis Assaad text book slides only Lectures by F. Assaad can be downlaoded from muenchen.de/~farhah/index.htm.
16 th April 2007 Christine Nicholson, Mapping Core Group Wellcome Trust Sanger Institute Tomato Chromosome 4 Mapping & Use of FPC Copyright Wellcome Trust.
CISC667, S07, Lec4, Liao CISC 667 Intro to Bioinformatics (Spring 2007) Whole genome sequencing Mapping & Assembly.
Welcome to the combined BLAST and Genome Browser Tutorial.
Genome Analysis. This involves finding out the: order of the bases in the DNA location of genes parts of the DNA that controls the activity of the genes.
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
Structural genomics includes the genetic mapping, physical mapping and sequencing of entire genomes.
Fig. S1: Fingerprinting of three BAC clones from different accessions of wild rice species with the AA genome constitution. The BAC DNAs were completely.
Objectives: Outline the steps involved in sequencing the genome of an organism. Outline how gene sequencing allows for genome wide comparisons between.
Radiation hybrid map of the zebrafish genome
Virginia Commonwealth University
Human Genome Project.
Pre-genomic era: finding your own clones
Peter John M.Phil, PhD Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences & Technology (NUST)
A Sequenciação em Análises Clínicas
Introduction to Sequencing
Sequence the 3 billion base pairs of human
Presentation transcript:

Mouse Genome Sequencing 10/15/2002 Wei Yuan Mouse Genome Sequencing ---Sequencing strategy and the Physical Map Reference: 1) Gregory, et al. A physical map of the mouse genome. Nature, 2002, 418, 743-750 2) Green, ED, Strategies for the systematic sequencing of complex genomes. Nature reviews Genetics, 2001, 2, 573-583

Strategies for the systematic sequencing of complex genomes First Part Strategies for the systematic sequencing of complex genomes

Strategies for Complex Genomes sequencing clone-by-clone shotgun sequencing whole-genome shotgun sequencing hybrid strategies for shotgun sequencing Contig: overlapping series of clones or sequences reads (for a clone contig or sequencing contig, respectively) that corresponds to a contiguous segment of the source genome.

Two main shotgun-sequencing strategies clone-by-clone whole-genome

For clone-by-clone, a sequence-ready BAC contig map is required Sequence-ready BAC contig map. A collection of overlapping bacterial artificial chromosome (BAC) clones that contain human DNA was subjected to restriction enzyme digest-based fingerprint analysis. The resulting data was analysed using the program FPC, which constructed the depicted BAC contig map that spans >1 Mb. Minimal Tiling Path: a minimal set of overlapping clones that together provides complete coverage across a genomic region. (The 11 clones outlined in red, which provide a minimal tiling path across the corresponding genomic region, were selected for sequencing. )

The probability of two clones overlapping is based on the similarity of their fragments, performed by the program FPC. FPC uses an algorithm to cluster clones into contigs based on their probability of coincidence score. For each contig, it builds a consensus band (CB) map which is similar to a restriction map; but it does not try to resolve all the errors. The CB map is used to assign coordinates to the clones based on their alignment to the map and to provide a detailed visualization of the clone overlap. two clones are considered to overlap if the following score is below a user supplied cutoff: M is the number of shared bands, nL and nH are the lowest and highest number of bands in the two clones, respectively, t is the tolerance, gellen is approximately the number of possible values, b = 2t/gellen, and p = (1 b)nH,.

Shotgun-sequence assembly ---display from the program Consed

Hybrid shotgun-sequencing approach

Hybrid shotgun-sequencing approach take benefits of both clone-by-clone and whole-genome shotgun whole-genome shotgun: provides rapid insight about the sequence of the entire genome clone-by-clone shotgun: simplifies the process of sequence assembly to individual clone-sized genomic segments, thereby minimizing the likelihood of serious misassemblies Used by NIH in mouse genome sequencing. (Celera is using whole genome shotgun)

Construction of a physical map of the mouse genome Second Part Construction of a physical map of the mouse genome

Physical map of a genome is an essential guide for navigation, allowing the location of any gene or other landmark in the chromosomal DNA. It provides: a framework for assembly of whole-genome shotgun sequence data a tile path of clones for generation of the reference sequence

Strategy: Using the human sequence as a framework! Benefit: 1. Give a better level of resolution 2. Accelerate the process of constructing the mouse clone map

Because they are similar in sequence organization! But why to choose human sequence? Because they are similar in sequence organization! 180 conserved synteny (a region where the chromosomal location of multiple genes is conserved) conserved segment/linkage (a region where the order of multiple genes on a single chromosome segment is the same in both species)

Comparing Human and Mouse DNA Most human genes have mouse orthologs Coding exons usually correspond 1-1 Coding sequence similarity ~ 85%

Let’s go back to an old slide

How to construct a physical clone map of the mouse genome Two Phases: Phase I: Generation of a human-mouse homology clone map Compared restriction digest patterns (‘fingerprint’) of 305,716 BAC clones. Identified overlaps between clones on the basis of similarity between fingerprints and use this information to construct 7,587 contigs of overlapping clones. ----Done by the program FPC, under high strigency conditions at a probability of 1x10-16 and a match tolerance of seven Align the mouse BAC contigs to the human genome sequence by BES (BAC end sequences). Extend and join contigs where possible after re-examing the fingerprint data (p>1x10-12). ---- Done by BLASTN (with a blast score>700)

Phase II: Generation of a mouse clone map Use a set of independently mapped mouse markers (available in existing genetic and radiation hybrid maps of the mouse) to position the BAC contigs in the mouse genome. ---Markers were added to the map either by electronic PCR, or by hybridization using probes After further manual contig editing was carried out (p>1x10-10), a mouse clone map comprising 296 contigs was generated.

Construction of human–mouse homology clone map Alignment between part of human chromosome 6 (Hsa6) and mouse chromosome 4 (Mmu4). A 1.6-Mb interval is enlarged, showing part of Hsa6q16.1 aligned to a 1.3-Mb mouse BAC contig. 11 of the 15 segments of human sequence match to 29 of the BESs within a mouse BAC contig

Summary statistics of human-mouse homology clone map

Summary statistics of mouse physical clone map by chromosome

More details about the mouse physical map found 51,486 homologous crosslinks btw two genomes Of the clones in the human genome tile path, 88% are collinear with the mouse BAC map. For individual human chromosomes, coverage by aligned mouse contigs exceeds 80% on all except chromosome 19 (61%) and the Y chromosome (0%). Of the total coverage of the mouse BAC map (in 211 contigs), 97% (2,658 Mb) is aligned to the human genome sequence. Most mouse BAC contigs contained multiple mouse markers (average 57 markers per contig). coverage of the mouse genome (2.8 Gb) in mapped BACs is virtually complete: 296 contigs of average size 9.3 Mb cover an estimated 2,739 Mb. (~98%) 275 gaps due to breaks in synteny btw the two genomes.

Future Work for Mouse Genome Sequencing Finish sequencing by 2005 Analysis Sequence comparisons Annotation Gene Expression analysis Global genomic analysis

The End