Lindsay A. Shearer1, Lorinda K

Slides:



Advertisements
Similar presentations
Genome Structure/Mapping Lisa Malm 05/April/2006 VCR 221 Lisa Malm 05/April/2006 VCR 221.
Advertisements

The Role of Fluorescence in situ hybridization (FISH) in Sequencing the Tomato Genome.
Progress on the sequencing of the euchromatic gene rich space of chromosome 6 of Solanum lycopersicum cv. Heinz 1706 Sander Peters Cologne Oct 2008.
9 Genomics and Beyond Brief Chapter Outline
PAA / Solanaceae July 23-27, Madison, Wisconsin, USA Sequencing the gene-rich space of tomato chromosome 7 Current status of the French effort.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
Expanding the Tool Kit for BAC Extension Summary of completion criteria developed for NSF Tomato Sequencing Workshop January 14, 2007.
Use of FISH in sequencing tomato chromosome 6 René Klein Lankhorst Hans de Jong Korea meeting 2007.
Genome sequencing and assembling
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Bacterial Genome Finishing Using Optical Mapping Dibyendu Kumar, Fahong Yu and William Farmerie Interdisciplinary Center for Biotechnology Research, University.
Today’s Lecture Genetic mapping studies: two approaches
Chromosomes Chapter 13.
EU-SOL 2008 November 13-16, Toulouse, FRANCE CHROMOSOME 7 SEQUENCING Current status and perspective TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
HAPLOID GENOME SIZES (DNA PER HAPLOID CELL) Size rangeExample speciesEx. Size BACTERIA1-10 Mb E. coli: Mb FUNGI10-40 Mb S. cerevisiae 13 Mb INSECTS.
Mouse Genome Sequencing
Solanum lycopersicum Chromosome 4 Sequencing Update SOL Germany– October 2008 Wellcome Trust Medical Photographic Library.
CUGI Pilot Sequencing/Assembly Projects Christopher Saski.
Update tomato chr. 6 Roeland van Ham Centre for BioSystems Genomics The Netherlands.
These images illustrate results of in situ hybridization with eight tomato BACs. Putative locations for the BACs are included in the captions. All of these.
Solanum lycopersicum Chromosome 4 Sequencing Update UK-SOL– Dec 2008 Wellcome Trust Medical Photographic Library.
Current Sequencing Status of Tomato Chromosome 2 PLANT GENOME RESEARCH CENTER, KRIBB, KOREA Sanghyeob Lee Sung-Hwan Jo Dal-Hoe Koo Chul-Goo Hur Hong-Seok.
4th Solanaceae Genome Workshop 2007, September 09th- 13th, Jeju Island, Korea THE FRENCH CONTRIBUTION TO THE INTERNATIONAL TOMATO GENOME SEQUENCING PROGRAM.
FINISHING WORKSHOP APRIL 2008 CHROMOSOME 7 THE FRENCH CONTRIBUTION TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966 T0731 TM15.
Restriction Fragments and Mapping Restriction Fragment Analysis – System used to compare the genes and DNA sequences between individuals in a population.
Vervet Monkey Genomics: Genome Canada and Génome Québec Physical Map Project J. Wasserscheid, G. Leveque, C. Nagy, C. Pinsonnault, and K. Dewar, McGill.
Chromosome 2 Doil Choi, Sunghwan Jo KOREA. Cytological architecture of chromosome kb/µm DAPI (4’-6-diamidino-2-phenylindole) stained pachytene chromosome.
Progress tomato chromosome 6 René Klein Lankhorst.
Linkage and Mapping. Figure 4-8 For linked genes, recombinant frequencies are less than 50 percent.
Wageningen, April 24-25, 2008 II Tomato Finishing Workshop Chromosome 12 Update ENEA, Rome University of Naples ‘Federico II’ CRIBI and Univ. of Padua.
HeterochromatinEuchromatin Relative chromosome length Relative bivalent diameter X 1.23 X 1.00 Relative area Relative optical density.
Human Genome.
Solanum lycopersicum Chromosome 4 Mapping and Finishing Update SRC-UK and Wellcome Trust Sanger Institute SOL Korea – September 2007 Wellcome Trust Medical.
A B C D E F G H I J K FigS1. Supplemental Figure S1. Evolutionary relationships of Arabidopsis and tomato Aux/IAA proteins. The evolutionary history was.
Simple-Sequence Length Polymorphisms SSLPs Short tandemly repeated DNA sequences that are present in variable copy numbers at a given locus. Scattered.
BIOL 433 Plant Genetics Term 2, Instructors: Dr. George Haughn Dr. Ljerka Kunst BioSciences 2239BioSciences Tel
16 th April 2007 Christine Nicholson, Mapping Core Group Wellcome Trust Sanger Institute Tomato Chromosome 4 Mapping & Use of FPC Copyright Wellcome Trust.
26 th July 2006 Christine Nicholson, Mapping Core Group Karen McLaren, Finishing Group Leader Wellcome Trust Sanger Institute Sequencing the Gene Space.
US Contribution to the International Tomato Genome Sequencing Effort Current structure of contributions Ongoing activity summary Funding issues.
Genome Analysis. This involves finding out the: order of the bases in the DNA location of genes parts of the DNA that controls the activity of the genes.
Simple-Sequence Length Polymorphisms
Physical Mapping of the ASGR-Carrier Chromosome Reveals Collinearity to Chromosome 2 of Sorghum and Foxtail Millet Outside the ASGR Sirjan K. Sapkota,
Restriction Fragments and Mapping
BIOL 433 Plant Genetics Term 2,
Construction of clone-based physical maps suitable for sequencing the human genome. The first-generation physical maps of human chromosomes constructed.
Stephen W Scherer, Joseph Cheung  Current Biology 
Today’s Lecture Genetic mapping studies: two approaches
Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome.
Peter John M.Phil, PhD Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences & Technology (NUST)
BAC-Based Physical Map of the Rice Genome.
Development of genome sequencing infrastructure and progress toward sequencing of chromosomes 1, 10 and 11 Steve Tanksley, Cornell U Steve Stack, Colorado.
Status of the US contribution to the international
TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966 T0731 TM15 T1347 T1257 T0848 THE FRENCH CONTRIBUTION TO THE INTERNATIONAL.
Transcription within a Functional Human Centromere
Sequencing update of tomato chromosome 3 Chinese Academy of Sciences
Neocentromeres: Role in Human Disease, Evolution, and Centromere Study
The Release 5.1 Annotation of Drosophila melanogaster Heterochromatin
New Class Offering.
Integrated Cytogenetic Map of Chromosome Arm 4S of A. thaliana
BIOL 433 Plant Genetics Term 2,
Tomato FISH Song-Bin Chang Suzanne Royer Lorrie Anderson Steve Stack.
Introduction to Sequencing
Disruption of Contactin 4 (CNTN4) Results in Developmental Delay and Other Features of 3p Deletion Syndrome  Thomas Fernandez, Thomas Morgan, Nicole Davis,
Kami Ahmad, Steven Henikoff  Molecular Cell 
The Role of Fluorescence in situ hybridization (FISH)
Physical mapping Physical localisation on a chromosome
José E. Mejía, Adrian Willmott, Elaine Levy, William C
Linkage, crossing over and chromosomal mapping (2).
Whole-chromosome view of gene density, inverse environmental variation, inverse total variation, and open chromatin signature. Whole-chromosome view of.
Presentation transcript:

Using fluorescence in situ hybridization (FISH) to improve assembly of the tomato genome sequence Lindsay A. Shearer1, Lorinda K. Anderson1, Hans de Jong2, Jose Luis Goicoechea3, Bruce A. Roe4, Axin Hua4, James J. Giovannoni5, Stephen M. Stack1 Fluorescence In Situ Hybridization and Optical Mapping to Correct Scaffold Arrangement in the Tomato Genome G3 (2014) volume 4:1395-1405

When the tomato sequencing project was being planned 13 years ago, the small Arabidopsis thaliana genome (1C = 157 Mb) had just been sequenced using overlapping DNA inserts in bacterial artificial chromosomes (BACs)

6X as much work Tomato 1C = ~916 Mb Arabidopsis 1C = 157 Mb X 6 (2n = 2x = 10) X 6 Tomato 1C = ~916 Mb Solanum lycopersicum 6X as much work

75% heterochromatin, 25% euchromatin Repeat-rich, gene poor 75% heterochromatin, 25% euchromatin

75% heterochromatin, 25% euchromatin Repeat-poor, gene rich 75% heterochromatin, 25% euchromatin

90% of genes 75% heterochromatin, 25% euchromatin

Arabidopsis 1C = 157 Mb Tomato 1C = 916 Mb X 0.25 229 Mb

To guide sequencing to euchromatin by Fluorescence in situ Hybridization (FISH) of DNA in Bacterial Artificial Chromosomes (BACs) from tomato BAC libraries to tomato pachytene chromosomes that were spread by hypotonic bursting.

Pachytene chromosome spread Phase contrast DAPI 10 µm by hypotonic bursting

10 µm

SC 12

Improved sequencing technologies 454 & Illumina

DNA pseudomolecule representing one chromosome Sequenced scaffolds 1-4 1 2 3 4 Head (end of short arm) Tail (end of long arm) 1-2 2-3 3-4 Gaps 1-2, 2-3, 3-4

≈ chromosomes Tomato DNA pseudomolecules 1 2 3 4 5 6 7 8 9 10 11 12 1 2 3 4 5 6 7 8 9 10 11 12 49.9 Mb 46.0 Mb 53.4 Mb 65.5 Mb 63.0 Mb 64.8 Mb 64.1 Mb 65.0 Mb 65.3 Mb 64.8 Mb 67.7 Mb Tomato DNA pseudomolecules ≈ chromosomes 90.3Mb

? = Head = End of short arm Ideally order and orientation of 1 Head = End of short arm ? = Ideally order and orientation of scaffolds in the DNA pseudomolecule for chromosome 1 Corresponding straightened pachytene chromosome 1 Tail = End of long arm

Kazusa Linkage MAP for tomato Chromosome 1 Pseudomolecule Mapped DNA sequences were identified in scaffolds. Chromosome 1 Kazusa Linkage MAP for tomato Chromosome 1 Pseudomolecule

Fluorescence in situ Hybridization (FISH) of scaffold sequence DNA in Bacterial Artificial Chromosomes (BACs) to tomato pachytene chromosomes.

Pseudomolecule or location of one BAC in a scaffold by FISH Identification of one mapped molecular marker in a scaffold or location of one BAC in a scaffold by FISH permits assignment of the scaffold to a linkage group/chromosome/pseudomolecule and indicates the order of scaffolds, but their not orientation. 1 2 3 4 Head (End of short arm) Tail (end of long arm) 1 2 3 Pseudomolecule

One molecular marker or one BAC located by FISH on a scaffold does not permit orientation of the scaffold in regard to the head and tail of the pseudomolecule. Orientation ? 1 2 3 4 Head (End of short arm) Tail (end of long arm) 1 2 3

Two or More mapped molecular markers on a scaffold or two or more BACs located by FISH on a scaffold determine scaffold head/tail orientation Head (End of short arm) Tail (end of long arm) 1 2 3 4 1 2 3 Gaps 1-3 a b c d e f g h i

The two methods usually agree in euchromatin, Idiogram of straightened tomato pachytene chromosomes 1 2 3 4 5 6 7 8 9 10 11 12 Eu- Heterochromatin Euchromatin The two methods usually agree in euchromatin, but often disagree in pericentric heterochromatin Where 75% of the DNA is located.

Tomato DNA Pseudomolecules Nature (2012) 485:635-641 1 2 3 4 5 6 7 8 9 10 11 12 49.9 Mb 46.0 Mb 53.4 Mb 65.5 Mb 63.0 Mb 64.8 Mb 64.1 Mb 65.0 Mb 65.3 Mb 64.8 Mb 67.7 Mb Tomato DNA Pseudomolecules Nature (2012) 485:635-641 90.3Mb

FISH results should be given preference for ordering and orienting scaffolds in tomato pseudomolecules for the following reasons:

1) The linkage map method only works if there is mapped sequence in a scaffold. This may or may not be the case for small scaffolds and scaffolds in heterochromatin, but known BACs are always available in scaffolds for use in FISH.

2) FISH is a more direct method. The linkage map method only works if there is mapped sequence in a scaffold. 2) FISH is a more direct method.

12

Chromosome 1 Pachytene Linkage map Pseudomolecule ? =

3) FISH is accurate in euchromatin and in heterochromatin. The linkage map method only works if there is mapped sequence in a scaffold. 2) FISH is a more direct method. 3) FISH is accurate in euchromatin and in heterochromatin.

75% of genomic DNA is in pericentric heterochromatin where crossing over is suppressed.

Chromosome 1 Kazusa Linkage MAP Chromosome 1 Pseudomolecule

BAC/FISH can locate scaffolds accurately in euchromatin and heterochromatin without reference to crossing over. 12

4) Optical sequencing agrees with FISH. The linkage map method only works if there is mapped sequence in a scaffold. 2) FISH is a more direct method. 3) FISH is accurate in euchromatin and in heterochromatin. 4) Optical sequencing agrees with FISH. Optical sequencing is an independent physical method for ordering and orienting scaffolds in both euchromatin and in heterochromatin.

While optical mapping only covered 38 scaffolds (= 32% of the sequenced tomato genome), optical mapping results were invariably compatible with FISH results, while they were incompatible with the linkage map for 16 out of 38 scaffolds covered.

5) FISH permits estimates of gaps sizes between scaffolds, The linkage map method only works if there is mapped sequence in a scaffold. 2) FISH is a more direct method. 3) FISH is accurate in euchromatin and in heterochromatin. 4) Optical sequencing agrees with FISH. 5) FISH permits estimates of gaps sizes between scaffolds, while the linkage map does not.

Density of DNA (Mb/µm SC) For this we first determined the amount of DNA/µm for tomato pachytene chromosomes (= SCs) in: Density of DNA (Mb/µm SC) Chromatin type 1.5 euchromatin 8.5 heterochromatin 3.5 centromere

Pseudomolecule Then we localized a BAC at each end of every scaffold and measured gap lengths in micrometers. Scaffold 1-4 order 1 2 3 4 Head (End of short arm) Tail (end of long arm) 1 2 3 Gaps 1-3 Pseudomolecule

BAC s083L21 Tail – scaffold 3 BAC E016I22 Head – scaffold 2 Gap 3-2 (centromere) Chromosome 1

Gap 3-2 = 0.7 µm of centromeric chromatin X 3.5 Mb/µm centromeric chromatin 2.45 Mb in chromosome 1 gap 3-2

Gap sizes were measured between all 91 scaffolds on all 12 chromosomes Tail – scaffold 6 Head – scaffold 6 Tail – scaffold 8 Head – scaffold 8 Scaffold 6

Chromosome 1 Pseudomolecule http://solgenomics.net head centromere tail centromere Chromosome 1 =100.2 Mb Chromosome 1 Pseudomolecule ____________ http://solgenomics.net

6) FISH can localize chromosome 0 BACs The linkage map method only works if there is mapped sequence in a scaffold. 2) FISH is a more direct method. 3) FISH is accurate in euchromatin and in heterochromatin. 4) Optical sequencing agrees with FISH. 5) FISH permits estimates of gaps sizes. 6) FISH can localize chromosome 0 BACs

Chromosome 0 BACs are sequenced BACs with no linkage mapped sequences, and their sequences do not fit into any scaffold.

Chromosome 9

Chromosome 9

7) Finally, FISH relates DNA pseudomolecules directly to The linkage map method only works if there is mapped sequence in a scaffold. 2) FISH is a more direct method. 3) FISH is accurate in euchromatin and in heterochromatin. 4) Optical sequencing agrees with FISH. 5) FISH permits estimates of gaps sizes. 6) FISH can localize chromosome 0 BACs 7) Finally, FISH relates DNA pseudomolecules directly to chromosome structure, unlike the linkage map.

Using FISH, we have ordered and oriented all 91 scaffolds and determined gap sizes for the 12 tomato pseudomolecules

Discrepancies are shown in red between ordering and orienting scaffolds using the linkage map versus FISH. They vary from chromosome to chromosome, but they are mostly in pericentric heterochromatin where a lack of crossing over makes the linkage map inaccurate.

Based on FISH, scaffolds that were ordered and oriented using the linkage map can be in the wrong order and/or orientation (inverted), and one case, two scaffolds overlapped.

Chromosome/pseudomolecule 8 as an example of linkage map-based versus FISH-based order and orientation of scaffolds and gap sizes

In summary, FISH rearranged 45 of the 91 scaffolds. This amounts to rearranging 34% of the total tomato genome.

See the Sol Genomics website Because of its advantages, the order and orientation of scaffolds determined by FISH is now used in the newest build (SL2.50) of the tomato genome. See the Sol Genomics website https://solgenomics.net/organism/Solanum_lycopersicum/genome

A new estimate of genome size for tomato Scaffold total 759.9 Mb Gap total 40.1 Mb Nucleolus organizer + 25.5 Mb 825.5 Mb

FISH can be used to correct linkage maps, especially in heterochromatin.

And finally, these results suggest similar errors exist in other sequencing projects that ordered and oriented scaffolds using linkage maps, and these errors can be corrected using BAC/FISH and/or optical mapping.

Sol Genomics website: http://solgenomics.net Chromosome/pseudomolecule 8 as an example of linkage map-based versus FISH-based order and orientation of scaffolds and gap sizes Sol Genomics website: http://solgenomics.net

Because of the advantages of FISH FISH is a more direct method. FISH is accurate in both eu- and heterochromatin. 3) Optical sequencing agrees with FISH. 4) FISH permits estimates of gap sizes. 5) FISH can localize chromosome 0 BACs. Sol Genomics website http://solgenomics.net

Scaffold 8 Tail – scaffold 6 Tail – scaffold 8 Head – scaffold 8 Head – scaffold 6 Scaffold 6

head tail Chromosome 1 Pseudomolecule

Tail – scaffold 3 BAC s083L21 Head – scaffold 2 BAC E016I22 Gap 3-2 (centromere)

3.5 Mb/µm X 0.7 µm = 2.4 Mb head centromere ____________ tail Chromosome 1 =100.2 Mb ____________ 3.5 Mb/µm X 0.7 µm = 2.4 Mb

Gaps and Zero BACs

10 µm

A B

1.54 Mb per micrometer in euchromatin 3.06 μm 2.35 μm 3.43 μm 6.26 μm 85% II 4.60 μm Dan Peterson et al. (1996) 1.54 Mb per micrometer in euchromatin 9.22 Mb per micrometer in heterochromatin

3.06 μm 2.35 μm 3.43 μm 6.26 μm 85% II 4.60 μm

Tomato EXPEN 2000 Chromosome 1 Linkage Map

Tomato EXPEN 2000 Chromosome 1 Linkage Map

BAC E021C24 BAC s040G18 Scaffold 6

Scaffold 8 BAC s121I01 BAC s022L14

head tail

head centromere tail Chromosome 1 =100.2 Mb

(Solanum lycopersicum) Tomato (Solanum lycopersicum) 1C = 0.95 pg = ~ 917 Mb 2n = 2x = 24 chromosomes

One Molecular Marker per Scaffold? Scaffold 1-4 order 1 2 3 4 Head (End of short arm) Tail (end of long arm) 1 2 3 Gaps 1-3

BAC FISH probes can locate scaffolds accurately on chromosome structure regardless of crossover patterns.

1.5 Mb/µm of SC in euchromatin 8.5 Mb/µm of SC in heterochromatin 3.5 Mb/µm of SC in centromeres

head tail Chromosome 1 Pseudomolecule

1 2 3 4 5 6 7 8 9 10 11 12