Solanum lycopersicum Chromosome 4 Sequencing Update UK-SOL– Dec 2008 Wellcome Trust Medical Photographic Library.

Slides:



Advertisements
Similar presentations
Sequencing the Maize Genome Maize Genome Sequencing Consortium
Advertisements

Sequencing the Maize (B73) Genome
Chr9 A ntonio Granell IBMCP-Valencia Spain Tomato Sequencing, Madison July 2006.
The Role of Fluorescence in situ hybridization (FISH) in Sequencing the Tomato Genome.
The International Tomato Sequencing Project: The first Cornerstone of the SOL Project Lukas Mueller on behalf of International SOL Tomato Sequencing Project.
US Tomato sequencing project update January 14, 2007.
PAA / Solanaceae July 23-27, Madison, Wisconsin, USA Sequencing the gene-rich space of tomato chromosome 7 Current status of the French effort.
Sequencing Status of the Chromosome 8 and New Marker Development toward a Genetic Map Construction between Micro-Tom and Ailsa Craig SOL Genomics Workshop.
1 Sequencing Workshop Koln, October 15, 2008 Toni Granell My apologies for not being here Thanks Joyce Chr9 sequencing update Spain.
Sequencing Tomato Chr9 Tomato Sequencing Meeting PAG XV 2007 San Diego, 14 January 2007 Antonio Granell IBMCP, Valencia.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Expanding the Tool Kit for BAC Extension Summary of completion criteria developed for NSF Tomato Sequencing Workshop January 14, 2007.
International Tomato Finishing Workshop Wellcome Trust Sanger Institute April 2007 Wellcome Trust Medical Photographic Library.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
UK -Tomato Chromosome Four Sarah Butcher Bioinformatics Support Service Centre For Bioinformatics Imperial College London
Use of FISH in sequencing tomato chromosome 6 René Klein Lankhorst Hans de Jong Korea meeting 2007.
Tomato Chromosome 8 sequencing at Kazusa DNA Research Institute Erika Asamizu.
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
EU-SOL 2008 November 13-16, Toulouse, FRANCE CHROMOSOME 7 SEQUENCING Current status and perspective TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497.
Mouse Genome Sequencing
Plant and Animal Genome Conference January 11, 2009.
Chromosome 8 Sequencing: Current Status and Future Prospects toward Finishing Shusei Sato, Erika Asamizu, Takakazu Kaneko, Hiroyuki Fukuoka, Satoshi Tabata.
What is SGN? S GN is a rapidly evolving comparative resource for the plants of the Solanaceae family, which includes important crop and model plants such.
Solanum lycopersicum Chromosome 4 Sequencing Update SOL Germany– October 2008 Wellcome Trust Medical Photographic Library.
Kerstin Howe, Mario Caccamo, Ian Sealy The Zebrafish Genome Sequencing Project Bioinformatics resources.
The New Zealand Institute for Plant & Food Research Limited Potato Genome Sequencing Consortium, notes from the edge Dr Susan Thomson, Dr Mark Fiers, Dr.
Tomato Chromosome 4: A Mapping & Sequencing Update 28 th September 2005 Christine Nicholson Mapping Core Group Welcome Trust Sanger Institute, UK.
SOL 2008 October 12-16, Cologne, Germany CHROMOSOME 7 THE FRENCH CONTRIBUTION TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966.
Tomato Overgo Project and Seed BAC Selection Cornell Team Ying Eileen Wang, 2005 PAG.
SIZE SELECT SHEAR Shotgun DNA Sequencing (Technology) DNA target sample LIGATE & CLONE Vector End Reads (Mates) SEQUENCE Primer.
The Changing Face of Sequencing
Current Sequencing Status of Tomato Chromosome 2 PLANT GENOME RESEARCH CENTER, KRIBB, KOREA Sanghyeob Lee Sung-Hwan Jo Dal-Hoe Koo Chul-Goo Hur Hong-Seok.
4th Solanaceae Genome Workshop 2007, September 09th- 13th, Jeju Island, Korea THE FRENCH CONTRIBUTION TO THE INTERNATIONAL TOMATO GENOME SEQUENCING PROGRAM.
FINISHING WORKSHOP APRIL 2008 CHROMOSOME 7 THE FRENCH CONTRIBUTION TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966 T0731 TM15.
Finishing tomato chromosomes #6 and #12 using a Next Generation whole genome shotgun approach Roeland van Ham, CBSG, NL René Klein Lankhorst, EUSOL Giovanni.
Chromosome 2 Doil Choi, Sunghwan Jo KOREA. Cytological architecture of chromosome kb/µm DAPI (4’-6-diamidino-2-phenylindole) stained pachytene chromosome.
Progress tomato chromosome 6 René Klein Lankhorst.
INDIAN INITIATIVE FOR TOMATO GENOME SEQUENCING Nagendra Singh National Research Centre on Plant Biotechnology Indian Agricultural Research Institute New.
Chromosome 12 M. Pietrella 1, G. Falcone 1, E. Fantini 1, A. Fiore 1, C. Perla 1, M.R. Ercolano 2, A. Barone 2, M.L. Chiusano 2, S. Grandillo 3, N. D’Agostino.
Chromosome 12 M. Pietrella 1, G. Falcone 1, E. Fantini 1, A. Fiore 1, M.R. Ercolano 2, A. Barone 2, M.L. Chiusano 2, S. Grandillo 3, N. D’Agostino 2, A.
Wageningen, April 24-25, 2008 II Tomato Finishing Workshop Chromosome 12 Update ENEA, Rome University of Naples ‘Federico II’ CRIBI and Univ. of Padua.
HeterochromatinEuchromatin Relative chromosome length Relative bivalent diameter X 1.23 X 1.00 Relative area Relative optical density.
Human Genome.
2nd TOMATO FINISHING WORKSHOP chromosome 9 Wageningen, April 24-25, 2008.
Italy: tomato chr. 12 Country Representative: Dr. Giovanni Giuliano Maria Luisa Chiusano Maria Raffaella Ercolano University.
The Genome Assemblies of Tasmanian Devil Zemin Ning The Wellcome Trust Sanger Institute.
Solanum lycopersicum Chromosome 4 Mapping and Finishing Update SRC-UK and Wellcome Trust Sanger Institute SOL Korea – September 2007 Wellcome Trust Medical.
Lindsay A. Shearer1, Lorinda K
The Wellcome Trust Sanger Institute
13 th January 2008 Plant & Animal Genome Conference Progress with Sequencing Tomato Chromosome 4 Clare Riddle Tomato Project Group Wellcome Trust Sanger.
16 th April 2007 Christine Nicholson, Mapping Core Group Wellcome Trust Sanger Institute Tomato Chromosome 4 Mapping & Use of FPC Copyright Wellcome Trust.
26 th July 2006 Christine Nicholson, Mapping Core Group Karen McLaren, Finishing Group Leader Wellcome Trust Sanger Institute Sequencing the Gene Space.
Chapter 5 Sequence Assembly: Assembling the Human Genome.
US Contribution to the International Tomato Genome Sequencing Effort Current structure of contributions Ongoing activity summary Funding issues.
CURRENT STATUS ON SEQUENCING OF CHROMOSOME 12 Mara Ercolano Ischia, 2005.
Tomato Sequencing Project Meeting at SOL 2008, Oct. 15, 2008
Pre-genomic era: finding your own clones
Stuff to Do.
Plant & Animal Genome Conference
Development of genome sequencing infrastructure and progress toward sequencing of chromosomes 1, 10 and 11 Steve Tanksley, Cornell U Steve Stack, Colorado.
Sequencing Chromosome 2
Status of the US contribution to the international
Progress on sequencing tomato chromosome 12
Progress in sequencing chromosome 6
Progress in sequencing chromosome 6
TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966 T0731 TM15 T1347 T1257 T0848 THE FRENCH CONTRIBUTION TO THE INTERNATIONAL.
Sequencing update of tomato chromosome 3 Chinese Academy of Sciences
Sequence the 3 billion base pairs of human
SOL Jeju Island, Korea 14th September 2007
The Potato Genome Sequencing Consortium: An Update
Presentation transcript:

Solanum lycopersicum Chromosome 4 Sequencing Update UK-SOL– Dec 2008 Wellcome Trust Medical Photographic Library

Summary of Project from WTSI Transfer of BAC selection/contig building QC- checking has moved to Imperial College London Call for your help in annotation Overview

Clone by Clone Sequencing Strategy Subcloning & Shotgun sequencing Overlapping Clones anchored by mapped markers Minimal tiling path “Finishing” Order contigs Gap closure Sequence Quality Contiguous Sequence < 1 error in 10,000 …….. TAGCTGTGTACGATGATC ………. Mapped Markers Computer assembly – paired plasmid reads BAC Library

Sequences Uploaded to SGN BAC Registry Updated Ready for Annotation Plasmid Prep Sequencing & Processing Clone DNA Prep Digest Confirmation Library Construction (plasmid) Clone Selection and Verification Clones entered into pipeline Overview of Clone Pipeline Shotgun Sequencing Finished Sequence Final EMBL submission “Complete Sequence” HTGS Phase 3 Mapping Subcloning Sequence Contigs >2Kb available on Sanger FTP site and Public Databases “Sequencing in Progress” BACs assigned to chr4 sequencing project on SGN BAC registry Sequence Improvement Contig Orientation and Gap Closure Confirmation of Assemby (QC) Finishing HTGS Phase 1 HTGS Phase 2

BAC Library & Map Resources Library No. of clones Average Insert Genome equivalents Fingerprints End Sequenced ? LE_HBa129, kb15 X 10x (88,000 AGI ) Yes (188,130) SL_MboI52, kb7 X 5x (43,000 WTSI) Yes (112,507) SL_EcoRI72, kb7 X- Yes (101,375) Tomato EXPEN-2000 map mapped markers across genome Chr4 mapped markers - Overgo analysis at Cornell BAC Libraries FPC Map Construction 1st FPC map build of HindIII Library by Arizona Genomics Institute 2nd FPC map build incorporated MboI Library mid 2006 at WTSI

Fosmid Library End Sequencing 150 plates (1-150) End Sequenced at WTSI December 2007 Approx. ~57600 fosmids ( FES) Fosmid Analysis (January 2008) reads with total bp count of giving average length = 658.4bp (after quality and vector clipping) (60.3% bases repeat masked) Hits within existing chromosome 4 BACs: 380 fosmids with good read pair alignments (within expected size range) 31 fosmids with bad read pair alignments Hits to single end 48 single ends - from fosmids with only 1 end sequenced 1027 single ends - from fosmids where only 1 end is found

Selection of Minimum Tile Path Fingerprinted BACs Markers Overlaps identified by fpc and BES alignment Seed BAC Anchored by marker (Cornell) Framework Markers in FPC Verify overlaps by colony pcr Anchor further BACs by hybridisation to marker sequences and FISH

Increasing Map Coverage using PseudoGoldenPath (PGP) Analysis MAP GAP Bridging clones identified from BES alignments to sequence Sequenced clones

FISH Map for Chr 4 on SGN FISH is used: to confirm BAC assignment to chr 4 to confirm contig order along chr 4 Steve Stack Dora Szinay, Hans DeJong

FISH Map for Chr 4 on SGN FISH is used: to confirm BAC assignment to chr 4 to confirm contig order along chr 4

WTSI Tomato Clone Pipeline Number of BACS Pipeline Stage Dec 2006 Dec 2007Dec 2008 Subcloning3517 Shotgun143 Assembly Start18 Auto-prefinishing168 Finishing910 8 QC Checking12 Finished Total Phase 3 Phase 1 Phase 2 HTGS:

Chr 4 Map and Sequence Update  Chromosome 4 estimate : 19 Mb of euchromatin  80 contigs with sequence December 2006 December 2007 December 2008 Total sequence5,007,106 bp12,590,598 bp19,018,752 bp Unique sequence 4,860,935 bp11,789,635 bp18,778,752 bp Total Finished Length 1,963,352 bp9,211,278 bp18,056,067 bp

Distribution of Contigs Centromere = Euchromatin = Heterochromatin {62 markers}{41 markers} {124 markers} 29 contigs11 contigs 36 contigs 55 BACs (27 markers) 59 BACs (16 markers) 63 BACs (61 markers) Average Contig Length = 250Kb Average BACs/Contig = 2.3 Largest Contigs = ~ Kb 227 markers mapped to Chr_4 Unordered: 4 contigs (5 BACs)

Some facts and figures ~81 contigs (80 contigs with sequence available). Average contig length is just under 250 kb. The average number of BACs per contig is 2.3. The largest sequence contigs are in the range of 450kb-500kb with 5 or 6 BACs.

Summary of Progress on Chromosome 4 81 map contigs have been built 119 BACs/44 contigs definitely on chr4 in FISH/ IL mapped 57 BACs under confirmation of Chr4 location (28 on SGN 29 to be placed after confirmed location) ~60 Markers for which BACs have not been identified. ~13 BACs have been sequenced to HTGS3 and placed on chr0, definitely not on chr4 (others initiated, in same contig etc but stopped in pipeline). 22 Missing markers missing sequence?

Summary of what we will do next 1) Confirm chr4 location of BACs that lack chr4 marker sequence and or have conflicting map location. IL mapping. 2) Use missing marker sequences to identify further BACs (3D pools) and confirm chr4 location using IL mapping. 3) Use 3D BAC pools to identify BACs to extend current contigs. 4) Analyse output from GS-FLX and GA-Illumina sequencing runs on cDNA from chr4 IL and parental lines to identify SNPs and further chr4 markers. 5) Use any markers from (4) to isolate further BACs for sequencing.

Analysis of cDNA sequence to identify Chr4 specific sequences 454 (GS-FLX)Illumina (GA) Chr4 IL lines22.11 MB0.45 GB LA716 pennellii17.25 MB1.9 GB Chr4 sub line8.47 MB1.5 GB Heinz cDNA 4.02 MB1 GB MB IL GB IL GB IL GB IL GB 6.9 GB

Call for your help Need your help in checking and verifying the automated annotation. Please respond to s in 2009 calling for help in annotation your favourite genes.

Acknowledgements Wellcome Trust Sanger Institute: Carol Churcher Jane Rogers Sean Humphray Clare Riddle and Mapping Core Group Karen McLaren and Finishing Team 46 Stuart McLaren and Pre-finishing Team 58 Christine Lloyd and QC Team 57 Karen Oliver Matt Jones Carol Scott Imperial College London: Gerard Bishop Daniel Buchan James Abbott Sarah Butcher Rosa Lopez-Cobollo University of Nottingham: Graham Seymour Scottish Crop Research Institute: Glenn Bryan Cornell University: Lukas Mueller Jim Giovannoni MIPS/IBI Institute for Bioinformatics: Klaus Mayer Remy Bruggmann FISH Resources Stephen Stack Group (Colorado) Hans de Jong (Wageningen) Dora Szinay (Wageningen) FUNDING