What is SGN? S GN is a rapidly evolving comparative resource for the plants of the Solanaceae family, which includes important crop and model plants such.

Slides:



Advertisements
Similar presentations
Genetic Map to Physical Map This activity is intended to supplement the workshop session entitled Integrating the Genetic and Physical Maps of Maize. However,
Advertisements

Mutation Analysis Server Nagarajanlab. © Copyright 2005, Washington University School of Medicine. 2 Agenda Mutation pipeline overview High level design.
Genome Structure/Mapping Lisa Malm 05/April/2006 VCR 221 Lisa Malm 05/April/2006 VCR 221.
Peter Tsai, Bioinformatics Institute.  University of California, Santa Cruz (UCSC)  A rapid and reliable display of any requested portion of genomes.
The International Tomato Sequencing Project: The first Cornerstone of the SOL Project Lukas Mueller on behalf of International SOL Tomato Sequencing Project.
US Tomato sequencing project update January 14, 2007.
9 Genomics and Beyond Brief Chapter Outline
Bioinformatics for the Canadian Potato Genome Project David De Koeyer, Martin Lagüe and Rebecca Griffiths Wageningen September 18, 2004.
Structural and Functional Genomics of Tomato Barone et al Tomato (Solanum Lycopersicon) – economically important crop worldwide, – intensively investigated.
PAA / Solanaceae July 23-27, Madison, Wisconsin, USA Sequencing the gene-rich space of tomato chromosome 7 Current status of the French effort.
Copyright OpenHelix. No use or reproduction without express written consent1 Organization of genomic data… Genome backbone: base position number sequence.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Expanding the Tool Kit for BAC Extension Summary of completion criteria developed for NSF Tomato Sequencing Workshop January 14, 2007.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Dept. of Plant Breeding & Genetics, Cornell University
Mouse Genome Sequencing
SOL Genomics Network Formed in 2003 to answer two questions: – How can a common set of genes give rise to such a wide range of morphologically and ecologically.
Genetic Map to Physical Map This activity is intended to supplement the workshop session entitled “Integrating the Genetic and Physical Maps of Maize”.
Plant and Animal Genome Conference January 11, 2009.
Tomato genome annotation pipeline in Cyrille2
Gramene Objectives Develop a database and tools to store, visualize and analyze data on genetics, genomics, proteomics, and biochemistry of grass plants.
Rice Sequence and Map Analysis Leonid Teytelman. Rice Genome Annotation Sequence Alignments Automation Comparative Maps Genetic Marker Correspondences.
Genome Annotation and Databases Genomic DNA sequence Genomic annotation BIO520 BioinformaticsJim Lund Reading Ch 9, Ch10.
The New Zealand Institute for Plant & Food Research Limited Potato Genome Sequencing Consortium, notes from the edge Dr Susan Thomson, Dr Mark Fiers, Dr.
Tomato Chromosome 4: A Mapping & Sequencing Update 28 th September 2005 Christine Nicholson Mapping Core Group Welcome Trust Sanger Institute, UK.
Tomato Overgo Project and Seed BAC Selection Cornell Team Ying Eileen Wang, 2005 PAG.
Copyright OpenHelix. No use or reproduction without express written consent 2 Overview of Genome Browsers Materials prepared by Warren C. Lathe, Ph.D.
Canadian Potato Genome Project (CPGP) (Molecular Determination of Tuber Health and Quality in the Potato) Barry Flinn SGII/BioAtlantech (Fredericton) Sharon.
Use cases for Tools at the Bovine Genome Database Apollo and Bovine QTL viewer.
Information System for Comparative Analysis of Legume Genomes Anita Dalwani Advisors: Dr. Roger Innes, Dr. Haixu Tang.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
Solanum lycopersicum Chromosome 4 Sequencing Update UK-SOL– Dec 2008 Wellcome Trust Medical Photographic Library.
4th Solanaceae Genome Workshop 2007, September 09th- 13th, Jeju Island, Korea THE FRENCH CONTRIBUTION TO THE INTERNATIONAL TOMATO GENOME SEQUENCING PROGRAM.
FINISHING WORKSHOP APRIL 2008 CHROMOSOME 7 THE FRENCH CONTRIBUTION TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966 T0731 TM15.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Chromosome 2 Doil Choi, Sunghwan Jo KOREA. Cytological architecture of chromosome kb/µm DAPI (4’-6-diamidino-2-phenylindole) stained pachytene chromosome.
Progress tomato chromosome 6 René Klein Lankhorst.
INTRODUCTION ● Expressed sequence tags offer a low cost approach to gene discovery ● For a range of non-model organisms, ESTs represent the only sequence.
Chromosome 12 M. Pietrella 1, G. Falcone 1, E. Fantini 1, A. Fiore 1, C. Perla 1, M.R. Ercolano 2, A. Barone 2, M.L. Chiusano 2, S. Grandillo 3, N. D’Agostino.
Chromosome 12 M. Pietrella 1, G. Falcone 1, E. Fantini 1, A. Fiore 1, M.R. Ercolano 2, A. Barone 2, M.L. Chiusano 2, S. Grandillo 3, N. D’Agostino 2, A.
Lettuce/Sunflower EST CGPDB project. Data analysis, assembly visualization and validation. Alexander Kozik, Brian Chan, Richard Michelmore. Department.
Human Genome.
Software and Databases for managing and selecting molecular markers General introduction Pathway approach for candidate gene identification and introduction.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
Genome Analysis Assaad text book slides only Lectures by F. Assaad can be downlaoded from muenchen.de/~farhah/index.htm.
16 th April 2007 Christine Nicholson, Mapping Core Group Wellcome Trust Sanger Institute Tomato Chromosome 4 Mapping & Use of FPC Copyright Wellcome Trust.
26 th July 2006 Christine Nicholson, Mapping Core Group Karen McLaren, Finishing Group Leader Wellcome Trust Sanger Institute Sequencing the Gene Space.
Gene Technologies and Human ApplicationsSection 3 Section 3: Gene Technologies in Detail Preview Bellringer Key Ideas Basic Tools for Genetic Manipulation.
Welcome to the combined BLAST and Genome Browser Tutorial.
Italy: tomato chr. 12 Country Representative: Dr. Giovanni Giuliano Contribution from Naples Funding Agency: Italian Ministry of Agriculture (MiPAF) Project.
US Contribution to the International Tomato Genome Sequencing Effort Current structure of contributions Ongoing activity summary Funding issues.
Comparative maps of potato, eggplant, pepper and Nicotiana with respect to the tomato genome Silvana Grandillo CNR-IGV, Portici (Naples), Italy March 4,
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
CURRENT STATUS ON SEQUENCING OF CHROMOSOME 12 Mara Ercolano Ischia, 2005.
HICF-based physical mapping of Mimulus guttatus and M. lewisii Anna Blenda 1, John Willis 2, Todd Vision 3, Eric Fang 1, Barbara.
Objectives: Outline the steps involved in sequencing the genome of an organism. Outline how gene sequencing allows for genome wide comparisons between.
Virginia Commonwealth University
Pre-genomic era: finding your own clones
Section 3: Gene Technologies in Detail
Lettuce/Sunflower EST CGPDB project.
Genomes and Their Evolution
Development of genome sequencing infrastructure and progress toward sequencing of chromosomes 1, 10 and 11 Steve Tanksley, Cornell U Steve Stack, Colorado.
Sequencing Chromosome 2
Geneomics and Database Mining and Genetic Mapping
Status of the US contribution to the international
Progress in sequencing chromosome 6
TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966 T0731 TM15 T1347 T1257 T0848 THE FRENCH CONTRIBUTION TO THE INTERNATIONAL.
Presentation transcript:

What is SGN? S GN is a rapidly evolving comparative resource for the plants of the Solanaceae family, which includes important crop and model plants such as potato, eggplant, pepper and tomato. SGN houses maps, markers, ESTs, unigenes, and phenotypic data. SGN is also part of the bioinformatics platform for the International Solanaceae Project (SOL), which is currently sequencing the euchromatic portion of the tomato genome. As this project progresses, SGN will evolve into a model organism database for tomato. ESTs and Unigenes As an important first type of sequence data, Expressed Sequence Tags (ESTs) are collected and assembled using a pipeline that has been developed at SGN. Sequences are first basecalled from chromatograms with Phred and loaded into the database. Subsequent steps of the custom pipeline trim the sequence according to quality scores and remove vector sequence. Sequences that are contaminated with bacterial or phage DNA, or that appear to be chimeric, are flagged and not used in unigene assembly. The Solanaceae Genomics Network: Data, Methods, Tools, and the Tomato Genome SGN's growing database currently contains information on over 3,000 markers from 5 interactive maps. Our marker collection includes microsatellite (SSR) markers, RFLP and CAPS based markers, and Conserved Ortholog Set (COS 1 and COSII 2 ) markers for comparative mapping between various Solanaceae and some non-Solanaceae species such as Arabidopsis. The collection also includes markers derived from ESTs and unigenes, from genomic sequences such as BACs, and from genes with known functions in tomato. A powerful and user-friendly marker search is also available, so a user can search markers by type, chromosome, position, map, species and other criteria. Beth Skwarecki, Nick Taylor, Teri Solow, Chenwei Lin, Eileen Wang, Emil Keyder, Miriam Wallace, Igor Dolgalev, Evan Herbst, Robert Ahrens, Mark Wright, Lukas Mueller, Steven Tanksley Maps and Markers SGN's collection includes interactive maps for tomato, potato, eggplant and soon pepper. All data for the maps is stored in a database and queried by our interactive map viewer. Through this viewer, a user can easily examine a map, compare it to other maps in the collection, zoom in on a region of interest, access marker information, and view the locations of anchored BACs. Introgression Line (IL) populations can also be seen on this map viewer. SGN's comparative viewer. Left to right: a centiMorgan ruler, IL lines, anchored BACs, the Tomato-EXPEN 1992 map, an enlarged section of interest, the Potato-TXB 1992 map, and a second ruler. Department of Plant Breeding, Cornell University ESTs enter SGN's pipeline as chromatograms like this one. The International Solanaceae Genome Project (SOL) The long-term goal of the SOL project is to create a network of map-based resources and information to address key questions in plant adaptation and diversification. The first cornerstone of this project is the sequencing of the gene-rich euchromatic portion (about 25%) of the 950 Mb tomato genome. Sequencing will occur on a BAC-by-BAC basis, using an existing physical map and newly sequenced BAC ends to determine a tiling path that covers the desired portion of the genome. All data on SGN is available without restrictions. SGN can be accessed on the Web at Ten participating countries are sequencing the BACs for their respective chromosomes. This chart can be found at Unigenes are assembled with Cap-3, after an initial pre- clustering stage that allows the assembly to be run in parallel. Comparisons to tomato mRNAs show that the unigenes are usually of high quality. Putting BACs on the Map The foundation for the sequencing project is a physical map. Overgo analysis has been used to match BACs to probes based on markers from the Tomato-EXPEN 2000 map. Analysis of the overgo results has found over 600 markers that unambiguously anchor over 5000 BACs to the genetic map. A Dutch AFLP map will be integrated with these results, bringing the number of anchor points to about These anchor points will be used as the “seeds” to start the sequencing process. The tiling path can then be extended from bac end sequences and FingerPrint Contigs (FPC). SGN's physical map. Green bars indicate the number of BACs anchored to each point on the Tomato-EXPEN 2000 genetic map. Some well known members of the Solanaceae family. Solanum lycopersicum Solanum tuberosum Solanum melongena Capsicum annuum Petunia hybrida Data Overview Maps and Markers ESTs and Unigenes An interactive display showing the member sequences of one SGN unigene. Technical Details Much of SGN's software is developed in-house and is open source. We write code in Perl and C. We also use tools like Blast, Phred, and Cap-3. Our systems run Linux, Apache, and MySQL. Future Plans SGN is always growing. In the next several months we expect to add new maps and markers, new BAC libraries, BAC sequences and BAC end sequences, and data from FISH experiments. A new annotation pipeline is also being developed, and a registry will keep track of the status of BACs that are being sequenced. 1 Fulton TM, Van der Hoeven R, Eannetta NT, Tanksley SD (2002) Identification, analysis, and utilization of conserved ortholog set markers for comparative genomics in higher plants. Plant Cell 14: Wu, Feinan, in preparation. Datasets are available from our FTP site at ftp://sgn.cornell.edu. BACs and the Physical Map 4857 good marker - BAC associations 652 markers are plausibly associated with BACs. 705 plausible contigs comprised of 1880 BACs 2166 BAC singletons 809 BACs are from 425 implausible contigs 3117 implausible marker-BAC associations 7235 ambiguous associations