Plants.ensembl.org / www.transplantdb.eu The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic.

Slides:



Advertisements
Similar presentations
Genomics for Triticeae improvement FP7 European Project.
Advertisements

Sequencing the Maize Genome Maize Genome Sequencing Consortium
Maize Genetics, Genomics, Bioinformatics workshop
Dan Bolser, EMBL-EBI transPLANT portal: Overview and search Versailles, 12th-13th November 2012 trans-National Infrastructure for Plant Genomic Science.
Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.
2 Unité de Biométrie et d’Intelligence Artificielle (UBIA) INRA
Development of COS markers in grasses Isabelle Bertin, Pauline Stephenson and Michelle Leverington-Waite John Innes Centre.
Mission statement Barley (Hordeum vulgare L.) was one of the first domesticated cereal grains, originating in the Fertile Crescent.
Some Jolly Fun with Barley ESTs David Marshall & All the Folks in Computational Biology.
Whole Genome Sequencing &Crop Genetic Breeding Presentation: Wenhui Gao
The IWGSC: Building the sequence-based foundation for accelerated wheat breeding Kellye A. Eversole IWGSC Executive Director & The IWGSC Cereals for Food,
9 Genomics and Beyond Brief Chapter Outline
How to access genomic information using Ensembl August 2005.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
Evaluation of PacBio sequencing to improve the sunflower genome assembly Stéphane Muños & Jérôme Gouzy Presented by Nicolas Langlade Sunflower Genome Consortium.
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
The IWGSC: Strategies & Activities to Sequence the Bread Wheat Genome Kellye A. Eversole IWGSC Executive Director & The IWGSC Wheat Breeding 2014: Tools,
Puccinia graminis genome project Les J Szabo USDA ARS Cereal Disease Lab Department of Plant Pathology University of Minnesota.
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Genome Databases Computational Molecular Biology Biochem 218 – BioMedical Informatics.
GeVab: Genome Variation Analysis Browsing Server Korean BioInformation Center, KRIBB InCoB2009 KRIBB
Mouse Genome Sequencing
Tomato genome annotation pipeline in Cyrille2
What is SGN? S GN is a rapidly evolving comparative resource for the plants of the Solanaceae family, which includes important crop and model plants such.
Maps and Markers Gramene SAB Report Jan CMap Improvements Expanded, reorganized and hidden menus New map glyphs –Number of features –Crop map –Magnify.
Kerstin Howe, Mario Caccamo, Ian Sealy The Zebrafish Genome Sequencing Project Bioinformatics resources.
Rice Sequence and Map Analysis Leonid Teytelman. Rice Genome Annotation Sequence Alignments Automation Comparative Maps Genetic Marker Correspondences.
Genome Annotation and Databases Genomic DNA sequence Genomic annotation BIO520 BioinformaticsJim Lund Reading Ch 9, Ch10.
CUGI Pilot Sequencing/Assembly Projects Christopher Saski.
The New Zealand Institute for Plant & Food Research Limited Potato Genome Sequencing Consortium, notes from the edge Dr Susan Thomson, Dr Mark Fiers, Dr.
Whole genome scans to localise QTL X. Likely positionQTL Chromosome with mapped markers BAC Contig Spanning QTL region New MarkersCandidate Genes Fine.
Tomato Chromosome 4: A Mapping & Sequencing Update 28 th September 2005 Christine Nicholson Mapping Core Group Welcome Trust Sanger Institute, UK.
The progress of Glossina genomics at RIKEN GSC Todd Taylor RIKEN Genomic Sciences Center, Yokohama, Japan (on behalf of Masahira Hattori)
Genome Sequencing in the Legumes Le et al Phylogeny Major sequencing efforts Minor sequencing efforts ~14 MY ~45 MY.
APPLICATION OF MOLECULAR MARKERS FOR CHARACTERIZATION OF LATVIAN CROP PLANTS Nils Rostoks University of Latvia Vienošanās Nr. 2009/0218/1DP/ /09/APIA/VIAA/099.
DAY 1c: Accessing Completed Genomes 1. UCSC Genome Bioinformatics 2. Ensembl 3. NCBI Genomic Biology.
Solanum lycopersicum Chromosome 4 Sequencing Update UK-SOL– Dec 2008 Wellcome Trust Medical Photographic Library.
I. Introduction and Red Line Education for Data-unlimited Science.
Theobroma cacao Integrated Physical and Genetic Map 2 BAC Libraries 250 Genetic Markers.
Plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
INTRODUCTION ● Expressed sequence tags offer a low cost approach to gene discovery ● For a range of non-model organisms, ESTs represent the only sequence.
Comparative analyses of the potato and tomato transcriptomes
Data Mining in Ensembl with BioMart Giulietta Spudich.
The Genome Assemblies of Tasmanian Devil Zemin Ning The Wellcome Trust Sanger Institute.
Center for Integrated Fungal Research
Maize Genome Project Shiran Pasternak January 13, 2006 Gramene SAB Meeting San Diego, CA Shiran Pasternak January 13, 2006 Gramene SAB Meeting San Diego,
CASE7——RAD-seq for Grape genetic map construction
US Sequencing Project Funded by NSF Two-year project Start date: Sept 1, 2004 Follow-up project for full sequencing of chromosomes 1, 10 and 11.
A guided tour of Ensembl This quick tour will give you an outline view of what Ensembl is all about. You will learn: –Why we need Ensembl –What is in the.
BLAST Sequences queried against the nr or grass databases. GO ANALYSIS Contigs classified based on homology to known plant or fungal genes Next.
Accessing and visualizing genomics data
Genome Analysis Assaad text book slides only Lectures by F. Assaad can be downlaoded from muenchen.de/~farhah/index.htm.
454 Genome Sequence Assembly and Analysis HC70AL S Brandon Le & Min Chen.
Welcome to the combined BLAST and Genome Browser Tutorial.
1 Comparative analyses of the potato and tomato transcriptomes David Francis, AllenVan Deynze, John Hamilton, Walter De Jong, David Douches, Sanwen Huang,
Data Loading into Ensembl Database TGAC Browser
Sequencing and Assembly of the WheatD Genome using BAC Pools A Preliminary Study Daniela Puiu Sept 23rd 2013.
Denise Carvalho-Silva Ensembl Outreach
Figure 1. Phylogenetic tree of PDI gene promoter sequence of Triticum urartu (TU AA), Aegilops speltoides (AS BB) and Aegilops taushcii (TT DD) with three.
Gramene Technical Improvements
Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite.
Summary of Current Assembly
Pre-genomic era: finding your own clones
Additional file 9: Figure S2
Volume 8, Issue 6, Pages (June 2015)
Introduction to Bioinformatics II
TAMU Bovine QTL db and viewer
2 Unité de Biométrie et d’Intelligence Artificielle (UBIA) INRA
Cereal Genome Evolution: Grasses, line up and form a circle
The Potato Genome Sequencing Consortium: An Update
Presentation transcript:

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Dan Bolser, EMBL-EBI Triticeae in Ensembl Plants Poznań, 27th-28th June 2013 trans-National Infrastructure for Plant Genomic Science

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number INTRODUCTION

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Barley Hordeum vulgare An important cereal and model for adaption. Diploid – 7 chromosomes – 5.3Gb Genome – ~80% repeats Integrated gene-space and physical map. Triticeae crops Wheat Bread wheat Triticum aestivum Accounts for 20% of human calories and protein. Hexaploid (AA/BB/DD) – 7 chromosomes – 17Gb genome – ~80% repeats Currently only a fragmented assembly is available. Barley

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Triticeae crops WheatBarley

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number WHEAT

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Wheat sequence data Gene-space ‘sub- assemblies’ – 1,394,281 sub- assemblies – contigs and singletons Data provided: “in the syntenic context of Brachypodium distachyon” 117,411 (89%) mapped 6

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Wheat Wheat sub-assemblies, classified into A, B, D (and X) genomes, aligned to Brachypodium distachyon in Ensembl Genomes 7

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Wheat sub-assemblies and homoeologous SNPs Wheat sub-assemblies, classified into A, B, D (and X) genomes, aligned to Brachypodium distachyon in Ensembl Genomes, showing homoeologous SNPs (variations between the A, B and D genomes). 8

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Wheat sequence search

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Wheat sequence search Query Wheat sequence Brachy- podium

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number BARLEY

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Barley NOTES Gene-space assembly Integrated physical map Genome browser – Chromosomes and genes in Ensembl Plants – All the ‘features’ of Ensembl, Trees, Functional annotation

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Barley – Sequence data cv. Morex 5x Illumina GAII – 300b PE – 2.5kb PE 376k contigs > 1kb – 100k directly integrated into PM – + a hierarchical approach for other sequence data

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Barley – Gene & physical map data Gene calls Genes – 167Gb of RNA-Seq – 29k fl-cDNAs – 79k 'transcript clusters' – 26k 'High Confidence' genes (by homology) – 95% anchored on WGS contigs Physical map data Fingerprinted BACs – 600k BACs (14x) in six different BAC libraries – 10k FPC contigs with estimated n50 of 900kb – 500k x2 BES, 6k WGS Markers – 3000 gene-based – 500k sequence tags

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number SUMMARY

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Wheat Too fragmented for a genomic assembly Sub-assemblies and homoeologous SNPs shown in the syntenic context of Brachypodium distachyon – Small model grass Barley 26,000 high confidence genes called. 90% anchored on chromosomes. Standard Ensembl Plants analysis pipelines can be run… – Compara – Functional annotation – Variation 23

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Coming soon… Wheat Bread wheat ESTs and genomic sub-assemblies aligned to both brachypodium and barley – Wheat sequence search returns mapped hits for both Two new wheat genomes added Barley Revised and refined variation data for 11 genotypes. RNA-Seq data.

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Acknowledgements

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Questions?

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Alignment stats for wheat sub- assemblies on brachypodium Sub-Assemblies (88% singletons) Aligned to brachy. Full length alignment? A 123,383 (13%) 115,804 (94%) 114,375 (99%) B 158,440 (17%) 141,278 (89%) 138,438 (98%) D 156,976 (17%) 144,810 (92%) 142,635 (98%) X 510,480 (54%) 412,385 (81%) 402,049 (97%) Total949, ,277 (86%) 797,497 (98%)