Plant Phenotype Pilot Project AIM: To use ontologies in express and analyze plant phenotypes from multiple species The Issue: Traditional free text phenotype.

Slides:



Advertisements
Similar presentations
A Comparative mapping resource ONTOLOGY DEVELOPMENT AND INTEGRATION IN GRAMENE Pankaj Jaiswal Cornell University.
Advertisements

Annotation of Gene Function …and how thats useful to you.
The Arabidopsis Information Resource (TAIR)
Sook Jung, Taein Lee, Stephen Ficklin, Kate Evans, Cameron Peace and Dorrie Main.
Introduction to the Plant Ontology™ Laurel Cooper* 1, Justin Elser 1, Maria A. Gandolfo 3, Chris Mungall 4, Pankaj Jaiswal 1, Barry Smith 5, Dennis Wm.
Pankaj Jaiswal, Laurel Cooper, Laura Moore, RCN-Phenotype (Plant Group), Gramene RCN-Phenotype Meeting Bioshepere 2, Tucson, AZ February 21, 2014 Plant.
The Plant Ontology A Resource for Plant Genomics Laurel Cooper Department of Botany and Plant Pathology Oregon State University,
Gene Ontology John Pinney
Paula Mabee, University of South Dakota Eva Huala, Carnegie Institution for Science Andy Deans, North Carolina State University Suzanna Lewis, Lawrence.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Chapter 7: Molecular markers in breeding
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
PI of TAIR, The Arabidopsis Information Resource Carnegie Institution for Science, located at Stanford, California Background.
Experimental and computational assessment of conditionally essential genes in E. coli Chao WANG, Oct
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
During the 2007 season, over 150 mutants and genetic stocks were planted (April 20) in a field nursery and made available for researchers to visit during.
Clinical Trials, TCGA: Deep Integrative Research RT, Imaging, Pathology, “omics” Joel Saltz MD, PhD Director Center for Comprehensive Informatics.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Moving beyond free text. Authors Scientist does research Scientist publishes research results in journal article Old Paradigm:
PATO An ontology for phenotypes. The development of PATO is the work of George Gkoutos, supported by the NCBO, working in Cambridge.
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
Tae-Hyung Kim 1 Gil-Mi Ryu 1,2 InSong Koh 2 Jong Park 3 1.
The Plant Ontology: Linking Phenotypes and Genomics Across Plant Taxa Laurel D. Cooper* 1, Ramona L. Walls 2, Justin Elser 1, Justin Preece 1, Dennis W.
The Plant Ontology: Development of a Reference Ontology for all Plants Plant Ontology Consortium Members and Curators*: Laurel D.
Gramene Objectives Develop a database and tools to store, visualize and analyze data on genetics, genomics, proteomics, and biochemistry of grass plants.
Ontology development and use for efficient information input and retrieval 1 Alice Clara Augustine, Vijayalakshmi K, Shobha Char, Naveen Sylvester, Mittur.
SolGS: A Bioinformatics Solution for Genomic Selection Isaak Y Tecle, Naama Menda, Jeremy Edwards, Lukas Mueller.
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
1 Building Communities Around Ontology Development Pankaj Jaiswal Dept. of Plant Breeding and Genetics Cornell University Ithaca, NY FAO,
Paper prepared for presentation at the 16 th ICABR Conference – 128 th EAAE Seminar “The Political Economy of the Bioeconomy: Biotechnology and Biofuel”
Why do we need good quality annotations? Pankaj Jaiswal Oregon State University Gene Annotation Workshop July 31, 2010 ASPB Plant Biology 2010 Montreal,
The Plant Ontology Consortium website: Contact Information for deliverables Lincoln Stein,
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Rapid method to identify the mutated gene responsible for a trait A systems approach to understand biological mechanism High throughput sequencing to develop.
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Linking Animal Models and Human Diseases Supported by NIH P41 HG002659, U54 HG004028, & R01 HG Cambridge University & the University of Oregon.
IPG2P Working Group Update. iPG2P Final deliverable: – Procedure allowing an investigator to begin with trait of interest in species possessing limited.
The Plant Ontology Consortium Lincoln Stein 1, Susan McCouch 2, Elizabeth Kellogg 3, Seung Rhee 4, Pankaj Jaiswal 2, Doreen Ware 1, Peter Stevens 5 1 Cold.
Digesting the Genome Glut Promoting the Use and Extension of GMOD To Emerging Model Organisms David Clements 1 Brian Osborne 2 Hilmar Lapp 1 Xianhua Liu.
Gramene: Interactions with NSF Project on Molecular and Functional Diversity in the Maize Genome Maize PIs (Doebley, Buckler, Fulton, Gaut, Goodman, Holland,
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
The Plant Ontology: Development of a Reference Ontology for all Plants Plant Ontology Consortium Members and Curators*: Laurel D.
MaizeGDB: A Very Short Overview of a Database Resource for Biological Information on Zea mays Jack M. Gardiner ASPB 2010.
This tutorial will describe how to navigate the section of Gramene that provides descriptions of alleles associated with morphological, developmental,
Prospecting for Genes that Fueled the Green Revolution
Phenotype Curation Susan R. McCouch Department of Plant Breeding Cornell University.
Databases, Ontologies and Text mining Session Introduction Part 2 Carole Goble, University of Manchester, UK Dietrich Rebholz-Schuhmann, EBI, UK Philip.
Fig. S1. Amino acid sequence alignment of MYBS3 proteins. MYBS3 protein sequences of Arabidopsis thaliana (MYBH; NP_199550); (At3g16350; NP_188256), Glycine.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Implementing a National Data Infrastructure: Opportunities for the BIO Community Peter McCartney Program Director Division of Biological Infrastructure.
Plant structure and growth stage ontologies to describe phenotypes and gene expression in angiosperms Pankaj Jaiswal Cornell University.
Phenotype And Trait Ontology (PATO) and plant phenotypes
High throughput biology data management and data intensive computing drivers George Michaels.
Effect of Alcohol on Brain Development NormalFetal Alcohol Syndrome.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
Graduate Research with Bioinformatics Research Mentors Nancy Warter-Perez, ECE Robert Vellanoweth Chem and Biochem Fellow Sean Caonguyen 8/20/08.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Networks and Interactions
Behavior and Phenotype in GMOD Natural Diversity in GMOD
Databases, Ontologies and Text mining Session Introduction Part 2
The Teleost Anatomy Ontology: computable evolutionary morphology for teleost fishes Wasila Dahdul University of South Dakota & National Evolutionary Synthesis.
Testing the ABC floral-organ identity model: expression of A and C function genes Objectives: To test the validity of the ABC model for floral organ identity.
Phenoscape Data Jamboree 2
Functional Annotation of the Horse Genome
Genomes and Their Evolution
Welcome to the Gene and Allele Database Tutorial
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Plant Phenotype Pilot Project AIM: To use ontologies in express and analyze plant phenotypes from multiple species The Issue: Traditional free text phenotype descriptions are inadequate for large-scale computerized comparative analyses

4 Working Groups: Vertebrates Arthropods Plants Informatics Many fields of biology represented: Systematics Evolutionary biology Genetics/developmental biology Ecology Paleontology … Phenotype Ontology Research Coordination Network Unifying ideas: Shared Ontologies Shared tools and methods Best practices Community outreach

Challenges of managing phenotype data Extremely diverse data type (can range from expression profile to behavior) Can be associated to individuals, populations or species Different levels (summary, measurement data) Can be comparative (mutant vs. wild type) or absolute (days to flowering of a cultivar) Data integration - needs extensive connections to other types of data (seed stocks, genes, experimental methods, publications) Database schema and interface design Data representation - how to represent the data in a consistent way across experiments, research communities and species Data accessibility – how do we get data out of literature and into the database?

Collection of phenotype data- Who is involved? Species Genes included in project setSource Glycine max233SoyBase Solanum lycopersicum74SGN Medicago truncatula443LIS Zea mays324MaizeGDB Oryza sativa138PO/Gramene/Oryzabase Arabidopsis thaliana2400Lloyd and Meinke 2012

Phenotypic measurement Experimental treatment Genotype measured Data collection method Statistical method Reference genotype Control treatment Mutant yfg1-1 has narrow leaves and flowers early in short days Growth conditions Phenotype Data: Phenotype Summary: Leaves are 1 cm wide Image Data interpretation – preferably done by experimenter Pilot Project - limited scope: Mutant phenotypes (not natural variants) Emphasis on visual and morphological (no gene expression patterns) Summary data (not phenotype measurements)

Why use ontologies? Supplement, not replacement, for free text Provides standardized vocabulary – Dwarf, short stature, small plant, reduced height are different ways of expressing the same idea Provides relationships among terms – Vascular leaf is_a type of leaf – Leaf abscission zone part_of leaf – Leaf develops_from leaf primordium Makes computational approaches possible – Searches – Categorization – Network analysis, semantic similarity

Phenotypes of cloned genes Existing phenotype datasets: Existing reference ontologies Consistent and thorough set of ontology annotations Semantic similarity computational analysis Phenotypes of mutant loci, QTL Plant Ontology Gene Ontology ChEBI Ontology statements PATO Outline of Pilot Project Plant EO

From an ontological perspective, a phenotype is a combination of an entity and a quality that inheres in that entity inheres in Quality fused lobed increased mass increased rate Entity juvenile vascular leaf petal seed transpiration Phenotype name adherent leaf notched petal high yield increased water loss Phenotypes may also consist of two entities and a relationship between them: Relationship* fused with basal to Entity 1 juvenile vascular leaf gynoecium Entity 2 stem perianth Phenotypes and Ontologies: *in PATO, the relationship is called a “relational quality”

Examples of mutant phenotypes shared across species: Dwarf plants Rolled leaves

Examples Description of Mutant Phenotype Atomized Phenotype statements Entity Quality (PATO) Dwarf with profuse slender tillers, small panicles dwarf PO: shoot system decreased height profuse tillers PO: whole plant has extra parts of type (basal axillary shoot system) slender tillers PO: basal axillary shoot system slender small panicles PO: inflorescence decreased size Delayed flowering; Reduction in total chlorophyll GO: flowering delayed ChEBI: chlorophyll decreased concentration

Next steps: Data analysis Clustering of genes into pathways Degree of correlation between sequence and phenotype Computational prediction of gene candidates for uncloned mutant genes and QTL Apply lessons learned Is the data set big enough? Are the ontologies complete enough? Is our annotation consistency good enough? Better analysis methods?

Future Possibilities with cROP Expansion to use Protein Ontology Plant Ontology Gene Ontology PRO Ontology statements PATO Plant EO ChEBI

Acknowledgements Funding: NSF - Phenotype Ontology Research Coordination Network (RCN) Oregon State University: Laurel Cooper Pankaj Jaiswal Laura Moore University of Arizona: Ramona Walls (PO / iPlant) USDA-ARS-CICGRU: Steven Cannon, Scott Kalberer Carolyn Lawrence, Lisa Harper Rex Nelson, David Grant George Gkoutos (University of Aberystwyth) Anika Oellrich (EBI) Oklahoma State University: David Meinke Boyce Thompson Institute: Lukas Mueller (SGN) Naama Menda (SGN) Michigan State University: Johnny Lloyd U. Of Nottingham Sean May

Ontolog y