Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Prometheus I Taxonomic Database POET OODB Jessie Kennedy, Cédric Rageneaud.

Slides:



Advertisements
Similar presentations
The unexpanded ontology. Partially expanded structural hierarchy, tree connections represent part-of relations; note types-of structures are also shown.
Advertisements

22. Calochortus greenei S. Watson, Proc. Amer. Acad. Arts. 14: Greene’s mariposa-lily Stems usually branching, 1–3 dm. Leaves: basal persistent,
Alyxia Banks ex R.Br. Alyxia rubricaulis subsp poyaensis Boiteau Alyxia rubricaulis (Baill.) Guillaumin genus: species: subspecies:
SDD: Structured Descriptive Data Gregor Hagedorn (Germany) Bob Morris (USA) Kevin Thiele (Australia)
Classification of Organisms. Categories of Biological Classification Scientists Assign Organisms Two-Word Names 2,000 yrs ago, Aristotle grouped plants.
Chapter 18 – Classification
Nomenclature is the science of naming organisms Evolution has created an enormous diversity, so how do we deal with it? Names allow us to talk about groups.
Managing Data Resources
Introduction to Databases Transparencies
Taxonomy To sort organisms into species To classify species into higher taxonomic levels A taxon is a taxonomic unit at any level; for example “Mammalia”
Attribute databases. GIS Definition Diagram Output Query Results.
A Beginners Guide to Understanding Taxonomy, Names and Concepts Jessie Kennedy Napier University.
Modern Systems Analysis and Design Third Edition
Business Domain Modelling Principles Theory and Practice HYPERCUBE Ltd 7 CURTAIN RD, LONDON EC2A 3LT Mike Bennett, Hypercube Ltd.
INTRODUCTION TO DATABASE USING MS ACCESS 2013 PART 2 NOVEMBER 4, 2014.
Chapter 4 The Relational Model.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
Data Presentation.
Representing taxonomy MarBEF-IODE workshop Oostende, March 2007.
Classification and Systematics Tracing phylogeny is one of the main goals of systematics, the study of biological diversity in an evolutionary context.
Database Processing: Fundamentals, Design and Implementation, 9/e by David M. KroenkeChapter 2/1 Copyright © 2004 Please……. No Food Or Drink in the class.
Architecture for a Database System
311: Management Information Systems Database Systems Chapter 3.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
The Prometheus Database for Plant Taxonomy Cédric Raguenaud, Jessie Kennedy, Peter Barclay Napier University, Edinburgh
Component 4: Introduction to Information and Computer Science Unit 6: Databases and SQL Lecture 2 This material was developed by Oregon Health & Science.
Morphological variability in Melampyrum ( Orobanchaceae ) Milan Štech Faculty of Biological Sciences University of South Bohemia Czech Republic.
Approaches to Storing and Querying Structural Information in Botanical Specimen Descriptions. Trevor Paterson bbsrc biotechnology and biological sciences.
1 Concepts of Variables Greg C Elvers, Ph.D.. 2 Levels of Measurement When we observe and record a variable, it has characteristics that influence the.
Poaceae Gramineae An Re-Introduction.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
Chapter 4 Data and Databases. Learning Objectives Upon successful completion of this chapter, you will be able to: Describe the differences between data,
Chapter 8 Page Case Study – The seeds that poisoned Identification is important – knowing the difference between poisonous and harmless plants.
UNIT 6 - Evolution SWBAT compare the relatedness of various species by applying taxonomic principles (cladistics, phylogeny, morphology and DNA.
CS212: Object Oriented Analysis and Design Lecture 13: Relationship between Classes.
Component 4/Unit 6b Topic II Relational Databases Keys and relationships Data modeling Database acquisition Database Management System (DBMS) Database.
Week III  Recap from Last Week Review Classes Review Domain Model for EU-Bid & EU-Lease Aggregation Example (Reservation) Attribute Properties.
What is Object-Oriented?  Organization of software as a collection of discreet objects that incorporate both data structure and behavior.
Classification Chapter 8. Learning Outcomes By the end of this week, you should:  recognise the value of identification and scientific naming (nomenclature).
1Mr.Mohammed Abu Roqyah. Database System Concepts and Architecture 2Mr.Mohammed Abu Roqyah.
Chapter 14 Abstract Classes and Interfaces. Abstract Classes An abstract class extracts common features and functionality of a family of objects An abstract.
Object-Oriented Analysis and Design CHAPTERS 9, 31: DOMAIN MODELS 1.
Phylogeny & the Tree of Life
DBMS_Week 3-4 DBMS. Three-Schema Architecture – Internal schema (one view) describes physical storage structures access paths, indexes used Typically.
CLASSIFICATION Why Classify?. INQUIRY ACTIVITY 1) Construct a table with six rows and six columns. Label each row with the name of a different fruit.
Identifying, Naming, and Classifying Species
Fundamentals, Design, and Implementation, 9/e Appendix B The Semantic Object Model.
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
The Prometheus Database Project bbsrc biotechnology and biological sciences research council.
Lecture 5 Frames. Associative networks, rules or logic do not provide the ability to group facts into associated clusters or to associate relevant procedural.
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Appendix H: The Semantic Object Model.
Classification Biology I. Lesson Objectives Compare Aristotle’s and Linnaeus’s methods of classifying organisms. Explain how to write a scientific name.
Data Resource Management Lecture 8. Traditional File Processing Data are organized, stored, and processed in independent files of data records In traditional.
August 2008 Forestry Science I Unit 3 Lesson 3 Created by Ms. Holli Bowman FORS 7730 Fall 2001 Modified by Georgia Agricultural Education Curriculum Office.
LECTURE 6: Taxonomic hierarchy Just as a continent is divided into various nations, nations into countries, countries into states, states into provinces.
CIBIO/InBIOIICT Miguel Porto, Pedro Beja, Rui Figueira.
Prometheus II: Capturing and Relating Character Concept Definitions in Plant Taxonomy The Biological Problem Concepts describe objects and people invariably.
Names, Ranks, Clades, and Taxonomy Ontologies
Databases Chapter 16.
Phylogeny & the Tree of Life
Forestry Science I Unit 3 Lesson 3
Geographic Information Systems
Module 11 Classification of Organisms Brainpop - Classification
Organizing Life’s Diversity
Database.
File Systems and Databases
Issues in Knowledge Representation
Geographic Information Systems
Database management systems
Presentation transcript:

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Prometheus I Taxonomic Database POET OODB Jessie Kennedy, Cédric Rageneaud Mark Watson, Martin Pullan, Mark Newman, Peter Barclay Prometheus II (with Character Descriptions) Oracle RDB Sarah McDonald, Kate Armstrong, Trevor Paterson, Alan Cannon Prometheus Ir Oracle RDB Gordon Russell Alan Cumming

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy { Divide organisms into an hierarchical classification { Based on shared ‘characteristics’ { Classical: Morphology, Lifestyle, Habit flower structure, leaf shape, sexual mechanisms, fruit type { Modern: palaeontology, genetics/DNA, biochemistry genetic distance, emzymology, evolutionary relationships Classical Linnaean Taxonomy Taxonomic Characters

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy { Collect specimens for the group/taxon of interest { Perform an initial inspection of specimens and any previous descriptions and classifications { Decide ‘characters’ that will be interesting or useful to segregate specimens into sub taxa { Score ‘characters’ for each specimen on a paper proforma { Use shared ‘characters’ to sort specimens into groups (taxa) e.g. species, genus, family The Taxonomic (Revision) Process

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Some Problems with Taxonomy { Labour intensive { ‘Characters’ are poorly defined { A taxon revision is often the work of one individual and highly idiosyncratic { Only characters of interest (to this revision) are recorded { Raw data (the proforma) is often discarded { Character data is not easily compared between proforma sets - as definitions are not captured

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Proposals to ‘Improve’ Taxonomy { Use standardized, defined ‘terms’ to record character descriptions { Encourage the scoring of ‘quantitative characters’ (discourage ‘qualitative characters’) { Store description data in electronic/database form { Facilitate meaningful comparisons between character descriptions { Store character descriptions within a taxonomic database allowing taxon descriptions to be retrieved/generated

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Approach { Model the process of data capture as character descriptions { Develop an ‘ontology’ of ‘defined terms’ to use in character descriptions { Provide a database and interface for creating the ontology (terms, definitions, relationships between terms) { Extend the Prometheus hierarchical taxonomic database and interface for recording specimen character descriptions

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy The Prometheus II Character model What are Characters? Description Elements (ii) Qualitative DElifecycle, habit, orientation, shape, symmetry, texture, sex, colour, etc. (i)Quantitative DEangle, diameter, length, width, density, height, number (count)

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy DESCRIPTION Taxon Description Specimen Description Structure Term Property Term Value Unit State Term Frequency Modifier DESCRIPTION ELEMENT RELATIVE MODIFIER destination source Modifier Term Value Unit Statement

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy DESCRIPTION UNIT DESCRIPTION ELEMENT { Description Units may be a useful collection container for sets of DEs recording data about a particular structure { DUs might not be necessary if we have a rich structural ontology { DUs may provide a useful mechanism for duplicating instances of a structure

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy (i) Quantitative Characters (i) Qualitative Characters

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy {Need to define quantitative properties length of leaf - from apex to petiole { Need to capture ranges of values leaf length 20 to 50 mm, or less than 20mm {Need to relate DEs to other DEs leaf length is twice leaf width {Need to locate structures the hairs on the upper surface of a leaf at the apex {Need to represent multiple instances of a structure on the same specimen leaf (1) green and hairy, leaf (2) yellow and waxy COMPLICATIONS !

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy The Prometheus II Character model Provide Consistency and Comparability Defined Terms A Defined Term has a TERM leaf - DEFINITION big flappy thing - AUTHOR Kennedy - CITATION ‘Oor Wullie Annual’ (ID in Database) A Defined Term might be a.. - STRUCTURE TERM leaf, hair, apex - PROPERTY TERM texture - STATE TERM glabrous - MODIFIER TERM before, more than,.. - UNIT TERM

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy An Ontology of Defined Terms and Relations PROPERTY has STRUCTURE is a part of Structure Subtype STATE has implied EXCLUSIVE STATE SET has implied belongs to

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Ontology: Structural Hierarchy 1. Define PartOf Relations B poA CpoA EpoA DpoB EpoB DpoC BA DC D E E 3. Nodes define Unique Paths A BA DBA EBA CA DCA EA

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Ontology: Structural Hierarchy All possible Part_Of relations should be represented Some small Structures can appear on so many other structures that it is useful to consider them as Generic Structures and exclude them from the ontology hierarchy (e.g. hairs, glands, stomata) Regions are also excluded from the ontology, as they could be added anywhere (e.g. apex, base, margin, surface)

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy TYPES IN THE ONTOLOGY { Botanists and Taxonomists frequently refer to structures as Types_Of another structure { e.g. berries and capsules are types of fruit { the types share all identifying features of the supertype { but can be distinguished by possession of a collection of states that are always true { and might have a restricted set of potential subparts { e.g. berries always soft and fleshy: capsules dry and dehiscent { Not clear how we can easily accommodate Types_Of in the Part_Of hierarchy, but potentially very useful for generalizing queries

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy PROBLEMS WITH TYPES IN THE ONTOLOGY { should a subtype inherit all of its supertype’s potential Part_Of relations, or can these be restricted, and can it participate in type specific Part_Of relations ? { should alternate types be exclusive ? { can you have type hierarchies ? { e.g. bract is a type of leaf, and is part of an inflorescence { leaf is not included in the ontology as potentially part of inflorescence { the taxonomists can readily identify a bract as a type of leaf, but do not conceptualize a leaf as a potential part of an inflorescence { can we expect taxonomists to be ontologically rigorous

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy DEALING WITH TYPES IN THE ONTOLOGY Allow everything – create a rich type hierarchy, and allow types to behave identically to structures in the part_of hierarchy Allow limited rigorously defined types: { types form a set of exclusive alternatives { types represent the supertype structure plus more than one state that is always true { a type must be substitutable by its supertype RESULT: Horribly complicated network to handle in a relational database! Likely to be Ontologically inconsistent RESULT – very useful for query expansions, e.g. look up instances of a type and all its supertypes Allow a rich type_of Hierarchy, separate from the part_of hierarchy

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy PROFORMA ONTOLOGY { for describing a set of specimens { include all the defined terms necessary to describe the ‘characters’ of interest { created from the parent ontology { a subset of the Structures and State Terms { expanded by instantiating Regions and Generic Structures { the paths of the nodes inherited from the parent ontology are the same  but now need to be able to describe the paths of generic structures and regions relative to these

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy BA DC D E H 9 8 J F I E 7 L K E K ONTOLOGY BA DC D E E L spine hair Generic Structures lower surface upper surface apex base centre Regions PROFORMA ONTOLOGY

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy DUPLICATED STRUCTURES IN THE PROFORMA ONTOLOGY { A specimen description would frequently need to describe different sets and instances of a structure { e.g. several different ‘types of ‘ (sic) leaf { e.g. if it is clear that the basal leaves are different from the apical leaves { these leaves are ontologically identical, but need to be distinguished in the proforma so that their features are recorded independently { furthermore the taxonomist might want to score multiple instances of a structure if they have a range of values (e.g. length)

Napier University School of Computing Capturing Botanical Descriptions for Taxonomy BA DC D E E L BD E ‘Leaf’ #2 ‘Leaf’ #1 { The path of the Leaf structures B, D and E in the proforma ontology has to include its ‘clone’ identity. { When actually scoring specimens we might want to record data for multiple instances of each Leaf. { Description Units could be one mechanism to allow this. CLONED STRUCTURES