Download presentation
Presentation is loading. Please wait.
Published byDominic Summerhill Modified over 9 years ago
1
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Prometheus I Taxonomic Database POET OODB Jessie Kennedy, Cédric Rageneaud Mark Watson, Martin Pullan, Mark Newman, Peter Barclay Prometheus II (with Character Descriptions) Oracle RDB Sarah McDonald, Kate Armstrong, Trevor Paterson, Alan Cannon Prometheus Ir Oracle RDB Gordon Russell Alan Cumming
2
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy { Divide organisms into an hierarchical classification { Based on shared ‘characteristics’ { Classical: Morphology, Lifestyle, Habit flower structure, leaf shape, sexual mechanisms, fruit type { Modern: palaeontology, genetics/DNA, biochemistry genetic distance, emzymology, evolutionary relationships Classical Linnaean Taxonomy Taxonomic Characters
3
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy { Collect specimens for the group/taxon of interest { Perform an initial inspection of specimens and any previous descriptions and classifications { Decide ‘characters’ that will be interesting or useful to segregate specimens into sub taxa { Score ‘characters’ for each specimen on a paper proforma { Use shared ‘characters’ to sort specimens into groups (taxa) e.g. species, genus, family The Taxonomic (Revision) Process
4
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Some Problems with Taxonomy { Labour intensive { ‘Characters’ are poorly defined { A taxon revision is often the work of one individual and highly idiosyncratic { Only characters of interest (to this revision) are recorded { Raw data (the proforma) is often discarded { Character data is not easily compared between proforma sets - as definitions are not captured
5
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Proposals to ‘Improve’ Taxonomy { Use standardized, defined ‘terms’ to record character descriptions { Encourage the scoring of ‘quantitative characters’ (discourage ‘qualitative characters’) { Store description data in electronic/database form { Facilitate meaningful comparisons between character descriptions { Store character descriptions within a taxonomic database allowing taxon descriptions to be retrieved/generated
6
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Approach { Model the process of data capture as character descriptions { Develop an ‘ontology’ of ‘defined terms’ to use in character descriptions { Provide a database and interface for creating the ontology (terms, definitions, relationships between terms) { Extend the Prometheus hierarchical taxonomic database and interface for recording specimen character descriptions
7
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy The Prometheus II Character model What are Characters? Description Elements (ii) Qualitative DElifecycle, habit, orientation, shape, symmetry, texture, sex, colour, etc. (i)Quantitative DEangle, diameter, length, width, density, height, number (count)
8
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy DESCRIPTION Taxon Description Specimen Description Structure Term Property Term Value Unit State Term Frequency Modifier DESCRIPTION ELEMENT RELATIVE MODIFIER destination source Modifier Term Value Unit Statement
9
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy DESCRIPTION UNIT DESCRIPTION ELEMENT { Description Units may be a useful collection container for sets of DEs recording data about a particular structure { DUs might not be necessary if we have a rich structural ontology { DUs may provide a useful mechanism for duplicating instances of a structure
10
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy (i) Quantitative Characters (i) Qualitative Characters
11
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy {Need to define quantitative properties length of leaf - from apex to petiole { Need to capture ranges of values leaf length 20 to 50 mm, or less than 20mm {Need to relate DEs to other DEs leaf length is twice leaf width {Need to locate structures the hairs on the upper surface of a leaf at the apex {Need to represent multiple instances of a structure on the same specimen leaf (1) green and hairy, leaf (2) yellow and waxy COMPLICATIONS !
12
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy The Prometheus II Character model Provide Consistency and Comparability.............. Defined Terms A Defined Term has a...... - TERM leaf - DEFINITION big flappy thing - AUTHOR Kennedy - CITATION ‘Oor Wullie Annual’ 2003 - (ID in Database) A Defined Term might be a.. - STRUCTURE TERM leaf, hair, apex - PROPERTY TERM texture - STATE TERM glabrous - MODIFIER TERM before, more than,.. - UNIT TERM
13
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy An Ontology of Defined Terms and Relations PROPERTY has STRUCTURE is a part of Structure Subtype STATE has implied EXCLUSIVE STATE SET has implied belongs to
14
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Ontology: Structural Hierarchy 1. Define PartOf Relations B poA CpoA EpoA DpoB EpoB DpoC BA DC D E E 3. Nodes define Unique Paths A BA DBA EBA CA DCA EA 12345671234567 1 2 4 3 6 5 7
16
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy Ontology: Structural Hierarchy All possible Part_Of relations should be represented Some small Structures can appear on so many other structures that it is useful to consider them as Generic Structures and exclude them from the ontology hierarchy (e.g. hairs, glands, stomata) Regions are also excluded from the ontology, as they could be added anywhere (e.g. apex, base, margin, surface)
19
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy TYPES IN THE ONTOLOGY { Botanists and Taxonomists frequently refer to structures as Types_Of another structure { e.g. berries and capsules are types of fruit { the types share all identifying features of the supertype { but can be distinguished by possession of a collection of states that are always true { and might have a restricted set of potential subparts { e.g. berries always soft and fleshy: capsules dry and dehiscent { Not clear how we can easily accommodate Types_Of in the Part_Of hierarchy, but potentially very useful for generalizing queries
20
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy PROBLEMS WITH TYPES IN THE ONTOLOGY { should a subtype inherit all of its supertype’s potential Part_Of relations, or can these be restricted, and can it participate in type specific Part_Of relations ? { should alternate types be exclusive ? { can you have type hierarchies ? { e.g. bract is a type of leaf, and is part of an inflorescence { leaf is not included in the ontology as potentially part of inflorescence { the taxonomists can readily identify a bract as a type of leaf, but do not conceptualize a leaf as a potential part of an inflorescence { can we expect taxonomists to be ontologically rigorous
21
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy DEALING WITH TYPES IN THE ONTOLOGY Allow everything – create a rich type hierarchy, and allow types to behave identically to structures in the part_of hierarchy Allow limited rigorously defined types: { types form a set of exclusive alternatives { types represent the supertype structure plus more than one state that is always true { a type must be substitutable by its supertype RESULT: Horribly complicated network to handle in a relational database! Likely to be Ontologically inconsistent RESULT – very useful for query expansions, e.g. look up instances of a type and all its supertypes Allow a rich type_of Hierarchy, separate from the part_of hierarchy
23
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy PROFORMA ONTOLOGY { for describing a set of specimens { include all the defined terms necessary to describe the ‘characters’ of interest { created from the parent ontology { a subset of the Structures and State Terms { expanded by instantiating Regions and Generic Structures { the paths of the nodes inherited from the parent ontology are the same but now need to be able to describe the paths of generic structures and regions relative to these
24
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy BA DC D E 1 2 4 3 6 5 H 9 8 J 11 10 F I E 7 L 13 12 K E 14 15 K ONTOLOGY BA DC D E E 1 2 4 3 6 5 7 13 L spine hair Generic Structures lower surface upper surface apex base centre Regions PROFORMA ONTOLOGY
25
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy DUPLICATED STRUCTURES IN THE PROFORMA ONTOLOGY { A specimen description would frequently need to describe different sets and instances of a structure { e.g. several different ‘types of ‘ (sic) leaf { e.g. if it is clear that the basal leaves are different from the apical leaves { these leaves are ontologically identical, but need to be distinguished in the proforma so that their features are recorded independently { furthermore the taxonomist might want to score multiple instances of a structure if they have a range of values (e.g. length)
26
Napier University School of Computing Capturing Botanical Descriptions for Taxonomy BA DC D E E 1 2 4 3 6 5 7 13 L BD E 2 4 3 ‘Leaf’ #2 ‘Leaf’ #1 { The path of the Leaf structures B, D and E in the proforma ontology has to include its ‘clone’ identity. { When actually scoring specimens we might want to record data for multiple instances of each Leaf. { Description Units could be one mechanism to allow this. CLONED STRUCTURES
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.