TDWG 2007 Bratislava SPM from an SDD perspective: Generality and extensibility Gregor Hagedorn Federal Biological Research Center, Berlin, Germany.

Slides:



Advertisements
Similar presentations
Foundational Objects. Areas of coverage Technical objects Foundational objects Lessons learned from review of Use Case content Simple Study Simple Questionnaire.
Advertisements

The Library of Life Federated Description Services and the Library of Life or What can we do with SDD anyway? Kevin Thiele Centre for Biological Information.
What is a Flora? Peter Hovenkamp. What is not a Flora? Labwork/ecology paper Species selection on non-taxonomic criteria No identification tool Character.
Virtual University - Human Computer Interaction 1 © Imran Hussain | UMT Imran Hussain University of Management and Technology (UMT) Lecture 16 HCI PROCESS.
United Nations Statistics Division Principles and concepts of classifications.
SDD: Structured Descriptive Data Gregor Hagedorn (Germany) Bob Morris (USA) Kevin Thiele (Australia)
JYC: CSM17 BioinformaticsCSM17 Week 10: Summary, Conclusions, The Future.....? Bioinformatics is –the study of living systems –with respect to representation,
OASIS Reference Model for Service Oriented Architecture 1.0
Software Testing and Quality Assurance
JYC: CSM17 BioinformaticsCSM17 Week 10: Summary, Conclusions, The Future.....? Bioinformatics is –the study of living systems –with respect to representation,
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
CS 425/625 Software Engineering System Models
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
Object-Oriented Databases
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Design Patterns.
Foundations This chapter lays down the fundamental ideas and choices on which our approach is based. First, it identifies the needs of architects in the.
CORRELATIO NAL RESEARCH METHOD. The researcher wanted to determine if there is a significant relationship between the nursing personnel characteristics.
Species Banks a GBIF mechanism to provide electronic access to quality species information Peter H. Schalk, Marc Brugman ETI, University of Amsterdam Tinde.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
DR. AHMAD SHAHRUL NIZAM ISHA
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
Richard White Biodiversity Data. Outline Biodiversity: what is it? – Definitions: is biodiversity: A resource? Something which can be measured? How to.
Implementation Yaodong Bi. Introduction to Implementation Purposes of Implementation – Plan the system integrations required in each iteration – Distribute.
Morpho Activity Start Entering/Practicing with real data.
CDM Developer Workshop. TDWG Andreas Kohlbecker Taxonomic Workflow in the EDIT Platform for Cybertaxonomy Purpose What do you want from this workshop?
Chapter 17: Organizing Life’s Diversity
OWL and SDD Dave Thau University of Kansas
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 20 Object-Oriented.
Stat 1510: Statistical Thinking and Concepts 1 Density Curves and Normal Distribution.
CountryData Technologies for Data Exchange SDMX Information Model: An Introduction.
Patrick Leary 23 October, 2008 TDWG Fremantle Experiences With Species Profile Model.
Standards and tools for publishing biodiversity data Yu-Huang Wang June 25, 2012.
NetConf Data Model draft-adwankar-netconf-datamodel-01.txt Sandeep Adwankar.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Chapter 8 Object Design Reuse and Patterns. Object Design Object design is the process of adding details to the requirements analysis and making implementation.
Definition of an Observation In general, an observation represents the measurement of some attribute, of some thing, at a particular time and place. Observations.
A gentle introduction to the Species Profile Model Robert A. Morris Department of Computer Science UMASS-Boston Doesn’t talk as fast as Greg Whitbread.
AGRICULTURE #Theme 2. Working sessions 1.Crop Trait ontology 2.Biocuration in agrodatabases 3.SPM III: Visual and textual standards for taxonomic identification.
1. 2 Preface In the time since the 1986 edition of this book, the world of compiler design has changed significantly 3.
Object Oriented Analysis: Associations. 2 Object Oriented Modeling BUAD/American University Class Relationships u Classes have relationships between each.
Object-Oriented Modeling: Static Models. Object-Oriented Modeling Model the system as interacting objects Model the system as interacting objects Match.
Phylogenies Reconstructing the Past. The field of systematics Studies –the mechanisms of evolution evolutionary agents –the process of evolution speciation.
® A Proposed UML Profile For EXPRESS David Price Seattle ISO STEP Meeting October 2004.
Acronym Soup GBIF, TDWG & GUIDs Jerry Cooper. Global Biodiversity Information Facility (GBIF) Established in 2000 through non-binding MOU (25 countries.
Winter 2011SEG Chapter 11 Chapter 1 (Part 1) Review from previous courses Subject 1: The Software Development Process.
Formal Specification: a Roadmap Axel van Lamsweerde published on ICSE (International Conference on Software Engineering) Jing Ai 10/28/2003.
COMMON COMMUNICATION FORMAT (CCF). Dr.S. Surdarshan Rao Professor Dept. of Library & Information Science Osmania University Hyderbad
Converting an Existing Taxonomic Data Resource to Employ an Ontology and LSIDS Jessie Kennedy Rob Gales, Robert Kukla.
Banaras Hindu University. A Course on Software Reuse by Design Patterns and Frameworks.
Basic Concepts and Definitions
Where now for the taxon transfer schema and related work: collaboration possibilities? Jessie Kennedy.
Plazi: Prospects for Markup of Legacy and New Taxonomic Literature Terry Catapano TDWG Fremantle, WA October 21, 2008.
Laura Russell VertNet Meherzad Romer NatureServe Canada John Wieczorek
Classification Biology I. Lesson Objectives Compare Aristotle’s and Linnaeus’s methods of classifying organisms. Explain how to write a scientific name.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
COP Introduction to Database Structures
Introduction to Persistent Identifiers
Experiences and Status
International Congress of Entomology, Orlando
Development of the Amphibian Anatomical Ontology
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
Template library tool and Kestrel training
An Evolutional Model for Operation-driven Visualization Design
Chapter 20 Object-Oriented Analysis and Design
SDMX Information Model: An Introduction
Overview of the ETSI Test Description Language
Proposal of a Geographic Metadata Profile for WISE
Sharing information between projects
Presentation transcript:

TDWG 2007 Bratislava SPM from an SDD perspective: Generality and extensibility Gregor Hagedorn Federal Biological Research Center, Berlin, Germany

SDD Purpose was (From SDD Charter:) Develop standard computer-based mechanisms for expressing and transferring descriptive information about biological organisms or taxa (as well as similar entities such as diseases), including terminologies, ontologies, descriptions, identification tools and associated resources.

SPM vs. SDD SpeciesProfileModelCodedDescription |NaturalLanguageDescription aboutTaxon: The taxon this information is about.Scope/TaxonName associatedTaxon: Another taxon associated with this taxon and this piece of information e.g. a parasite or prey Scope/TaxonName context: A string representation of when this information is valid. (Categorical|Quantitative|Text)/Notes contextOccurrence: An indication of when this information is valid according to a geospatial data. Scope/GeographicArea contextValue: An indication of when this information is valid according to a controlled vocabulary. (Categorical|Quantitative|Text) /Modifier hasContent: A information about a taxon in the form of a string. Should be interpreted in combination with the type of the InfoItem (Categorical|Quantitative|Text) /Content hasValue: A information about a taxon in the form of a controlled vocabulary term. (Categorical|Quantitative|Text)/State

Richness & Atomization InfoItem aboutTaxon … … … Character Data Scopes … … Representation SummaryData Scopes SPMSDD Labels, Definitions, MediaObjects (multilingual) RevisionData SampleData … Taxa, Speci- mens, Observ., Publications, Parts, Stage, Sex. etc.

Naming differences Perhaps consider whether SPM: “context” is a good paradigm: A measurement can be made in the context of a study, and perhaps in the context of a season But is “geographical location”, “frequently”, “sex”, “above 1000 m” a context? SDD distinguishes between Scope of a description = criteria by which data have been aggregated (taxon, specimen, geolocation, season, publication source, etc.) and Modifiers that modify/qualify a statement

Naming differences “Value” for categorical measurements is OK in principle, but may affect extensibility to quantitative data. Publication references would be needed for source of information being aggregated, or citations therein

Occurrence / Distribution Occurrence used in contextOccurrence, Distribution as content term around it. Is the order of information reversed? spm:contextValue =“An indication of when this information is valid according to a controlled vocabulary.” → perhaps: Perhaps use a special type here?

Cardinality?

SPM Concepts BiologyCytology Physiology Ecology MolecularBiologyEvolution Conservation Distribution Use Description Size

Biology Description Overlap! Size Cytology Physiology MolecularBiology Ecology Evolution Conservation Distribution Use Ecology Distribution Evolution Biology

Description Conclusive? Size Cytology Physiology MolecularBiology Ecology Evolution Conservation Distribution Use Ecology Distribution Evolution Biology Anatomy Biochemistry Morphology Secondary metabolites

Biology Description Size Cytology Physiology MolecularBiology Ecology Evolution Conservation Distribution Use Ecology Distribution Evolution Biology Anatomy Biochemistry Morphology Secondary metabolites LifeExpectancy LookAlikes Diagnostic Description LifeCycle PopulationBiology Behavior Associations SPM Version Earlier terms used in SPM example files Weight

Number of characters Size LIAS has 987 “characters”, incl. ca. 30 “pseudo- characters” GrassBase has 1090 characters LifeExpectancy SDD concluded to separate character standardization from structural separation Waiting for exchange of existing definitions and patterns to arise rather than round table

SPM content vocabulary A concise “major concept headings” vocabulary like SPM is certainly desirable But definitions are needed! Human-readable definitions should be developed OWL/RDF currently provides a single semantic information: Size is subclass of Description Provision of general abstract data structures (content, value, contextXXX) should perhaps be separated from definition of biological concepts

Ontologies 1 (Descriptive Terms) Leaf Green leafPetal Cladode (= stem looking like leaf) Leaflike structure Stem Coded Summary Descriptions Taxon 1: Green leaf: Length 7 cm Taxon 2: Green leaf: Length 5 cm Taxon 3: Cladode: Length 8 cm Taxon 4: Cladode: Length 2 cm Identification: Which species have leaf-like structures on the stem between 7 and 10 cm long? Flower

Ontologies 2 (Taxonomic Classes) ThisFamily Taxon concepts are a natural ontology with multiple inheritance from within taxon concept classes and Rank classes. Identification: Which family has species with leaf-like structures on the stem between 7 and 10 cm long? Genus Genus spec1Genus spec2 Genus Genus spec1Genus spec2 Taxonomic Rank Family Genus Species

Break down of communication? SDD was designed for the purpose SPM has been developed for SDD and SPM are strongly analogous SDD has invested much time in trying to find an application profile supporting rich editing applications in a way consistent with simple identification keys and taxon-page creation software. SDD structures and terms have not been evaluated for SPM

“Structured Descriptive Data” “Biological Descriptions” Interest Group: “Structured Descriptive Data” → Interest Group: “Biological Descriptions” →TG SDD-Schema →TG SPM →TG SDD/RDF???

Conveners?

Thank you: For volunteering your personal time in discussions, implementation and testing! Projects and companies for testing and implementing! GBIF, TDWG, and BMBF for traveling and workshop support! TDWG-IP for financing an SDD primer!