Toward Using Ontologies to Reason About Disagreeing Taxonomic Experts Dave Thau UC Davis

Slides:



Advertisements
Similar presentations
How to Use This Presentation
Advertisements

TDWG GUID-2 June 10, 2006Jessie Kennedy/Rob Gales LSID Resolution In SEEK Taxon.
Copyright Pearson Prentice Hall
Finding Order In Diversity
Exploring Change Unity and Diversity. Classification Introduction How many species are there? Why should we be interested in learning about the diversity.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
Taxonomy “The Study of Classification”
18.1 Finding Order in Diversity
Until more recent times, scientists named Things with crazy long names that Just described the organism. Apis pubescens, thorace subgriseo, abdomine.
Chapter 15: Classification
Chapter 20 Classification Review. Phylogeny is the evolutionary history of a species or group of related species The discipline of systematics classifies.
Chapter 18 – Classification
CLASSIFICATION Chapter 17.
18-1 Finding Order in Diversity Biologists have identified and named 1.5 million species so far.
Online Counseling Resource YCMOU ELearning Drive… School of Architecture, Science and Technology Yashwantrao Chavan Maharashtra Open University, Nashik.
Fig Copyright © 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings Overview: Investigating the Tree of Life Phylogeny is the.
Plant Names and Classification. Outline Development of Binomial Nomenclature International Code of Botanical Nomenclature Development of the Kingdom Concept.
Objectives 18.1 Finding Order in Diversity
Unit Two: Biodiversity
Storing and Retrieving Biological Instances with the Instance Store Daniele Turi, Phillip Lord, Michael Bada, Robert Stevens.
A Beginners Guide to Understanding Taxonomy, Names and Concepts Jessie Kennedy Napier University.
BIOLOGICAL CLASSIFICATION. Taxonomy  Biological classification, or scientific classification in biology, is a method by which biologists group and categorize.
Dave Thau PASI, Costa Rica, June 7, 2008 Ontologies in Ecology and Biodiversity Informatics Dave Thau With some slides by Shawn Bowers and Josh Madin gratefully.
Stern - Introductory Plant Biology: 9th Ed. - All Rights Reserved - McGraw Hill Companies Plant Names and Classification Chapter 16 Copyright © McGraw-Hill.
Building an Ontology of Semantic Web Techniques Utilizing RDF Schema and OWL 2.0 in Protégé 4.0 Presented by: Naveed Javed Nimat Umar Syed.
OWL 2 in use. OWL 2 OWL 2 is a knowledge representation language, designed to formulate, exchange and reason with knowledge about a domain of interest.
Chapter 18 Classification. What is Classification? Classification is the grouping of objects or information based on similarities. Can you name some things.
Classification. History Aristotle organized living things into 2 main categories: –Plants –Animals.
Classification. Taxonomy: branch of biology that groups organisms based on the presence of similar characteristics Taxonomist: scientist who group and.
World is so diverse Classification  The grouping of information or objects based on similarities  We classify things all the time (creates order) 
TAXONOMY & CLASSIFICATION LECTURE Grouping and Naming Organisms.
Classification WAL: about classification, the binomial system and taxonomy. All Most Some What is a species and how are species named? What are the principles.
Chapter 14 Table of Contents
1 Ch 17:Classification Modified from Massengale, biology junction.
CleanTAX, Dave Thau Stanford Research Institute Artificial Intelligence Center Seminar 8/16/ of 47 Dave Thau and Bertram Ludäscher.
Systematics: The Science Of Biological Diversity Chapter 12
Finding Order in Diversity
1 Classification of organisms. 2 There are 13 billion known species of organisms There are 13 billion known species of organisms This is only 5% of all.
Taxonomic Levels “King Philip”. Carolus Linnaeus invented the Latin binomial system for naming organisms (which we still use today) but he also understood.
The History of Classification Copyright © McGraw-Hill Education Early Systems of Classification Classification is the grouping of objects or organisms.
Chapter 17: The History and Diversity of Life
History of Classification.
CLASSIFICATION Why Classify?. INQUIRY ACTIVITY 1) Construct a table with six rows and six columns. Label each row with the name of a different fruit.
Identifying, Naming, and Classifying Species
Do Now 11/19 1.How is your closet organized? ( or how would your dream closet be organized?) 2. What criteria did you use to help organize your closet?
Choose a category. You will be given the answer. You must give the correct question. Click to begin.
CLASSIFICATION Chapter 17. Warm-up:  When scientists discover a new species, what is the first thing they need to do?  What is the scientific name for.
Classification. What is Classification? Biologists want to better understand organisms so they organize them. Classification = the grouping of objects.
Classification Review!. The scientific name for dogs is Canis familiaris. What is its species name?
18-1 History of Taxonomy Taxonomy  Branch of biology that names and groups organisms according to their characteristics and evolutionary history.
An Introduction to Classification November 29, 2010.
Plant Names and Classification
17.1 Classification Think about how things are grouped in a store or in your kitchen to help create order.
Unit 1: Diversity of Living Things. Specific Expectations Analyze the effects of various human activities on the diversity of life Investigate the principles.
Taxonomy What is taxonomy? Taxonomy is the branch of biology concerned with the grouping and naming of organisms Biologists who study this are called.
Today we are covering from the specification:. What is classification? Organising living organisms into groups We need to do this as scientists have already.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
Classification: Organizing Life’s Diversity Taxonomy, kingdom, phylum, class, order, family. Warm-up: 1.What is it called when natural selection leads.
Chapter 17 Organizing Life's Diversity. A. How Classification Began 1. Classification - grouping of objects or information based on similarities 2. Taxonomy.
Classification- putting things into orderly groups based on similar characteristics.
How do we “classify” living organisms.  Classification: the systematic grouping of organisms based on _________________________________  ______________________:
THE CLASSIFICATION OF LIVING THINGS
Section 1: The Importance of Classification
Section 1: The Importance of Classification
ece 720 intelligent web: ontology and beyond
Service-Oriented Computing: Semantics, Processes, Agents
Towards a Logic Formalization of Taxonomic Concepts
Service-Oriented Computing: Semantics, Processes, Agents
Science 9: Unit A – Biological Diversity
Presentation transcript:

Toward Using Ontologies to Reason About Disagreeing Taxonomic Experts Dave Thau UC Davis

NeSC RDF Workshop June 8, 20062/25 Why Did The Chicken Cross The Road? To get to the other side. To boldly go where no chicken has gone before. To prove it could never reach the other side. Chickens, over great periods of time, have been naturally selected so that they are now predisposed to cross roads. Zeno of Elea

RDF Workshop June 8, 20063/25 Why did the taxonomists cross the road? So they could properly identify the chicken

RDF Workshop June 8, 20064/25 Overview Quick primer on taxonomy Some types of disagreements between experts Problems this causes Using an ontology to represent taxonomic opinions Using the ontology to compare experts’ theories

RDF Workshop June 8, 20065/25 Linnaean Taxonomy Basics Ranks: kingdom, phylum, class, order, family, genus, species, variety (and others!) Canidae CanisVulpes Nyctereutes Canis familiaris Family Rank Genus Rank Species Rank Canis lupus Canis latrans

RDF Workshop June 8, 20066/25 Things you may not know There is no big list of all the known species in the world This is partly because people don’t agree on the definitions of the species, genera, etc. Estimates are that 6% of the known taxa are changed every year This has been going on since Linnaeus published his classification scheme in 1735

RDF Workshop June 8, 20067/25 Ranunculus aquatilis R.a. var aquatilis R.a. var diffusus R.a. var hispidulus FNA-03, 1997 Ranunculus aquatilis R.a. var capillaceus Benson, 1948 Types of Disagreement: The Basics R.a. var calvescens This results in 5 12 (more than 240 million) possible sets of relationships.       B A A  B A  B BA A overlap B BA A disjoint B B A A  B

RDF Workshop June 8, 20068/25 Types of Disagreement - Splitting and Lumping Ranunculus flammula R.f. var genuiinus R.f. var ovalis Benson, 1948 Ranunculus flammula R.f. var filiformis Kartesz, 2004 Peet, 2005: B.1948:R.flammula is congruent to K.2004:R.flammula B.1948:R.f. genuiinus is included in K.2004:R.f.flammula B.1948:R.f.ovalis is included in K.2004:R.flammula B.1948:R.f.filifomis is congruent to K.2004:R.f.filiformis R.f. var filiformis R.f. var flammula    

RDF Workshop June 8, 20069/25 Ranunculus glaberrimus R.g. var reconditus R.g. var ellipticus R.g. var typicus Benson, 1948 Ranunculus glaberrimus R.g. var ellipticus R.g. var glaberrimus Kartesz, 2004 Types of Disagreement – Differing Extents Peet, 2005: B.1948:R. glaberriums contains K.2004:R. glaberrimus B.1948:R.g.ellipticus is congruent to K.2004:R.g.ellipticus B.1948:R.g.typicus is congruent to K.2004:R.h.blaberrimus B.1948:R.g.reconditus is congruent to K.2004:R.tritenatus   

RDF Workshop June 8, /25 Impact on Data Analysis Can’t find data –If A  B, a search on A should retrieve B Can’t aggregate data –If B  A, you should be able to combine data from B into A

RDF Workshop June 8, /25 What to do in case of conflicting experts? Just listen to one expert you like Pick an expert you like and everyone who agrees with this expert (and each other) Choose experts who form the largest set of agreeing experts Choose experts whose opinions encompass the smallest or largest number of taxa

RDF Workshop June 8, /25 How can we find out which experts agree? Represent taxonomy using logic Use the logic to determine relations between expert opinions (theories) –Two theories may conflict –Two theories may be equivalent –One theory may encompass another

RDF Workshop June 8, /25 Representation Details Based on the Taxon Concept Schema (TCS) Represented using Description Logic –(OWL DL)

RDF Workshop June 8, /25 Example Ontology Specimen Ranunculus (Kartesz, 2004) hasSpecies Ranunculus glaberrimus (Kartesz, 2004) Things in the species Ranunculus glaberrimus Things in the genus Ranunculus Taxon Taxon Description hasGenus

RDF Workshop June 8, /25 Fundamental Assumptions Each Taxa class has at least one instance Each Taxa class is defined as the union of its subclasses A class’s subclasses are defined to be mutually disjoint

RDF Workshop June 8, /25 Questions Ontology Can Answer Find the subclasses of a class Make sure the taxonomy is consistent See if two classes are equivalent Can also use it to compare expert opinions

RDF Workshop June 8, /25 Compatible Theories A theory is one expert’s set of classes and relations and all they imply. A set of theories is compatible if –Each theory is consistent and –The correspondences between classes in the theories do not cause inconsistency.

RDF Workshop June 8, /25 Ranunculus hydrocharoides R.h. var natans R.h. var stolonifer R.h. var typicus Benson, 1948 Ranunculus hydrocharoides R.h. var stolonife r R.h. var typicus Kartesz, 2004 Example Incompatibility Peet, 2005: B.1948:R.h.stolonifer is congruent to K.2004:R.h.stolonifer B.1948:R.h.typicus is congruent to K.2004:R.h.typicus B.1948:R. hydrocharoides is congruent to K.2004:R. hydrocharoides  

RDF Workshop June 8, /25 Example Incompatibility Peet, 2005: B.1948:R. macranthus contains K.2004: R. petiolaris B.1948:R. petiolaris is contained by K. petiolaris Ranunculus petiolaris … Benson, 1948 Ranunculus petiolaris Kartesz, 2004 Ranunculus macranthus …   B.48:R. petiolaris  K.04:R. petiolaris  B.48:R. macranthus contradicts B.48:R. macranthus and B.48:R. petiolaris are disjoint.

RDF Workshop June 8, /25 Inferring Unstated Correspondences Ranunculus arizonicus R.a. var chihuahua R.a. var typicus Benson, 1948 Ranunculus arizonicus Kartesz, 2004 Peet, 2005: B.1948:R.a.typicus is included in K.2004:R. arizonicus B.1948:R. arizonicus is congruent to K.2004:R. arizonicus   

RDF Workshop June 8, /25 Given two compatible theories, T and T’: –The theories are equivalent if each class in theory T is equivalent to one class in T’ (and vice versa). – T is smaller than ( ) T’ if each class in T either equals or is contained by a class in T’. Comparing Theories

RDF Workshop June 8, /25 Example of Theory Ordering A BCD A BC A BCE T1 T2T3        T1 T2 T3

RDF Workshop June 8, /25 Whom to believe? Just listen to one expert you like –Easy! Don’t need any reasoning Pick an expert you like and everyone who can agree with this expert –Choose all experts with theories equivalent to the expert you like Choose experts who form the largest set of agreeing experts –Find largest equivalence class Choose experts whose opinions form the smallest or largest number of taxa –Bigger theories account for more taxa

RDF Workshop June 8, /25 Future Work Vetting the ontology Adding ‘intelligence’ to tools which build correspondences Implementing authority picker in a workflow system Efficient algorithm for determining theory hierarchy

RDF Workshop June 8, /25 Thanks! Questions? I’d like to acknowledge: –Bertram Ludäscher, Shawn Bowers, Serguei Krivov, Richard Waldinger for many discussions on this topic. –Jessie Kennedy, Robert Kukla, Trevor Patterson, Martin Graham for their work on the Taxon Concept Schema –Bob Peet for the Ranunculus data set –Kirsten Menger-Anderson for Chicken Drawing –NSF, under SEEK awards , , , and

RDF Workshop June 8, /25 R. aquatilis  R. trichophyllus Where In Greece Can I Find Ranunculus aquatilis?

RDF Workshop June 8, /25 Beginnings of Biological Taxonomy Egypt, 1500 BC: Ebers medical papyrus, classification of medical plants Greece, 300 BC: Aristotle and Theophrastus China, 200 BC: Erh-ya dictionary (second century BC)