Universal Biological Indexer and Organizer. Research Funded by the Andrew W. Mellon Foundation. MBL / WHOI LIBRARY. The Tangled Tree of Life: Informatics and Biological Names

Presentation transcript:

The Tangled Tree of Life: Informatics and Biological Names

Name-bearing data objects
"If the names are lost the knowledge also disappears." - J.C. Fabricius, 1778, Philosophia Entomologica

The Names Problem
Names are not stable
Search:

Other Names Problems
5-10% of scientific names become invalid per decade
Scientific names aren't unique (e.g. Acalyptus)

Newt: as concept
Triturus viridescens Rafinesque 1820
Computers see string
String Properties
Nomenclatural concept
Single specimen
viridis - to become green

Concepts: Nomenclatural
  Triturus viridescens Rafinesque 1820
  Notopthalmus viridescens Baird 1850
  Notophthalmus viridescens Gray 1850 msp.
  Notophthalma viridescens Gray 1858 msp.
  Diemyctylus viridescens Hallowell 1856
  Triton viridescens Strauch, 1870
  Molge viridescens Boulanger, 1872
  Diemyctylus minatus viridescens Yarrow …

Problems for locating information
Table columns: Name (Nomenclatural Synonyms), PMID, Date, Unique
  Notophthalmus viridescens
  Diemictylus viridescens
  Triturus viridescens
476 unique
Libraries, Publishers, Museums, Federal Agencies
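The slide above points at a practical consequence: a search on one name string misses records filed under its nomenclatural synonyms. A minimal sketch of expanding a search across synonyms, assuming a search system that accepts quoted phrases joined with OR (the slide does not name a query syntax, and `build_synonym_query` is a hypothetical helper):

```python
def build_synonym_query(names):
    """Join name strings into one boolean OR query of quoted phrases.

    The quoted-phrase/OR syntax is an assumption for illustration; adapt it
    to whatever the target index actually accepts.
    """
    seen = set()
    unique = []
    for name in names:
        cleaned = name.strip()
        key = cleaned.lower()
        if key and key not in seen:  # skip blanks and case-insensitive repeats
            seen.add(key)
            unique.append(cleaned)
    return " OR ".join(f'"{n}"' for n in unique)


# The three synonyms listed on the slide.
NEWT_SYNONYMS = [
    "Notophthalmus viridescens",
    "Diemictylus viridescens",
    "Triturus viridescens",
]

if __name__ == "__main__":
    print(build_synonym_query(NEWT_SYNONYMS))
```

The synonym list itself has to come from somewhere; in the talk's framing that is the job of a shared name index rather than of each searcher.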

Concepts: Taxonomic
Expert interpretation of the original specimens

Frost 2005 AMNH
Valid name: Notophthalmus viridescens
  Triturus viridescens
  Notopthalmus viridescens
  Notophthalmus viridescens
  Notophthalma viridescens
  Diemyctylus viridescens
  Triton viridescens
  Molge viridescens
  Diemyctylus minatus viridescens
  Triturus viridescens dorsalis
  Diemyctylus viridescens dorsalis
  Notophthalmus viridescens dorsalis
  … 24 others

Dolbe 2004
Notophthalmus viridescens viridescens
  Triturus viridescens
  Notopthalmus viridescens
  Notophthalmus viridescens
  Notophthalma viridescens
  Diemyctylus viridescens
  Triton viridescens
  Molge viridescens
  Notophthalmus viridescens dorsalis
  Triturus viridescens dorsalis
  Diemyctylus viridescens dorsalis
  Notophthalmus viridescens louisianensis
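The two interpretations above can be read as two lookups over the same name strings. A minimal sketch, assuming that the first name listed for each source is the name that source treats as valid (the slide labels this explicitly only for Frost 2005), and using only an illustrative subset of the listed names. `TaxonConcept` and `resolve` are hypothetical names, not part of any uBio interface:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TaxonConcept:
    source: str          # who published this interpretation
    valid_name: str      # name the source treats as valid (assumed: first listed)
    synonyms: frozenset  # other name strings the source places in the same taxon

    def includes(self, name: str) -> bool:
        return name == self.valid_name or name in self.synonyms


# Illustrative subset of the two concepts on the slide, not the full lists.
CONCEPTS = (
    TaxonConcept(
        source="Frost 2005",
        valid_name="Notophthalmus viridescens",
        synonyms=frozenset({
            "Triturus viridescens",
            "Molge viridescens",
            "Triturus viridescens dorsalis",
        }),
    ),
    TaxonConcept(
        source="Dolbe 2004",
        valid_name="Notophthalmus viridescens viridescens",
        synonyms=frozenset({
            "Triturus viridescens",
            "Molge viridescens",
            "Triturus viridescens dorsalis",
            "Notophthalmus viridescens louisianensis",
        }),
    ),
)


def resolve(name, concepts=CONCEPTS):
    """Map each source to the valid name it assigns to `name`, if any."""
    return {c.source: c.valid_name for c in concepts if c.includes(name)}
```

Asking for "Molge viridescens" returns one answer per source, which is the point of the slide: the same string resolves differently depending on whose interpretation is consulted.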

Concepts: Taxonomic
Frost 2005 AMNH: Amphibia > Urodela > Salamandridae > Notophthalmus > Notophthalmus viridescens
NCBI 2005: Amphibia > Batrachia > Caudata > Salamandroidea > Salamandridae > Notophthalmus > Notophthalmus viridescens
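The two hierarchies above place the same species under different parents. A minimal sketch of holding both side by side and asking where they disagree, assuming each classification is stored as a simple root-to-species list (the function names are hypothetical, and the species is spelled Notophthalmus viridescens throughout):

```python
CLASSIFICATIONS = {
    "Frost 2005 AMNH": [
        "Amphibia", "Urodela", "Salamandridae",
        "Notophthalmus", "Notophthalmus viridescens",
    ],
    "NCBI 2005": [
        "Amphibia", "Batrachia", "Caudata", "Salamandroidea",
        "Salamandridae", "Notophthalmus", "Notophthalmus viridescens",
    ],
}


def parent_of(taxon, classifications=CLASSIFICATIONS):
    """Return {source: parent} for every classification that contains `taxon`."""
    parents = {}
    for source, lineage in classifications.items():
        if taxon in lineage:
            i = lineage.index(taxon)
            parents[source] = lineage[i - 1] if i > 0 else None
    return parents


def only_in(source, classifications=CLASSIFICATIONS):
    """Taxa that appear in `source` but in no other classification."""
    others = set()
    for other, lineage in classifications.items():
        if other != source:
            others.update(lineage)
    return [t for t in classifications[source] if t not in others]
```

Here `parent_of("Salamandridae")` gives Urodela under one source and Salamandroidea under the other, so any system that stores a single parent per taxon has to pick a side.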

The Concepts Problem (how do we integrate)

Who is addressing the problem

uBio
Library origins
"System" must account for all names
Any classifications
Biological Name Server
2 million nomenclatural concepts
1.7 million taxon concepts (60 classifications)
SOAP web service
Tool for data organization/retrieval

NameBank

ClassificationBank (Concepts)

Search and Retrieval

Organization (Chapin)

Uses (Google)

Just another database
The response of the world…

Lessons Learned
Everyone needs a job
  Many systems
  No consensus
  Multiple standards
  It's not just technical
  No one is solving my problem
  There will always be multiple systems

Lessons Learned
Too much knowledge can be a dangerous thing
Preble's meadow jumping mouse
Zapus hudsonius preblei Krutzsch, 1954
It doesn't exist
Or it does

Putting it all together

Account for how objects are actually recorded

Lessons Learned
So many standards, so little time
Citation: Olivier, Ent., v., p. 5
  TaxMLit: 802 bytes
  TaxonX: 169 bytes

Lessons Learned
Service works in both directions

Give as we get (Attribution)

Find Common Ground
Stovepipes

Share and share alike
"Furthermore, in contrast to normal synonyms, the relationships between basionyms and their combinations are purely nomenclatural and do not convey any information on classification. For this reason the relationship between a basionym and its combinations should be treated separately (on the NT side)…"
Martin Pullan, The Prometheus Taxonomic Model: A Practical Approach to Representing Multiple Classifications

Concepts: Summary

Nomenclatural Concepts
  Factual
  Inter-relationships are objective
  No new science required (except to make new ones)
  Stable
  Expert scrutiny useful, not required
  Compilation potentially FAST
  uBio 1 million/year
  share (no opinion attached)

Taxonomic Concepts
  Opinion
  Inter-relationships are subjective
  Derived from nomenclatural concepts
  Expert scrutiny is required
  Unstable
  Compilation slow
  CoL 50K / year
  Diptera 200K/15 years
  sharing concerns - opinions attached
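The contrast above suggests keeping the two kinds of concept in separate layers, with opinions pointing at names rather than copying them. A minimal sketch under that assumption; the record types, the integer identifiers and the function names are all hypothetical, and only the name, author and year values come from the synonym list earlier in the talk:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NomenclaturalRecord:
    name_id: int  # hypothetical identifier
    name: str
    author: str
    year: int


@dataclass(frozen=True)
class TaxonomicOpinion:
    source: str
    member_ids: frozenset  # name_ids the source treats as one taxon


# Nomenclatural layer: factual, shareable with no opinion attached.
NAMES = {
    1: NomenclaturalRecord(1, "Triturus viridescens", "Rafinesque", 1820),
    2: NomenclaturalRecord(2, "Diemyctylus viridescens", "Hallowell", 1856),
    3: NomenclaturalRecord(3, "Triton viridescens", "Strauch", 1870),
}

# Taxonomic layer: an opinion only references names by identifier.
OPINIONS = [
    TaxonomicOpinion("Frost 2005", frozenset({1, 2, 3})),
]


def names_in(opinion, names=NAMES):
    """Name strings grouped by an opinion, oldest first."""
    records = sorted((names[i] for i in opinion.member_ids), key=lambda r: r.year)
    return [r.name for r in records]


def export_names(names=NAMES):
    """Export the nomenclatural layer alone as (name, author, year) tuples."""
    return [(r.name, r.author, r.year) for _, r in sorted(names.items())]
```

Because `export_names` never touches `OPINIONS`, the factual layer can be compiled and shared quickly while the slower, contested taxonomic layer is maintained separately on top of it.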

Separate fact from interpretation
The informatics value of facts

Federate
Layered architecture
  Common Foundation
  Diverse expression
  Enhanced Interchange
  Cooperation
  Efficient

A note on Service