IPlant's Taxonomic Name Resolution Service Naim Matasci BIO5 / The iPlant Collaborative.

Slides:



Advertisements
Similar presentations
AUSTRALIA’S VIRTUAL HERBARIUM
Advertisements

Globalnames.org.  Discovery  Ephemeral  Individualistic  Massive redundancy  Optional  Risk taking.
GUID-1 Workshop Welcome and Introduction Donald Hobern GBIF Program Officer for Data Access and Database Interoperability February 2006.
Reporting Measurement Uncertainties According to the ISO Guide Duane Deardorff Dept. of Physics and Astronomy The University of North Carolina at Chapel.
Placing barcodes with precision against the Catalogue of Life Frank Bisby Executive Director: Species 2000 Species 2000 Secretariat University of Reading,
BIS TDWG Conference, New Orleans, 2011 GBIF: Issues in providing federated access to digital information related to biological specimens David Remsen Senior.
The iPlant Tree of Life Project and Toolkit: Building a Cyberinfrastructure for Plant Science Research Naim Matasci The iPlant Collaborative Evolution.
VegBank.org: a Permanent, Open-Access Archive for Vegetation Plot Data. Michael T. Lee 1, Michael D. Jennings 2, Robert K. Peet 1. Interacting with the.
Scaling up The International Plant Names Index (IPNI) James A. Macklin Harvard University Herbaria Paul J. Morris Harvard University Herbaria & Museum.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL One-Thousand Words Peter Parente Enabling Technology Spring 2003 Enriching Digital Images to Improve Information.
Biodiversity Heritage Library by Connie Rinaldo. Overview History EOL/BHL: WHY? Members/Collaborators Process Governance Sustainability: Legal and Financial.
Data models for Community information Robert K. Peet, University of North Carolina John Harris, Nat. Center for Ecol. Analysis & Synthesis Michael D. Jennings,
NSF on the web- An indispensable resource
Computational Thinking across the Curriculum Workshop Amber Settle and Ljubomir Perkovic DePaul University June 11, 2010 Work supported by the National.
Plant names: obstacles and solutions
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
The EDIT Platform for Cybertaxonomy as an information broker in name infrastructures Andreas Kohlbecker 1, Yde de Jong 2, Cherian Mathew 1, Lorna Morris.
The Encyclopedia of Life: A Web Site for Every Species James Edwards Executive Director, EOL Barcode of Life Conference Taipei 20 September 2007.
Open Science Grid For CI-Days Internet2: Fall Member Meeting, 2007 John McGee – OSG Engagement Manager Renaissance Computing Institute.
IPlant's Taxonomic Name Resolution Service Naim Matasci BIO5 / The iPlant Collaborative tnrs.iplantc.org.
The Macroalgal Digitization Project Chris Neefus, Department of Biological Sciences University of New Hampshire, Durham, New Hampshire.
Explorer of Taxon Concepts (ETC) From description to matrix and beyond in a web-based toolbox.
Use case lessons: Components of the SEEK architecture Robert K. Peet University of North Carolina.
Tom Garnett April 12, 2007 Smithsonian Institution Libraries National Museum of Natural History Board Science Committee Meeting Biodiversity Heritage Library.
Brian J. Enquist Dept. Ecology and Evolutionary Biology University of Arizona, Tucson, A.Z. and The Santa Fe Institute, Santa Fe, N.M. Brian J. Enquist.
University of Florida Florida State University
Using Scientific Measurements. Uncertainty in Measurements All measurements have uncertainty. 1.Measurements involve estimation by the person making the.
Serving the needs of the conservation community Global Biodiversity Information Facility.
GLOBAL BIODIVERSITY INFORMATION FACILITY Cataloging and using Taxonomic Data The Global Names Architecture David Remsen Senior Programme Officer, ECAT.
Open Science Grid For CI-Days Elizabeth City State University Jan-2008 John McGee – OSG Engagement Manager Manager, Cyberinfrastructure.
The iPlant Collaborative: A Cyberinfrastructure for the Life Sciences Naim Matasci BIO5 / The iPlant Collaborative EEB, University of Arizona Oct 4, 2011.
Applying Open Source to Open Science Ray Idaszak Director, Collaborative Environments RENCI, University of North Carolina at Chapel Hill.
The TNRS: a Taxonomic Name Resolution Service for Plants Naim Matasci The iPlant Collaborative iEvoBio 2011 Jun 21-22,
TDWG 2006 Conference, St Louis Digitizing the legacy literature of biodiversity An introduction to the Biodiversity Heritage Library (BHL) Neil Thomson.
Botanicus.org: Prototyping a Web 2.0 interface to digitized taxonomic literature Chris Freeland - Application Development Manager Doug Holland – Director.
IPlant Collaborative Hands-on Cyberinfrastructure Workshop – Part 2 R. Walls University of Arizona Biodiversity Information Standards (TDWG) Sep. 29, 2015,
Vegetation Data Management: VegBank Funding: National Science Foundation (DBI ) January 8, 2002 John Harris - NCEAS.
Using Biological Cyberinfrastructure Scaling Science and People: Applications in Data Storage, HPC, Cloud Analysis, and Bioinformatics Training Scaling.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Atmosphere.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop GWAS/QTL Apps Overview.
Build an Automated Workflow Visual Workflow Creator Discovery Environment.
 Data Quality Resources in Species Occurrence Digitization Allan Koch Veiga Etienne Americo Cartolano Jr Antonio Mauro Saraiva Agricultural Automation.
My Biome Vacation Where science, literacy, and technology #ncsta2014.
Spotlight on the Global Plants Initiative
The challenge of biodiversity: Plot, organism and taxonomic databases Robert K. Peet University of North Carolina The National Plots Database Committee.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop …and Environments.
Brian J. Enquist Dept. Ecology and Evolutionary Biology University of Arizona, Tucson, A.Z. and The Santa Fe Institute, Santa Fe, N.M. Brian J. Enquist.
Royal Botanic Garden Edinburgh Funded mostly by Scottish Government Martin Pullan – Biodiversity informatics David Harris – Herbarium Curator.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop BISQUE.
AUSTRALIA’S VIRTUAL HERBARIUM A national collaborative model for integrated access to distributed biological information Australian National Herbarium.
Enabling Plant Sciences Research with the iPlant Discovery Environment and Condor Juan Antonio Raygoza Garay, Sonya Lowry, John Wregglesworth.
The challenge of biodiversity: Plot, organism and taxonomic databases Robert K. Peet University of North Carolina The National Plots Database Committee.
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen Senior Programme Officer, ECAT 3 Oct th Nodes Meeting.
Botanical Nomenclature The Basics. International Code of Botanical Nomenclature Revised every 6 years 2006 (Vienna) Code:
Quality control of biodiversity data: tools & techniques Leen Vandepitte On behalf of WoRMS, EurOBIS & LifeWatch data management teams.
Taxonomic Name Recognition (TNR) in Biodiversity Heritage Library (生物多样性图书馆分 类学名称识别) Qin Wei (魏琴), Chris Freeland, P. Bryan Heidorn Missouri Botanical.
Biodiversity Heritage Library: A Successful Collaboration, A Fully Open Access Collection Marty Schlabach Mann Library, Cornell University Upstate New.
Transforming Science Through Data-driven Discovery Workshop Overview Ohio State University MCIC Jason Williams – Lead, CyVerse – Education, Outreach, Training.
Freeland, LAPI II, 18 NOV 2008 Digital Libraries for Science: Botanicus & Biodiversity Heritage Library Chris Freeland Director of Bioinformatics, Missouri.
World wide access to biodiversity literature The Biodiversity Heritage Library Henning Scholz 1 & Tom Garnett 2 1 Museum für Naturkunde, Berlin, Germany.
Data sharing and exchange: Experiences within the
Tools and Services Workshop
Joslynn Lee – Data Science Educator
Applying Open Source to Open Science
Patterns, Plants and Paint! Year 2 – Term 4
Taxonomic and Community Classification Resources and Standards
Volume 135, Issue 5, Pages e2 (November 2008)
Volume 135, Issue 5, Pages e2 (November 2008)
Big Data Needs Little CRUD:
Figure:
Presentation transcript:

iPlant's Taxonomic Name Resolution Service Naim Matasci BIO5 / The iPlant Collaborative

The Problem – Taxonomic Uncertainty Solanum lycopersicum Lycopersycon lycopersicum Lycopersycon esculentum Solanum lycopersicon?

Taxonomic uncertainty 1.Non-existent names Misspellings Contamination Annotations Morphospecies Digitization issues (frame shifts, character encoding)Lexical variants (digitization conventions) 2.Synonymy Nomenclatural synonyms Taxonomic synonyms / concepts 3.Misidentifications, incomplete identifications

a) Centaurium curvistamineum (Wittr.) Abrams (1951) b) Centaurium minimum (Howell) Piper (1915) c) Centaurium muhlenbergii (Griseb.) Wight ex Piper (1906) d) Centaurium muhlenbergii (Griseb.) Wight ex Piper forma albiflorum (Suksd.) St. John (1937) e) Centaurium muhlenbergii (Griseb.) Wight ex Piper var. albiflorum Suksd. (1927) f) Centaurodes muhlenbergii (Griseb.) Kuntze (1891) g) Erythraea curvistaminea Wittr. (1886) h) Erythraea minima Howell (1901) i) Erythraea muhlenbergii Griseb. (1839) Image: Gordon Leppig & Andrea J. Pickart

How to figure that out? …or ask around at My-Plant.org

Makemake at de.wikipedia Original Use Case – Density Plots (BIEN/NCEAS)

Hans Hillewaert

Taxonomic Name Resolution Service Computer assisted standardization of plant names Corrects spelling errors and alternative spellings to a standard list of names Convert out-of-date names to currently accepted names

Where is my plant!!! It's all wrong!!!

Future More sources – Standard source import with DwC support Better performance TNRastic API Integration with Global Names components

Web: Code: ource/TNRS API (provisional): TNRastic API:

Brad Boyle Brian Enquist Juan Antonio Raygoza Garay Nicole Hopkins Zhenyuan Lu Martha Narro Shannon Oliver William Piel Jill Yarmchuk Bob Magill (Missouri Botanical Garden) Chris Freeland (Missouri Botanical Garden) Chuck Miller (Missouri Botanical Garden) Peter Jorgensen (Missouri Botanical Garden) Amy Zanne (University of Missouri, St. Louis) Peter Stevens (Missouri Botanical Garden) Jay Paige (Missouri Botanical Garden) Bob Peet (University of North Carolina at Chapel Hill) Paul Morris (Harvard University) Alan Paton (Kew Royal Botanic Gardens and their International Plant Names Index) Tony Rees (Commonwealth Scientific and Industrial Research Organisation) Michael Giddens ( Dmitry Mozzherin (Global Biodiversity Information Facility) David Remsen (Global Biodiversity Information Facility) David Patterson (Encyclopedia of Life) Cam Webb (Harvard University) Missouri Botanical Garden (Tropicos) Funding provided by the National Science Foundation Plant Cyberinfrastructure Program (grant #DBI ).