Brian J. Enquist Dept. Ecology and Evolutionary Biology University of Arizona, Tucson, A.Z. and The Santa Fe Institute, Santa Fe, N.M. Brian J. Enquist.

Slides:



Advertisements
Similar presentations
The VegBank taxonomic datamodel Robert K. Peet Sponsored by: The Ecological Society of America US National Science Foundation Produced at: The National.
Advertisements

EDIT General Meeting Carvoeiro, January 2008.
V Alyssa Rosemartin 1, Lee Marsh 1, Ellen Denny 1, Bruce Wilson USA National Phenology Network, Tucson, AZ; 2 - Oak Ridge National Laboratory, Oak.
Publish or perish? Linking Scratchpads and the new Biodiversity Data Journal for streamlining publication of botanical data D.N Koureas 1, L. Penev 2 &
PlantCollections A Community Solution An Institute of Museum and Library Services National Leadership Grant Building Digital Resources.
SDD: Structured Descriptive Data Gregor Hagedorn (Germany) Bob Morris (USA) Kevin Thiele (Australia)
Jennifer A. Dunne Santa Fe Institute Pacific Ecoinformatics & Computational Ecology Lab Rich William, Neo Martinez, et al. Challenges.
Plant Systematics databases: Users perspectives Robert K. Peet, University of North Carolina In collaboration with The National Center for Ecological Analysis.
Down-scaling climate data for microclimate models and forecasts Securing the Conservation of biodiversity across Administrative Levels and spatial, temporal.
The VegBank taxonomic datamodel Robert K. Peet Sponsored by: The Ecological Society of America US National Science Foundation Produced at: The National.
Biodiversity and Climate Change
BIS TDWG Conference 28 October 2013, Florence Documenting data quality in a global network: the challenge for GBIF Éamonn Ó Tuama, Andrea Hahn, Markus.
Joint Research Centre GLC2000 First Results Workshop Ispra 18 th to 22 nd March 2002 Project overview.
NSF EF Welcome to Summit III University of Florida Florida State University.
TDWG Annual Conference 2013, Florence Hannu Saarenmaa University of Eastern Finland Integrating observation and survey data for production of the Essential.
Dawn Wright Oregon State University Ned Dwyer Coastal & Marine Resources Centre, Ireland The International Coastal Atlas Network (ICAN) FGDC Marine & Coastal.
European GBIF Nodes Meeting 2013 Rui Figueira Digitarium, Joensuu, Finland, March GBIF Portugal
SERNEC Image/Metadata Database Goals and Components Steve Baskauf
Drivers for a PRAGMA Biodiversity Science Expedition Reed Beaman Florida Museum of Natural History University of Florida.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Developing a cyberinfrastructure: Experiences from a regional network of plant taxonomists Zack Murrell, Derick Poindexter, and Michael Denslow Appalachian.
Developing an on-line taxonomic guide to the freshwater diatoms of the United States: scope, process and initial steps Steve Moulton Sarah Spaulding National.
Scratchpads Publication Module - A paradigm shift in publishing RBG Kew, Seminar,
Use case lessons: Components of the SEEK architecture Robert K. Peet University of North Carolina.
A new floristic atlas for the Southeast based on taxon concept relationships Robert K. Peet 1, Alan S. Weakley 1,2 & Xianhua Liu 1,3 1 The University of.
Resource Identification for a Biological Collection Information Service in Europe An introduction to the BioCISE project Walter G. Berendsohn Botanical.
Indexing the Species Names of the World - for the World Frank Bisby (Species 2000), Michael Ruggiero (ITIS) Per de Place Bjørn (GBIF - ECAT)
@dimitriskoureas making small data… big. Publications based on countless specimens, images, maps, keys and datasets Typically generated by small communities.
SWWG PROJECT OVERVIEW Semantic Technologies for Integrating USGS Data.
Field Work, Herbaria, Databases, Floras, and Monographs for Plant Systematics Spring 2014.
1 Data Integration Community of Practice Meeting September 15, 2009 Science Data Integration.
University of Florida Florida State University
[] Where Did Those GBIF Occurrences Come From? Providing Digital Access to NatureServe's Reference Database: Report on a Project in the Early Stages of.
INTERNATIONAL UNION FOR CONSERVATION OF NATURE Preparing Component Programmes January 2011.
ITR: Collaborative research: software for interpretation of cosmogenic isotope inventories - a combination of geology, modeling, software engineering and.
Synopsis of current BIEN and Enquist projects managed by Martha iPlant 2014.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
The TNRS: a Taxonomic Name Resolution Service for Plants Naim Matasci The iPlant Collaborative iEvoBio 2011 Jun 21-22,
GIS and Community Health. Some critiques of GIS emphasize the potentially harmful social consequences of the diffusion of GIS technology, including reinforcing.
Overview of progress in Ecoinformatics Susan Wiser Landcare Research, Lincoln New Zealand.
BIEN Confederated DB (S) Analytical DB(s) Heterogeneous source database(s) of Plots/Specimens/Occurrences Synonymy Names Reference taxonomy *** *** Feedback.
Trees, taxonomy & location: mapping phylogeography using Biodiverse Dan Rosauer & Shawn Laffan University of New South Wales & Centre for Plant Biodiversity.
Definition of an Observation In general, an observation represents the measurement of some attribute, of some thing, at a particular time and place. Observations.
A Provisional Observational Data Standard to Facilitate Data Sharing and Aggregation Lynn Kutner, Bruce Stein, and Donna Reynolds TDWG Annual Meeting,
Challenge Grant Update: Linking the Network of Natural Heritage Biodiversity Data to the Environmental Information Exchange Network.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
The challenge of biodiversity: Plot, organism and taxonomic databases Robert K. Peet University of North Carolina The National Plots Database Committee.
Context: The Strategic Plan for Establishing the Network Integrated Biocollections Alliance Judith E. Skog, Office of the Assistant Director, Biological.
Acronym Soup GBIF, TDWG & GUIDs Jerry Cooper. Global Biodiversity Information Facility (GBIF) Established in 2000 through non-binding MOU (25 countries.
Brian J. Enquist Dept. Ecology and Evolutionary Biology University of Arizona, Tucson, A.Z. and The Santa Fe Institute, Santa Fe, N.M. Brian J. Enquist.
VegBank A vegetation field plot archive Produced at: The National Center for Ecological Analysis and Synthesis Principal Investigators: Robert K. Peet,
HISCOM An Australian Virtual Herbarium Jim Croft Australian National Herbarium.
Networking Biodiversity Data – Online Access to Distributed Data Sources in GBIF-D Andrea Hahn, A. Kirchhoff & W.G. Berendsohn Botanic Garden and Botanical.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
GBIFS Seminar with the Science Committee and the Nodes Strategy Group Analysis of the content published by the GBIF network – Better understanding what’s.
AUSTRALIA’S VIRTUAL HERBARIUM A national collaborative model for integrated access to distributed biological information Australian National Herbarium.
The challenge of organism identity --- The flora of the Southeast The flora of the Southeast as a case study Robert K. Peet University of North Carolina.
Example projects using metadata and thesauri: the Biodiversity World Project Richard White Cardiff University, UK
The challenge of biodiversity: Plot, organism and taxonomic databases Robert K. Peet University of North Carolina The National Plots Database Committee.
A vision for community involvement and integration Robert K. Peet & Alan S. Weakley Alan S. Weakley.
12 th Meeting of the GBIF Participant Nodes Committee 6-7 October 2013, Berlin, Germany Data mobilization and use for international policy Olaf Bánki Senior.
12 th Meeting of the GBIF Participant Nodes Committee 6-7 October 2013, Berlin, Germany Towards a generic work programme for a Node Olaf Bánki Senior Programme.
Data sharing and exchange: Experiences within the
RCN Development of an Online Database to Enhance the Conservation of SGCN Invertebrates in the Northeastern Region James W. Fetzner Jr. & John.
Taxonomic and Community Classification Resources and Standards
Introduction to USA-NPN and Nature’s Notebook
7.b Marine alien species on EASIN
Challenge Grant Update
  1-A) How would Arctic science benefit from an improved GIS?
Presentation transcript:

Brian J. Enquist Dept. Ecology and Evolutionary Biology University of Arizona, Tucson, A.Z. and The Santa Fe Institute, Santa Fe, N.M. Brian J. Enquist Dept. Ecology and Evolutionary Biology University of Arizona, Tucson, A.Z. and The Santa Fe Institute, Santa Fe, N.M. BIEN, MBG, iPToL, Developing a Taxonomic Name Resolution Service: Overview of Science and Initial Meeting Goals

BIEN – Botanical Information and Ecology Network Specimens Ecological observations species/individual observations, Trait Data + =

Map by N. Swenson Green et al. Science (2008) Merging plant specimens from various herbaria, plots, and ecological data (functional traits) Swenson and Enquist (2007) American Journal of Botany Example of potential science to be done...

BIEN is a synthetic NCEAS working group Systematists, ecologists, informaticians and computer scientists We are presenting a new model forward for asking questions in botanical science BIEN NCEAS Working Group Dec > 2009, 2010

Plots with plant community abundance, diversity, size information Building on work by Alwyn Gentry... (1) Plot Data Literally thousands of ecological vegetation plots by numerous researchers

Integrating Plot Networks

(2) Specimen Data….

Exchange schema(s) Confederated resource DATA SCRUBBING CORRECTING Plot and Trait Data Specimen Data Science Data Standardization Tools TAXONOMIC INTELLIGENCE BIEN Initial Roadmap

(1) Specific short-term science questions at the nexus of merging herbarium, plot (abundance), and observation (trait) data for plants in the Americas. (2) Technology development goals associated with answering these questions effectively – as well as to establish an informatics methodology for continuing to assemble and integrate relevant observation data for this and other projects. BIEN Initial Project Goals (3) Longer-term program development – seek support to develop a permanent technical solution to the integration of vegetation/trait/botanical data

Q 3: What are the physiological, demographic, environmental, and phylogenetic correlates of rarity and commonness across environmental gradients? Can these correlates be used to predict vulnerability or resistance to extinction for species and communities under differing scenarios of habitat loss and climate change? Q 2: How are abundance and range size related? For example, do trees with small ranges tend to be rare, relative to widespread species? Q 1: How does climate influence the relative distribution of narrow and widespread species? Do these relationships vary in tropical and temperate environments? BIEN Initial Science Goals

Use collections and observations from plots to then map/model distributions University of Arizona Diversity Mapper

BIEN2.0 Database (March 2010) Core specimen datasets for the Americas (GBIF, MBG) Core plots datasets (FIA, CTFS, VEGBANK, SALVIAS) BIEN database is housed at NCEAS and contains Core species trait database (Literature and collaborators) 7.5 million specimens 10.7 plot observations ~ 18 million individual taxon strings Traits: 25, Species: 36,241, Records: 122,230 ~ 250,000 unique taxa (!) indicating synonyms comprise large fraction of names Use these data to generate measures of geographic ranges, measures of abundance and trait composition

- Spring 2009, expanded BIEN group and submitted a proposal to become a ‘Grand Challenge’ team at iPlant Long story.... Central Grand Challenge team is the ‘Tree of Life’ team (iPToL) Formalized a two year partnership with iPlant to work closely with iPToL -To fund collaborations with MBG -To develop cyberinfrastructure to support development of a TNRS and synonymized New World Plant Names Database (NWPND) - To ‘do science’ by merging ecology and traits on iPToL phylogenies. Longer-term program development

Why Develop a TNRS? The ‘names problem’ is one of the most fundamental impediments to biological science The plant sciences do not yet have the cyberinfrastructure needed for taxonomic standardization Our inability to integrate differing heterogeneous data sources across differing temporal and spatial scales has significantly limited our ability to answer key scientific questions (applied and academic).

Taxonomic Name Resolution Service Taxonomic Name Resolution Service iPlant Tree of Life (iPToL) APWeb Group Generic Synonymy APWeb Tropicos Core: Names, Specimens, References, People Projects / Concepts Alt. Classifications Literature links Vouchered Images The Plant List iPlant NCEAS BIEN MOBOT Informatics Relationship Between Institutional and Efforts iPlant cyberinfrastructure development team

- Develop a botanical Taxonomic Name Resolution Service (TNRS) to authenticate botanical names against a standard list of published taxa -Develop an authoritative, synonymized NWPND that will allow New World plant occurrence data to be mapped to a standard set of taxon concepts. Primary goals of this BIEN/MBG/iPToL collaboration will be to work together with software developers and technology staff at iPlant to: Specific goals for this meeting

Technology to be developed with iPlant - The TNRS & NWPND will be web services, with batch- processing capability and an intuitive user interface - The TNRS & NWPND services will be available to any user interested in correcting and harmonizing names of plant taxa. Developed with the assistance of programmers and technical support from iPlant. The development of the TNRS & NWPND necessitates short- and long-term goals. It is up to us to define these goals in order to develop this technology.

(i) Overview of science - Science goals will help define use cases, needs, and structure of the TNRS. (ii) Agree upon outcomes - Ensure MBG, BIEN, iPTOL and collaborators are clear on respective needs and desired outcomes. (iii) Articulate ‘use cases’ - will define what TNRS should and should not do as well as clarify programming and technology needs. **(iv) Articulate short and long-term goals for the TNRS ** (v) Detail longer term vision and goals - for next TNRS meetings and technology development goals between now and future meetings. General Goals for This Meeting - Rank in order of importance - Rank in order of ease of implementation

To start we need to clearly articulate short- and long-term goals. Initially, toward a TNRS, we propose two lines of development: (1) (2) These lines of development can start separately but ultimately will merge Synonymy Name Matching

Short-term goals (implement within first year) (1) Name matching Compile complete list of names within TROPICOS and ideally IPNI. Flag taxa occurring within the Americas, Define and articulate the basic functions of proposed web service, including but not limited to: - To match a user-submitted list of names against authoritative list. - Build an interface for user interpretation and adjudication of ambiguous cases based on match rankings - To catch and correct common spelling mistakes, atomize data elements, extract additional concatenated information, etc. Data Applications

(2) Synonymy - Identify other digitized sources of monographic and regional synonymy The expectation is that these will provide a common framework for combining various authoritative lists into a first draft of a New World plant taxon checklist - Define functions of application for checking submitted names against list of synonymized names. Applications - Interface for user interpretation and adjudication of ambiguous cases - Ranking of degrees of ambiguity - Digitized synonymized checklists within MBG. Data: Short-term goals (implement within first year)

In this meeting we will not create a requirements document or detailed work flows The charge for the meeting is to finalize short-term goals for TNRS and immediate ‘use cases’ to develop the bullet points of these documents -These documents will be created on project wiki after the meeting with the feedback from the TNRS collaborators - Documents will be used to then start developing the prototype of the TNRS this year for implementation from technology staff and programmers at iPlant.