globalnames.org
Discovery Ephemeral Individualistic Massive redundancy Optional Risk taking
Discovery Ephemeral Individualistic Massive redundancy Optional Risk taking Implementation Communal / agreed Essential Persistent Robust & reliable Adaptable
Discovery Ephemeral Individualistic Massive redundancy Optional Risk taking
Data re-use Data generation Data pool
AggregationVisualizationAnalysisManipulation ModelsObservationsExperimentsProcessed Data re-use Data generation Data pool
AggregationVisualizationAnalysisManipulation ModelsObservationsExperimentsProcessed Data re-use Data generation
Can be used as metadata To index content A names-based cyberinfrastructure to index and interconnect distributed data 99
‘The initial mapping was constructed by extracting the scientific name of the taxon that was the topic of each Wikipedia page, then finding the match for this in the NCBI taxonomy’.
Initiated by GBIF & EOL in 2007 To build a names-based cyberinfrastructure An open and free virtual layer that interconnects and enriches distributed content Shaped through Nomina workshops Overseen by GNA Advisory panel Globalnames.org
PIs: Stan Blum, California Academy of Sciences Replication Chris Freeland, Missouri Botanical Gardens / WUSTL BHL / CiteBank David Patterson, Arizona State University Names services Rich Pyle, Bishop Museum ZooBank / GNUB The “Global Names Architecture,” an innovative infrastructure for unifying nomenclatural and taxonomic databases and services for managers of biological information.
Many names for one species One name for many species The indefinite nature of species Classifications / phylogenies Species without names
Variant spellings (some legitimate, some mistakes) Homotypic synonyms (= objective = nomenclatural) Heterotypic synonyms (= subjective = taxonomic) Common (plain language) names Surrogates for names Chresonyms
Didymosphenia geminata Echinella geminata Didymosphenia geminata (Lyngbye) D. geminata Didymosphenia geminata (Lyngbye) Schmidt 1899 Gomphonema vulgare Bréb. AAAAAGCTCGTAGTTGGATTTGTGAT GGAATTTGAATACTTTTAAAGTGTTCT AGAAACTGTCATCCGTGGGTGGAATT TGTTTGGCATTAGGTTGTCAGRCAGAG GATGCCTATMCTTTACTGTGAAAAAAT CAGTGCGTTCAAAGCAGACTTACGTC GATGAATGTATTAGCATGGAA Didimosphenia geminata didymo Rock Snot didymo
Didymosphenia geminata (Lyngbye) M.Schmidt in A. Schmidt 1899 Misspellings Didimosphenia geminata Lexical variants Didymosphenia geminata (Lyngbye) Didymosphenia geminata D. geminata D. geminata Schmidt 1899 D. geminata Schmidt Surrogates Homotypic Synonyms Echinella geminata Heterotypic Synonyms Gomphonema vulgare Bréb. Vernculars Didymo Rock Snot AAAAAGCTCGTAGTTGGATTTGTGATGGAATTTGAATACTTTTAAAGT GTTCTAGAAACTGTCATCCGTGGGTGGAATTTGTTTGGCATTAGGTTGT CAGRCAGAGGATGCCTATMCTTTACTGTGAAAAAATCAGTGCGTTCAAA GCAGACTTACGTCGATGAATGTATTAGCATGGAA
Gomphonema vulgare Brébisson 1838 G. vulgare Breb. Gomphonema vulgare Brébisson 1838 G. vulgare Breb. Vernaculars Surrogates RECONCILIATION GROUP AAAAAGCTCGTAGTTGGATTTGTGATGGAATTTGAATACTTTTAAAGTGTT CTAGAAACTGTCATCCGTGGGTGGAATTTGTTTGGCATTAGGTTGTCAGRC AGAGGATGCCTATMCTTTACTGTGAAAAAATCAGTGCGTTCAAAGCAGACTT ACGTCGATGAATGTATTAGCATGGAA Heterotypic synonyms
Gomphonema vulgare Brébisson 1838 G. vulgare Breb. Gomphonema vulgare Brébisson 1838 G. vulgare Breb. Vernaculars Surrogates AAAAAGCTCGTAGTTGGATTTGTGATGGAATTTGAATACTTTTAAAGTGTT CTAGAAACTGTCATCCGTGGGTGGAATTTGTTTGGCATTAGGTTGTCAGRC AGAGGATGCCTATMCTTTACTGTGAAAAAATCAGTGCGTTCAAAGCAGACTT ACGTCGATGAATGTATTAGCATGGAA Heterotypic synonyms RESOLUTION Didymosphenia geminata (Lyngbye) Schmidt 1899
Using the name endorsed by your favored taxonomic source
Aa Ar Pet1 A marina Abe__Heli Apodemia.mor.A13 N_larina_aethra_20018 Apion pensylvaticum: Boheman 1839 Apion pennsylvaticum Boheman, 1839 Gy091_Lv_Bonn_Ger Gy642_Lv_Bas_Switz P.potto_JCKerbis2889 S.sciereus_U53582 C.major G.crass. L._catta L.catta Solution: Taxonomic validation at point of data entry
Focus on ‘Use Cases’ And the infrastructure will follow Extend existing software, dbs and services around the concept of nodes GN UUIDs for names and reconciliation groups Exchange standards
Particular They represent a class of problems Must be do-able Must visibly benefit many end users Must be openly available to use and to improve
Taxonomic validation services. At point of data entry or with publishers. Requires Reconciliation and resolution. Indexing. Using names recognition and discovery tools. Essential for name-linking services. Names normalization – for data federation, but must deal with poor OCR, colloquial names in many languages etc. Content synchrony and curation