Connect UNAVCO, a VIVO for a Scientific Community

Presentation transcript:

Connect UNAVCO, a VIVO for a Scientific Community. M. Benjamin Gross (1), Linda R. Rowan (1), Matthew Mayernik (2), Michael D. Daniels (2), Huda Khan (3), and Dean B. Krafft (3). (1) UNAVCO, Boulder, CO; (2) National Center for Atmospheric Research, Boulder, CO; (3) Cornell University, Ithaca, NY.

About UNAVCO: UNAVCO is a non-profit, university-governed consortium that facilitates geoscience research and education using geodesy. Geodesy: the study of Earth’s shape, gravity field, and rotation. Could also show video if there’s audio: https://www.youtube.com/watch?v=yxLMk120vMU

About EarthCollab: A National Science Foundation EarthCube building block. Partnership between UNAVCO, NCAR, and Cornell University. EarthCollab goals: support scientific collaboration and increase the discoverability and usability of scientific resources via semantic and linked data technologies.

Connect UNAVCO (connect.unavco.org): Serves a distributed community in a specific domain, and as such can be, and should be, customized for our domain. Key features: ontology, controlled research vocabulary, geospatial info, faceted search, PIDs, data facility. Data facility: datasets, stations, external collaborators (consortium).

User Engagement: EarthCollab survey, conducted in 2014 and 2015, about how researchers find and share research.

Requirements and Challenges: Site must be easily searchable; survey takers indicated they use search for most tasks. Connect data with people, publications, and tools in a discoverable way, and point users toward the data source. Use unique IDs whenever possible, e.g. DOIs. Minimize duplication of data by crosslinking VIVO instances. Extend the VIVO ontology to capture UNAVCO concepts.

Requirements and Challenges: Ontology requirements. Describe Earth observations: ships, networks, platforms, temporal and spatial aspects. Capture relationships between the UNAVCO facility and member universities and their representatives.

Local ontology extensions

Local ontology extensions

Vocabulary comparison

Ingest Process Challenges: Distributed and variable data stores; no institutional subscription to a publication indexing service; publications authored by external collaborators, not employees.

Connect UNAVCO stats (http://connect.unavco.org): Events: mostly scientific conferences. Locations: includes GPS/GNSS stations. About 555,000 asserted triples, running v1.9.

Connect UNAVCO Research Terms: Community members and employees can select from a list of 120 research and expertise terms. Example term: Software Engineering (Expertise, Research area).

Connect UNAVCO Research Terms: Limited vocabulary → longer lists of people.

Geospatial info

Facets in Connect UNAVCO: Find member reps. Find people with expertise.

Facets in Connect UNAVCO: Filter publications by publication year, sort by Altmetric score.
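
As a rough illustration of filtering by year and sorting by score, the sketch below builds an Elasticsearch search body in Python. The field names `publicationYear` and `altmetricScore` are assumptions made for this example and are not taken from the Connect UNAVCO index; the actual site issues its queries through facetview2 rather than through code like this.

```python
import json


def publications_query(year, size=25):
    """Build a search body: match one publication year, sort by score.

    Field names are hypothetical placeholders for whatever the index uses.
    """
    return {
        # A term query inside bool/must keeps only the selected year.
        "query": {"bool": {"must": [{"term": {"publicationYear": year}}]}},
        # Highest Altmetric score first.
        "sort": [{"altmetricScore": {"order": "desc"}}],
        "size": size,
    }


if __name__ == "__main__":
    # Print the body that would be sent to the index's _search endpoint.
    print(json.dumps(publications_query(2015), indent=2))
```

Sorting this way only works if the score is stored as a numeric field in the index, which is why the scores have to be fetched and saved rather than only displayed as badges.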

Facets in Connect UNAVCO: Elasticsearch 1 (https://www.elastic.co/). Facetview2 (https://github.com/CottageLabs/facetview2). Ingest scripts and themes: https://github.com/gneissone/connect-unavco-elasticsearch ; https://github.com/tetherless-world/dco-elasticsearch ; https://github.com/cu-boulder/facetview2 . Workflow: Query VIVO → Map to JSON → load to Elasticsearch.

Facets in Connect UNAVCO: Query VIVO → Map to JSON → load to Elasticsearch. Use the VIVO SPARQL API to pull out the necessary info: station name, location, PIs, retirement date, related datasets, image thumbnail. CONSTRUCT queries: better performance than DESCRIBE, but more complicated to write.
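
A minimal sketch of the "Query VIVO" step, assuming a local VIVO whose SPARQL query API is exposed at `/api/sparqlQuery` and accepts `email`, `password`, and `query` form parameters. The host, the `ex:` namespace, and the `ex:Station` / `ex:hasPI` terms are hypothetical placeholders, not the ontology terms Connect UNAVCO actually uses. The code only builds the request; sending it is left to the caller.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Assumed endpoint for a local VIVO install; adjust host and context path.
VIVO_SPARQL_URL = "http://localhost:8080/vivo/api/sparqlQuery"

# CONSTRUCT returns only the triples named in the template, which is why it
# tends to be cheaper than DESCRIBE. The ex: terms are illustrative only.
STATION_QUERY = """
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX ex:   <http://example.org/ontology#>
CONSTRUCT {
  ?station a ex:Station ;
           rdfs:label ?name ;
           ex:hasPI ?pi .
}
WHERE {
  ?station a ex:Station ;
           rdfs:label ?name .
  OPTIONAL { ?station ex:hasPI ?pi }
}
"""


def build_request(email, password, query=STATION_QUERY):
    """Return a POST request for the SPARQL API without sending it."""
    body = urlencode({"email": email, "password": password, "query": query})
    return Request(
        VIVO_SPARQL_URL,
        data=body.encode("utf-8"),
        headers={"Accept": "text/turtle"},  # ask for the graph as Turtle
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would run the query; the account used needs permission to call the SPARQL API.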

Facets in Connect UNAVCO: Query VIVO → Map to JSON → load to Elasticsearch. Create a JSON file (data.json). Optionally, create a schema file for Elasticsearch that defines each field: its data type and the type of tokenizing that Elasticsearch’s analyzers should perform on it.
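
The "Map to JSON" step could look roughly like the sketch below, which writes records in the newline-delimited format expected by Elasticsearch's `_bulk` endpoint (one action line, then one document line). The record fields, the `station` type name, and the mapping are assumptions for illustration; the mapping uses Elasticsearch 1.x syntax (`string`, `not_analyzed`), which later versions replaced.

```python
import json


def to_bulk_lines(records, index="unavco", doc_type="station"):
    """Serialize records as bulk-API lines: action, then document."""
    lines = []
    for rec in records:
        # Use the VIVO URI as the document id so reloads overwrite in place.
        action = {"index": {"_index": index, "_type": doc_type, "_id": rec["uri"]}}
        lines.append(json.dumps(action))
        lines.append(json.dumps(rec))
    # The bulk API requires a trailing newline after the last line.
    return "\n".join(lines) + "\n"


# Hypothetical schema (mapping) file contents, Elasticsearch 1.x style.
MAPPING = {
    "station": {
        "properties": {
            "name": {"type": "string"},  # analyzed for full-text search
            "uri": {"type": "string", "index": "not_analyzed"},  # exact match
            "retirementDate": {"type": "date"},
            "location": {"type": "geo_point"},
        }
    }
}

if __name__ == "__main__":
    sample = [{"uri": "http://example.org/station/1", "name": "Example station"}]
    with open("data.json", "w", encoding="utf-8") as fh:
        fh.write(to_bulk_lines(sample))
```

Marking a field `not_analyzed` keeps it as a single token, which is usually what a facet needs; analyzed fields are split into words for search.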

Facets in Connect UNAVCO: Query VIVO → Map to JSON → load to Elasticsearch. Data is loaded to Elasticsearch via the bulk API: $ curl -XPOST 'http://localhost:9200/unavco/_bulk' --data-binary @data.json For more on Elasticsearch and facetview2 in VIVO…

Altmetric scores: VIVO displays Altmetric badges on demand, but we need the Altmetric score in the database for sorting. Get the score by DOI using the API.
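
A hedged sketch of fetching a score by DOI, assuming the public Altmetric details endpoint `https://api.altmetric.com/v1/doi/<doi>` and a `score` field in its JSON response. Treating a 404 as "no score yet" is an assumption of this example, and API keys and rate limits are not handled.

```python
import json
from urllib.error import HTTPError
from urllib.parse import quote
from urllib.request import urlopen

ALTMETRIC_BASE = "https://api.altmetric.com/v1/doi/"


def altmetric_url(doi):
    """Build the lookup URL, keeping the slash inside the DOI."""
    return ALTMETRIC_BASE + quote(doi, safe="/")


def extract_score(payload):
    """Pull the numeric score out of a JSON response body, if present."""
    score = json.loads(payload).get("score")
    return float(score) if score is not None else None


def fetch_score(doi, timeout=10):
    """Return the score for a DOI, or None when Altmetric has no record."""
    try:
        with urlopen(altmetric_url(doi), timeout=timeout) as resp:
            return extract_score(resp.read().decode("utf-8"))
    except HTTPError as err:
        if err.code == 404:  # assumed to mean the DOI is not tracked
            return None
        raise
```

The returned number can then be written back alongside the publication so that the search index can sort on it.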

Altmetric scores: Fetch scores for 5,400 publications daily. Can buy a commercial license or get a free license for academic research projects.

Future Work: Integrate crosslinking work. Elasticsearch/facetview geospatial capabilities. Refine and enhance faceted browsing. Survey community for dataset and publication connections.

Other EarthCollab presentations at VIVO 2016: Thursday, 5pm, Colorado Ballroom A-D: EOL Arctic Data Connects (Don Stott, John Allison, and C. Brooks Snyder). Friday, 11am, Colorado Ballroom G: Using VIVO for Scientific Applications (Matthew Mayernik, Anne Wilson, and John Furfey). Friday, 3:30pm, Colorado Ballroom E-F: Extending VIVO Infrastructure to Support Linking Information between EarthCollab VIVO Instances (Huda Khan et al.).

Thank you! connect.unavco.org git.io/vG9AJ earthcube.org/group/earthcollab Contact: Benjamin Gross mbgross@unavco.org orcid.org/0000-0002-7908-1987