DCO-DS: Moving Forward DCO Synthesis Meeting. Oct. 29-30, 2015 DCO-DS = DCO Data Science.

Slides:



Advertisements
Similar presentations
Portfolio Management, according to Office of Management and Budget (OMB) Circular A-16 Supplemental Guidance, is the coordination of Federal geospatial.
Advertisements

Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
Complexity must become Linear or Decrease Smart data infrastructure: The sixth generation of mediation for data science Peter Fox 1
DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( ), Yu Chen 1 Patrick West.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Building the LTER Network Information System. NIS History, Then and Now YearMilestone 1993 – 1996NIS vision formed by Information Managers (IMs) and LTER.
Fostering Continuous Improvement of Curriculum - Learning Outcomes Peter Wolf Director, Centre for Open Learning Educational Support University of Guelph.
Saturday 1 SN4CI. November 2005SNAC2 Words (used across 3 or more groups) Defined: community, scope Identifying: developers, early adopters, mechanism.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
1 Data Strategy Overview Keith Wilson Session 15.
GMD German National Research Center for Information Technology Innovation through Research Jörg M. Haake Applying Collaborative Open Hypermedia.
V. Chandrasekar (CSU), Mike Daniels (NCAR), Sara Graves (UAH), Branko Kerkez (Michigan), Frank Vernon (USCD) Integrating Real-time Data into the EarthCube.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
The Natural Resources Digital Library Needs, Partners, and Challenges Bonnie Avery, Janine Salwasser, & Janet Webster Oregon State University.
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma,
Configurable User Interface Framework for Cross-Disciplinary and Citizen Science Presented by: Peter Fox Authors: Eric Rozell, Han Wang, Patrick West,
Progress in Open-World, Integrative, Web-based Collaborative Research Platforms Peter Fox and the DCO-DS* Team Tetherless World Constellation.
Training of Process Facilitators Training of Process Facilitators.
The Digital Library for Earth System Education: A Community Resource
Clinical Trials Program PhUSE Semantic Technology WG.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
Global Change Information System: Information Model and Semantic Application Prototypes (GCIS-IMSAP) Status 01/08/2013 Stephan Zednik 1, Curt Tilmes 2,
An Example in The DCO Data Portal Formal Specification of Data Types in the Deep Carbon Observatory Data Portal Xiaogang (Marshall) Ma
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
24 March 2010Atlanta, Georgia Passing it on: Notes on digital initiative sustainability Marty Kurth HBCU Library Alliance – Cornell University Library.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
DCO's Data Science Day Introduction June 5, 2014, Troy NY Peter Fox (Rensselaer Polytechnic Institute)
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Kevin T. Gallagher and Linda C. Gundersen September 5, 2012 CDI Science.
The ICDP Information Network Telework and Information Management in Scientific Drilling Projects Jens Klump and Ronald Conze GeoForschungsZentrum Potsdam.
Deep Carbon Observatory Data Science and Data Management Infrastructure Overview and Demonstration Patrick West – Tetherless World Constellation Rensselaer.
Interfacing Registry Systems December 2000.
Semantic Cyberinfrastructure for Knowledge and Information Discovery (SCiKID) Proposal Principle Investigator: Eric Rozell Tetherless World Constellation.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Putting Research to Work in K-8 Science Classrooms Ready, Set, SCIENCE.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
GEO Work Plan Symposium 2012 ID-03: Science and Technology in GEOSS ID-03-C1: Engaging the Science and Technology (S&T) Community in GEOSS Implementation.
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
FEA DRM Management Strategy Presented by : Mary McCaffery, US EPA.
TWC Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Observatory Community Xiaogang (Marshall) Ma, Yu Chen, Han Wang, Patrick West,
Prof. Peter #twcrpi) Tetherless World Constellation Chair, Earth and Environmental Science/ Computer Science/ Cognitive.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
Building the LTER Network Information System. NIS History, Then and Now YearMilestone 1993 – 1996NIS vision formed by Information Managers (IMs) and LTER.
Last Updated 1/17/02 1 Business Drivers Guiding Portal Evolution Portals Integrate web-based systems to increase productivity and reduce.
Brief: Data Science Progress/ Activities and Renewal Plans DCO Executive Committee. Oct. 8-9, Rome (IT) DCO-DS = DCO Data Science.
DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( ), Yu Chen 1 Patrick West.
VIVO Conference 2013 Panel on VIVO Use-Cases for Collaborative Science: From Researcher Networks to Semantic User Interfaces for Data Patrick West – Tetherless.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.
OOI-CYBERINFRASTRUCTURE OOI Cyberinfrastructure Education and Public Awareness Plan Cyberinfrastructure Design Workshop October 17-19, 2007 University.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
Deep Carbon Observatory Data Science and Data Management Infrastructure Overview and Demonstration Patrick West – Tetherless World Constellation Rensselaer.
E ARTHCUBE C ONCEPTUAL D ESIGN A Scalable Community Driven Architecture Overview PI:
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Social and Personal Factors in Semantic Infusion Projects Patrick West 1 Peter Fox 1 Deborah McGuinness 1,2
ISWG / SIF / GEOSS OOSSIW - November, 2008 GEOSS “Interoperability” Steven F. Browdy (ISWG, SIF, SCC)
Building Systems for Today’s Dynamic Networked Environments A Methodology for Building Sustainable Enterprises in Dynamic Environments through knowledge.
TWC Adoption* of RDA DTR and PIT in the Deep Carbon Observatory Data Portal Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox, & the.
ISWG / SIF / GEOSS OOS - August, 2008 GEOSS Interoperability Steven F. Browdy (ISWG, SIF, SCC)
Data Sources & Using VIVO Data Visualizing Science VIVO provides network analysis and visualization tools to maximize the benefits afforded by the data.
DataNet Collaboration
Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox,
Semantic Database Builder
Stephan Zednik, Patrick West, Peter Fox Tetherless World Constellation
Stephan Zednik, Patrick West, Peter Fox Tetherless World Constellation
Data types and persistent identifiers in
Adoption of RDA DTR and PIT in the Deep Carbon Observatory Data Portal
CEOS WGISS Carbon Data Portal: Progress and Demo CEOS WGISS Carbon Portal Team Reported at WGISS’48 Vietnam Academy of Science and Technology, Hanoi,
Presentation transcript:

DCO-DS: Moving Forward DCO Synthesis Meeting. Oct , 2015 DCO-DS = DCO Data Science

Vision… “Our vision is to develop, facilitate, and maintain sustained multi-way engagement of carbon scientists in multi-scale local to global networks” [for the transformation of our understanding of carbon in Earth]. Organization is required so participants can carry out their mission(s) Those participants (by defn.) may never be in a single organization -> virtual organization

Virtual Organizations as Socio-Technical Systems ‘ …a geographically distributed organization whose members are bound by a long-term common interest or goal, and who communicate and coordinate their work through information technology’ (Ahuja) ‘These members assume well defined roles and status relationships within the context of the virtual group that may be independent of their role and status in the organization employing them’ (Ahuja et al., 1998) Technology Communication Patterns Organizational Structure

Virtual Organization Feature: Outcomes/ values Dynamic versus static Evolvable/ ecosystem-like Heterogenetic tolerance Attributes of the organization Roles/ responsibilities Scale or scalability

Strategy…

Mapping… goal -> use case participation -> team(s), vetting, acceptance outcomes/ value -> goals, metrics, evaluation, incentives, data/information/ knowledge projects, responses, decisions dynamic -> agile working format, small iterations evolution -> rapid development, evaluation and iteration (open)

Methodology…

DCO-DS Evaluation Form as key input to DCO-DS ●Focused on the evaluation of Deep Carbon virtual Observatory ●Evaluation questions will help determine DCvO's role in ○Increasing members, activity and awareness of DCO activities ○Enabling search, access, exchange and use of data & information for DCO scientific and educational needs ○Needs to further integrate with DCO Members' essential technologies ●Phased roll-out to begin early Oct ○Wave 1: Executive Committee, Secretariat, Community leads, selected others ○Wave 2: DCO SSCs, Engagement ○Waves 3, 4, 5, 6: DCO Communities

Value Philosophy Value focuses on organizational outputs (or outcomes) rather than inputs For example: Deployed knowledge and skills vs research budgets Value relates to benefit of outcomes, rather than outcomes themselves Products and services enabled by knowledge and skills Value implies relative, useful, and usable outcomes Beneficiaries have to understand and appreciate Credit: B. Rouse (BEVO) 2008

Leveraging existing data resources Interface between DCO Data Portal and other data repositories – key part of post-2019 efforts (e.g. Spring 2015 effort with CoDL/ MBL) Incorporate specific metadata requirements into the DCO Knowledge Store Extend DCO Ontology for incorporation of other repository data, and/or utilize existing schema Provide data in a variety of formats for use (non-specialists) Populate the metadata and data repository for DCO projects that do not already have their own portal Work on and develop new boundary activities

DCO-DS Boundary Activities

Moving Forward A technology refresh for major platform components for the DCO network, and a “network” succession plan Prioritized efforts based on evaluations (Nov-Dec) Inputs from DCO synthesis discussions and post-2019 committees/ task groups Significant efforts on data registration and data legacies And continue to work on existing and develop new boundary activities

Questions? Comments? Patrick West, Peter Fox, The Team: Lead: Peter Fox, Staff: Patrick West, Stephan Zednik and John Erickson, Post Doc: Marshall Ma, Graduate Students: Han Wang, Hao Zhong, Ahmed Eleish

DCO Knowledge Graph Analytics 1.Identified key areas of DCO for analysis and visualization, initially: ○Publications and publication keywords ○User registrations ○DCO Member areas of expertise 2.Instance Creation statistics: who is creating what and associated with what communities. 3.What would you like to see?

DCO Knowledge Graph Analytics Publication Subject Area Word Cloud

Current Work: Thermodynamic Data Rescue ●A large number of geoscience publications contain publication datasets that are not expressed external to the publication text ●Extracting, organizing, and reusing these datasets is valuable ●Data Science Team and Extreme Physics and Chemistry community member Mark Ghiorso identified thermodynamic datasets about the enthalpy and entropy of chemicals

Current Work: Geo Sample curation and IGSN ●Have GeoSample as a class in DCO ontology and collect the core metadata items for sample registration in the DCO data portal; ●Interface between the DCO IGSN Allocation Agent and the IGSN registry agent, with two potential functionalities: ○Assign IGSN to a sample record through the DCO data portal in collaboration with UT funded activity ○Use IGSN to import sample records from existing repositories to the DCO data portal, if there is a mature IGSN metadata API

Future Work: Instrument Reporting and Browsing* ●Progress to-date: ○Reporting on DCO-funded Instrument use by Projects and Field Studies ○Referencing DCO Instrument use within Grant Summary Reports ■within Instrument grants and related project/field study grants ●Future work: The Instrument Browser ○Dynamically generated instrument list and instrument summary page ○A faceted search interface for instruments ○Instrument discovery based on nature of use, data collected, projects and point of contact * Outcome from the DCO Data Science day at RPI in 2014!!!

Future Work: Deep Carbon Science Trend Analysis ●Natural Language Processing (NLP) based analysis of Deep Carbon publication corpus ○Extracts entities and relations from the corpus ○Constructs a Deep Carbon Knowledge Base consisting of unified entities and relations ○Provides structured knowledge for downstreaming applications and analysis ●Includes retrieval of authoritative metadata into DCO Knowledge Graph ●Includes Deep Carbon Science Visualization Dashboard