Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

Slides:



Advertisements
Similar presentations
Complexity must become Linear or Decrease Smart data infrastructure: The sixth generation of mediation for data science Peter Fox 1
Advertisements

DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( ), Yu Chen 1 Patrick West.
McGuinness – Microsoft eScience – December 8, Semantically-Enabled Science Informatics: With Supporting Knowledge Provenance and Evolution Infrastructure.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Applying Semantics in Dataset Summarization for Solar Data Ingest Pipelines James Michaelis ( ), Deborah L. McGuinness
Citation and Recognition of contributions using Semantic Provenance Knowledge Captured in the OPeNDAP Software Framework Patrick West 1
TWC Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semantic Technologies Xiaogang (Marshall) Ma Tetherless World Constellation.
Scientific Knowledge Discovery in Complex Semantic Networks of Geophysical Systems (no pressure…) EGU2012, NP2.6 April 25, 2012, Vienna, Austria Peter.
ToolMatch: Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Patrick West 1 Nancy Hoebelheinrich.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Provenance-Aware Faceted Search Deborah L. McGuinness 1,2 Peter Fox 1 Cynthia Chang 1 Li Ding 1.
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma,
Configurable User Interface Framework for Cross-Disciplinary and Citizen Science Presented by: Peter Fox Authors: Eric Rozell, Han Wang, Patrick West,
Progress in Open-World, Integrative, Web-based Collaborative Research Platforms Peter Fox and the DCO-DS* Team Tetherless World Constellation.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Global Change Information System: Information Model and Semantic Application Prototypes (GCIS-IMSAP) Status 01/08/2013 Stephan Zednik 1, Curt Tilmes 2,
An Example in The DCO Data Portal Formal Specification of Data Types in the Deep Carbon Observatory Data Portal Xiaogang (Marshall) Ma
References: [1] [2] [3] Acknowledgments:
DCO's Data Science Day Introduction June 5, 2014, Troy NY Peter Fox (Rensselaer Polytechnic Institute)
Deep Carbon Observatory Data Science and Data Management Infrastructure Overview and Demonstration Patrick West – Tetherless World Constellation Rensselaer.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
Semantic Cyberinfrastructure for Knowledge and Information Discovery (SCiKID) Proposal Principle Investigator: Eric Rozell Tetherless World Constellation.
Discovering accessibility, display, and manipulation of data in a data portal Nancy Hoebelheinrich Patrick West 2
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
Local global disambiguation of terms and concepts The BCO-DMO metadata database uses controlled vocabularies to record many of the important pieces of.
Modeling and Representing National Climate Assessment Information using Linked Data Jin Guang Zheng 1 Curt Tilmes 2
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
Tetherless World Constellation Open Government Data Jim Hendler Tetherless World Professor of Computer and Cognitive Science Assistant Dean of Information.
Citation and Recognition of contributions using Semantic Provenance Knowledge Captured in the OPeNDAP Software Framework Patrick West 1
TWC Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Observatory Community Xiaogang (Marshall) Ma, Yu Chen, Han Wang, Patrick West,
Prof. Peter #twcrpi) Tetherless World Constellation Chair, Earth and Environmental Science/ Computer Science/ Cognitive.
1 Semantic Provenance and Integration Peter Fox and Deborah L. McGuinness Joint work with Stephan Zednick, Patrick West, Li Ding, Cynthia Chang, … Tetherless.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
Resource Discovery for Extreme Scale Collaboration Benno Lee Patrick West 1 William Smith 2
Brief: Data Science Progress/ Activities and Renewal Plans DCO Executive Committee. Oct. 8-9, Rome (IT) DCO-DS = DCO Data Science.
Knowledge Networks and Science Data Ecosystems December 7, 2012, AGU12 IN54A-02. Peter Fox (RPI/ Tetherless World Constellation and WHOI/AOP&E)
DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( ), Yu Chen 1 Patrick West.
VIVO Conference 2013 Panel on VIVO Use-Cases for Collaborative Science: From Researcher Networks to Semantic User Interfaces for Data Patrick West – Tetherless.
Information Modeling and Semantic Web Application For National Climate Assessment Jin Guang Zheng 1 Curt Tilmes 2
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
Facilitating Next Generation Science Collaboration: Marine Ecosystems Status Reports and Assessments June 24, 2014 IMBER – D2 Peter Fox (RPI/ Tetherless.
Data Type Registries (DTR) RDA 4th WG/IG Collab Meeting NIST: Dec 2015 Larry Lannom CNRI.
 Key integrating concepts  Groups  Formal Community Groups  Ad-hoc special purpose/ interest groups  Fine-grained access control and membership 
TWC Illuminate Knowledge Elements in Geoscience Literature Xiaogang (Marshall) Ma, Jin Guang Zheng, Han Wang, Peter Fox Tetherless World Constellation.
DCO-DS: Moving Forward DCO Synthesis Meeting. Oct , 2015 DCO-DS = DCO Data Science.
Determining Fitness-For-Use of Ontologies through Change Management, Versioning and Publication Best Practices Patrick West 1 Stephan.
Supported by ESIP Semantic Web Cluster A service based on community-built semantic web applications Provide users with the means to match their datasets.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
Deep Carbon Observatory Data Science and Data Management Infrastructure Overview and Demonstration Patrick West – Tetherless World Constellation Rensselaer.
How Environmental Informatics is Preparing Us for the Era of Big Data AGU FM 2013 GC11F-01 December 09, 2013, MW 3001 Peter
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Social and Personal Factors in Semantic Infusion Projects Patrick West 1 Peter Fox 1 Deborah McGuinness 1,2
TWC Adoption* of RDA DTR and PIT in the Deep Carbon Observatory Data Portal Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox, & the.
A Framework for Earth Science Search Interface Development Design and Implementation of S2S Presented by: Stephan Zednik, Tetherless World Constellation.
The Semantic eScience Framework AGU FM10 IN22A-02 Deborah McGuinness and Peter Fox (RPI) Tetherless World Constellation.
Poster: EGU Glossary: USGCRP – United States Global Change Research Program NCA – National Climate Assessment GCIS – Global Change Information.
Data Sources & Using VIVO Data Visualizing Science VIVO provides network analysis and visualization tools to maximize the benefits afforded by the data.
Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox,
Stephan Zednik, Patrick West, Peter Fox Tetherless World Constellation
Stephan Zednik, Patrick West, Peter Fox Tetherless World Constellation
Deep Carbon Observatory Data Science Platform
Data types and persistent identifiers in
Modeling Data Set Versioning Operations
Adoption of RDA DTR and PIT in the Deep Carbon Observatory Data Portal
Science Data Platforms: Informatics Architectures at the Forefront.
Bird of Feather Session
Presentation transcript:

deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer Polytechnic Institute From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

Outline Deep Carbon Observatory Deep Carbon Virtual Observatory (DCvO) –Architecture of DCvO –DCO Ontologies –Boundary activities –Discovering information by clicking through Summary 2

A 10-year ( ) initiative to intensify global attention and scientific effort in the burgeoning field of deep carbon science 3

Faculty, staff and students from the Tetherless World Constellation (TWC) at Rensselaer Polytechnic Institute (RPI) Responsible for –DCO Architecture and technology infrastructure –DCO Computer Cluster –The Deep Carbon Virtual Observatory DCvO Deep Carbon Observatory – Data Science 4 4

Deep Carbon Virtual Observatory 5 Scientists – actually ANYONE - should be able to access a global, distributed knowledge base of scientific data and information that: appears to be integrated appears to be locally available is in a language (written, programming, or science) that is understandable and can be shared Data intensive – volume, complexity, mode, scale, heterogeneity, … in an OPEN WORLD 5

Deep Carbon Virtual Observatory A vision of the DCvO: –A conceptual model of the interplay between data, people, publication, instruments, models, organizations, etc. –Identify, annotate and link all key entities, agents and activities –A repository for datasets and associated metadata –Unique and powerful data and metadata visualization for dissemination of information –Facilitates the discovery of potential collaborations –An integrated portal for diverse content and applications (Fox et al., 2014) 6

DCvO “Architecture” 7

vivo.cornell.edu VIVO - represents academic research communities DCO ontology: a model for concept types and relationships DCO ontologies extend each other and the VIVO ontology 8

Ontologies and schemas used in the DCO web portal 9 NamePrefix Dublin Core Metadata Element Setdc DCMI Metadata Termsdct VIVO Corevivo VIVO Scientific Research Ontologyscires Data Catalog Vocabularydcat Bibliographic Ontologybibo Citation Counting and Context Characterization Ontologyc4o Citation Typing Ontologycito FRBR-Aligned Bibliographic Ontologyfabio Event Ontologyevent Friend of a Friendfoaf vCard Ontologyvcard Geopolitical Ontologygeo Simple Knowledge Organization Systemskos DCO Ontologydco PROV Ontologyprov

Ontologies and schemas used in the DCO web portal 10 DCO Boundary Activities are driving the extensions within the DCO Ontologies

DCO Extension for Project Updates 11

12 Dynamically generated list of Grants that are part of the Deep Carbon Observatory. Users can click through to learn more, and members can create reports to be sent to funding orgs

13 Grant page lists all projects and reporting updates for each of the projects and field studies

DCO Extension for Data Types 14

15 A Few Boundary Activities Given a DOI pull publication information from CrossRef and/or Web of Science DCO IGSN Allocation Agent to work with the IGSN Registry Integration with existing data portals and repositories Data Rescue activities

Modern informatics enables a new scale-free framework approach Use cases Stakeholders Modeling Ontologies Evaluation 16

What does a DCO data publication look like? 17

18 Identification and annotation Information on the landing page of a dataset

19 Linking to enable forward and backward tracking Landing page of Helium Concept

20 Landing page of a person Linking to build Collaborations

21 Landing page of a research area Linking to build Collaborations

22 DCO Knowledge Graph Analytics

Thus… progress… Integrative – semantics Transparent – semantics Collaborative – semantics Application integration –Yep – semantics 23

Thank you! 24 Patrick West,

25 An integrated portal: deepcarbon.net

26 Faceted publication browser

Repository for archiving datasets Archived datasets of ‘Noble gas isotope abundances in terrestrial fluids’ 27

Collaboration tools Group Based Collaboration Group data deposit and reporting Listings of group content Group management and messaging 28

29 RDA DTR and PIT adoption The DTR primitives are comparable to a list of BASIC DATA TYPE CLASSES in the DCO ontology, e.g. Dataset, Image, Video, Audio, etc. A registered DCO dataset is asserted as an instance of one of those basic data type classes. It is possible to further annotate the dataset with the SPECIFIC DATA TYPES defined within a DTR, and each data type has a unique PID. A Few Boundary Activities

Results of data type specification Updates to the DCO Ontology: –A new class dco:DataType. Each specific data type is an instance of it –An object property dco:hasDataType linking a dataset and a data type –A collection of other classes and properties associated with dco:DataType 30

31 New datasets available via dataset browser Includes citations to the originating publication Data files accessible through dataset repository Thermodynamic Data Rescue

32 DCO Knowledge Store Analytics

33 DCO Knowledge Store Visualizations

All information is linked and traceable! 34

Mediation From: C. Borgman, 2008, NSF Cyberlearning Report, Illustration by Roy Pea and Jillian C. Wallis 6 th Generation All these generations of mediation are in effect as we collaborate 35