Social and Personal Factors in Semantic Infusion Projects Patrick West 1 Peter Fox 1 Deborah McGuinness 1,2


Patrick West 1, Peter Fox 1, Deborah McGuinness 1,2, Stephan Zednik 1
(1 Rensselaer Polytechnic Institute, th St., Troy, NY, United States), (2 McGuinness Associates, 4 Shaker Bay Rd., Latham, NY, United States)

Abstract: As part of our semantic data framework activities across multiple, diverse disciplines, we required the involvement of domain scientists, computer scientists, software engineers, data managers, and often social scientists. This involvement from a cross-section of disciplines turns out to be a social exercise as much as it is a technical and methodical activity. Each member of the team is used to different modes of working, expectations, vocabularies, levels of participation, and incentive and reward systems. We examine how both roles and personal responsibilities play into the development of semantic infusion projects, and how an iterative development cycle can contribute to the successful completion of such a project.

[Poster panel titles: Catalogs/Ontology Integration; Knowledge Infusion Experience]

Glossary:
VSTO - Virtual Solar-Terrestrial Observatory
MLSO - Mauna Loa Solar Observatory
CEDAR - Coupled Energetics and Dynamics of Atmospheric Regions
WHOI - Woods Hole Oceanographic Institution
BCO-DMO - Biological and Chemical Oceanographic Data Management Office

[Figure: Partial VSTO Ontology]

Interdisciplinary support: The science domains of the Virtual Solar-Terrestrial Observatory (VSTO) are solar physics, space physics, and solar-terrestrial physics. Major communities include those interested in solar images from the Mauna Loa Solar Observatory (MLSO), and the NSF-funded Coupled Energetics and Dynamics of Atmospheric Regions (CEDAR). The domain of the BCO-DMO project (Biological and Chemical Oceanographic Data Management Office) at WHOI (Woods Hole Oceanographic Institution) encompasses the ocean sciences.
These collections provide a good focus for virtual observatory work, since the datasets are of significant scientific value to a set of researchers and capture the challenges inherent in complex, diverse scientific data.

Use Cases: A use case indicates a specific capability that drives both what knowledge is to be represented and used, and what software and interfaces are built for the user and to the underlying data. It contains functional and non-functional requirements, success and failure scenarios, the vocabulary used within the research domain of the user developing the use case (typically a research/domain expert), and domain-specific research practices.

Ontologies: Ontologies created in OWL define concepts, relations, terms, and so on, so that their precise formal definitions can be used for semantic search and interoperability. Use case sentences are examined to identify initial concepts and the relations between them. Hierarchies become apparent, as do important properties and restrictions on the values for certain concepts within a given context. Initial ontologies stay as close to the vocabulary of the user as possible. With additional iterations, new concepts can be introduced by the ontologist and other ontologies can be utilized. As the CEDAR catalog diagram shows, many of the tables and relationships in the CEDAR catalog relate directly to classes and properties in the VSTO ontology. The work in CEDAR has helped to define the ontology for VSTO, and the work on the ontology for VSTO has helped to define a better catalog for CEDAR.
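As a rough illustration of how catalog tables can line up with ontology classes and properties, the following minimal sketch maps rows of a relational catalog to subject/predicate/object triples. Everything in it is an assumption made for illustration: the table names, column names, class and property names, and the `http://example.org/vsto#` namespace are hypothetical and are not taken from the actual CEDAR catalog or VSTO ontology.

```python
from dataclasses import dataclass

# Placeholder namespace for illustration only; not the real VSTO IRI.
NS = "http://example.org/vsto#"


@dataclass
class TableMapping:
    """Links one catalog table to one ontology class (all names hypothetical)."""
    table: str      # catalog table name
    owl_class: str  # ontology class that rows of the table instantiate
    columns: dict   # column name -> ontology property name


# Hypothetical mappings; a real project would derive these with domain experts.
MAPPINGS = [
    TableMapping("instrument", "Instrument",
                 {"name": "hasName", "kind": "hasInstrumentType"}),
    TableMapping("parameter", "Parameter",
                 {"name": "hasName", "units": "hasUnits"}),
]


def literal(value):
    """Render a value as a quoted literal, escaping backslashes and quotes."""
    text = str(value).replace("\\", "\\\\").replace('"', '\\"')
    return f'"{text}"'


def row_to_triples(mapping, row_id, row):
    """Turn one catalog row into a list of (subject, predicate, object) triples."""
    subject = f"<{NS}{mapping.table}/{row_id}>"
    triples = [(subject, "rdf:type", f"<{NS}{mapping.owl_class}>")]
    for column, prop in mapping.columns.items():
        value = row.get(column)
        if value is not None:  # skip columns that are absent or NULL
            triples.append((subject, f"<{NS}{prop}>", literal(value)))
    return triples


if __name__ == "__main__":
    example_row = {"name": "Example Instrument", "kind": None}
    for triple in row_to_triples(MAPPINGS[0], 7, example_row):
        print(" ".join(triple), ".")
```

Keeping the mapping as data rather than code means that each iteration with domain experts changes only the `MAPPINGS` list, which fits the way the catalog and the ontology are described as refining each other.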
[Figure: CEDAR Catalog Relationship Diagram]

[Figure: Iterative Design and Development. Diagram labels: Science/Expert Review & Iteration; Use Case; Small Team, mixed skills; Analysis; Adopt Technology Approach; Leverage Technology Infrastructure; Rapid Prototype; Open World: Evolve, Iterate, Redesign, Redeploy; Develop model/ontology; Evaluation; Use Tools]

[Figures: First iteration of the WHOI Person Concept. Second iteration of the WHOI Person Concept: includes all of the foaf concepts for name, contact information, interests. We keep three concepts. Third iteration of the WHOI Person Concept.]

Iterative Ontology Development and Understanding: The WHOI Person concept is an example where the iterative process helped the WHOI domain experts to understand ontologies and to translate their concepts into an ontology, and helped the ontology developers to understand the specific domain vocabulary. Successive iterations helped to expand and simplify concepts and to incorporate already existing ontologies. The instrument, platform, and parameter ontologies were developed in a similar way.

Sponsors: National Science Foundation

Acknowledgments: Cyndy Chandler, Dickey Allison, Robert Groman and Christine Hammond of WHOI for their considerable contributions.

Collaboration: Extensive engagement of the end users (the domain scientists, students, and professionals) enables explicit semantic interoperability. To provide a scientific infrastructure that is usable and extensible, VSTO and BCO-DMO required contributions in semantic integration and knowledge representation, as well as depth in each of the science areas. Developing and analyzing use cases within the iterative approach calls for a small team made up of a facilitator (who knows the iterative methodology), domain experts (who know the resources, data, applications, and tools), ontology modelers (to extract objects and relations), software engineers (architecture and technology), and a scribe (capturing everything discussed).
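The use case contents and team roles described above can be captured in a simple record, which makes it easy to see what is still missing before an iteration starts. This is only a sketch under stated assumptions: the `UseCase` class, its field names, and the `missing_roles` and `empty_sections` helpers are hypothetical names introduced here, not part of any tool used in the projects.

```python
from dataclasses import dataclass, field, fields

# Roles named in the text for the small use case team.
REQUIRED_ROLES = {
    "facilitator",
    "domain expert",
    "ontology modeler",
    "software engineer",
    "scribe",
}


@dataclass
class UseCase:
    """Hypothetical record of what a use case contains."""
    title: str
    functional_requirements: list = field(default_factory=list)
    non_functional_requirements: list = field(default_factory=list)
    success_scenarios: list = field(default_factory=list)
    failure_scenarios: list = field(default_factory=list)
    vocabulary: dict = field(default_factory=dict)  # term -> meaning in the user's words
    research_practices: list = field(default_factory=list)


def missing_roles(team):
    """Return the required roles that nobody on the team fills.

    `team` maps a person's name to a role string.
    """
    return REQUIRED_ROLES - set(team.values())


def empty_sections(use_case):
    """Return the names of use case sections that have no entries yet."""
    return [
        f.name
        for f in fields(use_case)
        if f.name != "title" and not getattr(use_case, f.name)
    ]


if __name__ == "__main__":
    draft = UseCase(
        title="Find data by instrument",
        functional_requirements=["Search by instrument name"],
        vocabulary={"instrument": "device that produced the observation"},
    )
    team = {"Alex": "facilitator", "Sam": "scribe"}
    print("Unfilled roles:", sorted(missing_roles(team)))
    print("Sections still empty:", empty_sections(draft))
```

A check like this does not replace the facilitator's judgment; it only flags gaps in the record so the team can discuss them at the next meeting.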
Social: An implication of adding semantics is the strong need and role for domain-literate members of the team to develop and then vet the knowledge encoding. This turns out to be a social exercise as much as it is a technical and methodical activity, since the team comprises people from multiple disciplines and can even include social scientists. Social considerations also affect how projects are sustained.

Personal: Personalities are important to consider. The facilitator must be aware of and accommodate many dimensions of participation if the infusion activity is to be successful. Team members are used to different modes of working, vocabularies, and incentive and reward systems. One result is that it is very difficult to conduct the infusion simply as a remote or virtual activity unless it is substantially founded on face-to-face meetings where personal and social forces come into play.