NSF DataNet Initiative

Slides:



Advertisements
Similar presentations
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Advertisements

Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
Context Interchange for Dynamic Services - A daptability, extensibility, scalability analysis Hongwei (Harry) Zhu Stuart Madnick MIT Sloan School of Management.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
NSF Site Visit Introduction to Google and Masdar Instutute 8 Feb 2010 DataSpace 1.
Project Management NSF DataNet site visit to MIT February 8, 2010 DataSpace February NSF Site Visit to MIT DataSpace.
A Robust Health Data Infrastructure P. Jon White, MD Director, Health IT Agency for Healthcare Research and Quality
CASE Tools And Their Effect On Software Quality Peter Geddis – pxg07u.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Ontology-derived Activity Components for Composing Travel Web Services Matthias Flügge Diana Tourtchaninova
San Diego Supercomputer CenterUniversity of California, San Diego Preservation Research Roadmap Reagan W. Moore San Diego Supercomputer Center
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
ITEC 3220M Using and Designing Database Systems
Agent Model for Interaction with Semantic Web Services Ivo Mihailovic.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
FP WIKT '081 Marek Skokan, Ján Hreňo Semantic integration of governmental services in the Access-eGov project Faculty of Economics.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
1 Feburary 8, 2010 DataSpace 1. HP Labs Research Interests HP Labs have organized its corporate research around 8 major themes that include Information.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Co-funded by the European Community eContentplus programme The “Protected Areas” scenario of the HUMBOLDT project Roderic Molina GISIG NATURE-SDIplus Good.
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
STASIS Technical Innovations - Simplifying e-Business Collaboration by providing a Semantic Mapping Platform - Dr. Sven Abels - TIE -
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
SIG: Synthetic Seismogram Exchange Standards (formats & metadata) Is it time to establish exchange standards for synthetic seismograms? IRIS Annual Workshop.
1 CS851 Data Services in Advanced System Applications Sang H. Son
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Independent Insight for Service Oriented Practice Summary: Service Reference Architecture and Planning David Sprott.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
16/11/ Semantic Web Services Language Requirements Presenter: Emilia Cimpian
LTER Science 2050: Challenges, Constraints and Opportunities Bill Michener Professor and DataONE Project Director University of New Mexico 12 September.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
OWL-S: As a Semantic Mark-up Language for Grid Services By Narendranadh.J.
End-to-End Data Services A Few Personal Thoughts Unidata Staff Meeting 2 September 2009.
Example projects using metadata and thesauri: the Biodiversity World Project Richard White Cardiff University, UK
CIMA and Semantic Interoperability for Networked Instruments and Sensors Donald F. (Rick) McMullen Pervasive Technology Labs at Indiana University
5. 2Object-Oriented Analysis and Design with the Unified Process Objectives  Describe the activities of the requirements discipline  Describe the difference.
Model Checking Early Requirements Specifications in Tropos Presented by Chin-Yi Tsai.
4 Chapter 4: Beginning the Analysis: Investigating System Requirements Systems Analysis and Design in a Changing World, 3 rd Edition.
Witold Staniszkis Empowering the Knowledge Worker End-User Software Engineering in Knowledge Management Witold Staniszkis
The Emergent Structure of Development Tasks
‘Ontology Management’ Peter Fox (Semantic Web Cluster lead)
Why Metrics in Software Testing?
THE SP SYSTEM AS AN AID TO CRIME PREVENTION AND DETECTION (CPD)
The Semantic Web By: Maulik Parikh.
An Overview of Data-PASS Shared Catalog
OPM/S: Semantic Engineering of Web Services
System Design.
improve the efficiency, collaborative potential, and
Unified Process Source & Courtesy: Jing Zou.
Web Ontology Language for Service (OWL-S)
CS 501: Software Engineering Fall 1999
Design, prototyping and construction
EOSC services architecture
The Case for Data Management: Agency Requirements
Database Systems Instructor Name: Lecture-3.
Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta
Chapter 11 user support.
Automated Analysis and Code Generation for Domain-Specific Models
Metadata Development in the Earth System Curator
School of Information Studies, Syracuse University, Syracuse, NY, USA
CSE591: Data Mining by H. Liu
Wrap-Up – NSF Site Visit 8 February 2010
SECTION 4: OO METHODOLOGIES
Mark Quirk Head of Technology Developer & Platform Group
Design, prototyping and construction
Toward an Ontology-Driven Architectural Framework for B2B E. Kajan, L
Presentation transcript:

NSF DataNet Initiative Research Agenda NSF DataNet Initiative Site Visit 8 February 2010 DataSpace Some general comments plus one specific example … (v8)

DataSpace Research: Risk Management Spectrum Lowest risk: Incremental improvements to starting operational Dspace/Fedora (DuraSpace) platform Guaranteed useful tool for curation of scientific data Minor risk: Inclusion of existing and operational components including those suggested by partners, such as XAM (EMC), OpenII (Google), etc. Risk: Selection, Integration Modest risk: Adapting existing and evolving research technologies & prototypes (e.g., Context Interchange) Risks: Make robust, Scaling, and Integration

Some Research Areas Listed in Proposal (p.10-14) Data policy, protection and security (Abelson, Berners-Lee, Pato, Weitzner, White) Data discovery and data semantics (Madnick, Siegel, Smith) Data quality & provenance (Madnick, Abelson) Data analysis and analytics Model calibration and mediation (Woon) Operational Scientific Intelligence (Hsu) High-speed pre-processing and data consistency (Hsu) Data visualization (Karger) Workflow for scientific research and archives (Smith) Data storage (Todd, Milojicic) Legal issues with data (Wilbanks) Data interoperability, conversion, integration (Madnick, Smith)

Types of Semantic Differences Representational Ontological Temporal Simple Example (snow fall) Temporal Representational Meter vs Feet Feet before 2001, Meters afterward Ontological “Snow fall” – using standard "snowboard” method or other method (e.g., liquid) “Snowboard” before 1990, liquid method afterward Given recent situation – measuring “snow fall” seems like a very relevant example. Q: How many familiar with “snowboard” method? Briefly describe: "snowboard“ method. Essentially this is a piece of wood about 16" by 16" that is painted white. The snowboard should be wiped clean every six hours or so to prevent the natural settling of fallen snow from occurring. In addition, these should be placed well away from structures and obstructions -- about 20 to 30 feet if possible -- in order to prevent drifting from inflating the totals as well. NOAA – has some differences. Another approach uses liquid precipitation (melted snow): in general an inch of water is equal to about 10 inches of snow. Snow Depth is a Different Measurement CoCoRaHS, a network of precipitation observers, describes it this way: "For example, if half the ground has 2" of old snow and the other half of the ground is already bare, the average snowdepth would be 1"."

COntext INterchange (COIN) Approach to Resolving Semantic Differences Concept: Depth Modifiers: Meters Feet f() Meters Feet Specialized symbolic equation solving techniques used to dynamically create comprehensive conversion programs from small conversion components Light-weight Ontologies with Context Modifiers Shared Ontologies Conversion Creation Mediation & Transformation uses an integrated framework of abductive and constraint logic programming Context Mediator Declarative description of Source’s actual semantics Source Context Declarative description of Receiver’s desired/expected semantics Receiver Context 2 1 Select depth x 3.35 From dataset A Where id =“12AY” Select depth From dataset A Where id =“12AY” Note animation … Steps: SQL Query Gets raw data and checks on source context (Meter) Sees if matches receiver context (Feet) Sees if knows how to do conversion (Meters -> Feet) If so, converts to feet (e.g., about 55 feet) and returns data to receiver. depth Context Transformation 17 55.25 3 Source (Data set A) Receiver 10

Intuition – Capability for automatic determination of complex conversion programs Dataset A context Depth notion std 1 meters Scale factor Depth units Component conversions are provided along modifier axes Composite conversions between any cubes in the space can be composed automatically std std 1000 1 meters feet

Research Agenda DataSpace Will draw on extensive research experience and on-going efforts of the entire DataSpace team, including research by our corporate partners Much more details on these research efforts to date can be found in the more than 200 papers, written by the research team, and listed in the References section of the DataSpace Full Proposal Collectively the proposed research efforts represent an ambitious research agenda. We will present two more examples …