DataNet Collaboration Organization and administration Pairwise collaborations DataNet-wide collaboration opportunities
“New types of digital data preservation and access organizations that “New types of digital data preservation and access organizations that . . . work cooperatively and in coordination to create a functional data network with revolutionary new capabilities for information access, use, and integration without regard to conventional barriers such as data type and format, discipline or subject area, and time and place.”
“Operating as a data network – requires cooperation, coordination, and close interaction both among the members of this program portfolio and with other preservation and access organizations, national and international.”
PEP Revisions Additional WBS elements 3.3 DataNet Program Collaborations Adjustment to accommodate shift in effort Completion of sustainability options document (WBS 4.1.2.1) moved from quarter 6 to quarter 10 No adjustments to scope of prototype software system Quarter complete 3.3.1 DataNet interoperability vision document 6 3.3.2 Initial planning and collaboration with DataNet partners 3.3.3 Implement DataNet interoperability 20
Learning and Communication One-on-one calls with other projects DataNet Project Managers’ calls DataNet PI & PM mailing list Attendance at DataONE Users’ Group meeting DataNet PIs meeting
DataNet PIs Meeting Project overview presentations Shared on TerraPop wiki and via webcast Round-robin pairwise discussions Large-group discussion Potential DataNet-wide collaborations Beginning of program-wide interoperability vision
MPC as DataONE Member Node resolve get search synchronize replicate
Contributions to DataONE Catalog DataONE works in terms of datasets, MPC works in terms of variables Selected high-interest area-level extracts Total population, population density Education Occupation Household utilities Metadata with pointers to our data access systems
Member Node Planning Tier 1 Member Node Draft timeline Public-access data available through DataONE Restricted access routed through MPC data sites and authentication process Draft timeline Review documentation from DataONE Dec. 2012 Complete MN Description Form Mar. 2013 Determine implementation strategy Spring 2013 Draft Partnership Agreement June 2013 Testing (stand-alone testing, content checking, functional integration testing) Winter/spring 2014 Operational June 2014
Collaboration with DFC-iRODS Export extracts to iRODS collaboration environment Store in external data grid Access from anywhere Share with collaborators Incorporate into iRODS workflows Variations on input data sets and parameters Organize inputs and outputs Demo with IPUMS; expand to TerraPop
Long-Term DFC Collaboration Incorporate TerraPop services into iRODS workflows Work with DFC team during TerraPop API development Identify and define specific services to be made available
SEAD Researcher Communities Connect people, publications, and data Scripts to harvest information from Google Scholar, EndNote, etc. Visualizations of networks Facilitate data discovery through network MPC community Bibliography Research topics Extract history
TerraPop-SEAD Topics of Mutual Interest Time series linkages Connections among similar data over time Identification and harmonization Geographic linkages Place identifiers Translation across boundaries
DataNet Interoperability Architecture User’s environment (e.g., laptop, cluster) Collaboration Environment (Data grid/portal; interface between user’s environment and community resources) Protocols, Web Services, Brokers (links between community resources and collaboration enviornments) Community Resources (Data collections, ontologies, services, models, etc. applicable across many collaboration environments)
DataNet-Wide Collaboration Cyberinfrastructure Interoperability of software stacks Technical sustainability Semantic integration Leveraging domain science input across partners Discovery, data formats, data interoperability from a scientist’s perspective
DataNet-Wide Collaboration Community Engagement DataNet branding, web page Governance (including preservation and federation policies) Long-term financial sustainability Data management training Cross-disciplinary awareness of data resources Engagement with libraries and related repositories Assessment and evaluation criteria Working on DataNet RCN proposal