Building a CMMI Data Infrastructure

Slides:



Advertisements
Similar presentations
May 17, Capabilities Description of a Rapid Prototyping Capability for Earth-Sun System Sciences RPC Project Team Mississippi State University.
Advertisements

The Vision, Process, and Requirements for Creating EarthCube Presentation at Second EarthCube WebEx Aug 22, 2011.
The Changing Face of Research Anthony Beitz DART Integration Manager.
THE JOINED UP WORLD OF E-RESEARCH Professor Neil McLean National Technical Standards Adviser to the Department of Education Science and Training (DEST)
Knowledge Management Solutions
Advertising your data: Agency/Institution requirements for publishing metadata Nancy Hoebelheinrich Knowledge Motifs LLC Version 1.0 [Reviewed August 2012]
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
SCIENCE, RESEARCH DATA, AND PUBLISHING Stewart Wills Editorial Director, Web & New Media, Science 26 February 2013.
Making Connections: SHARE and the Open Science Framework Jeffrey Open Repositories 2015.
API, Interoperability, etc.  Geoffrey Fox  Kathy Benninger  Zongming Fei  Cas De’Angelo  Orran Krieger*
Extending Access To Information Resource Discovery Service William E. Moen, Ph.D. Kathleen R. Murray, Ph.D. School of Library and Information Sciences.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
DriveSense’14 NSF Workshop on Large-Scale Traffic and Driving Activity Data DriveSense’14, Oct 30-31, Norfolk, VA.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
ARL Workshop on New Collaborative Relationships: The Role of Academic Libraries in the Digital Data Universe September 26-27, 2006 ARL Prue.
Chaitan Baru Senior Advisor for Data Science CISE Directorate National Science Foundation NIEHS Webinar October 27, 2015 Image Credit: Exploratorium. Integrating.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Chemistry and Materials Science break-out group (Friday morning)
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Helmholtz Open Science Webinars on Research Data Webinar 34 – 6 / 11 April 2016 Dr. Birgit Schmidt Niedersächsische Staats- und Universitätsbibliothek.
Evolution of storage and data management
Chapter 6 Foundations of Business Intelligence: Databases and Information Management.
Data sharing and exchange: Experiences within the
Data Platform and Analytics Foundational Training
Using Data Management Plans and existing NSF data centers
Welcome! Enhancing the Care Team May 25, 2017
Redesigning the DOE Data Explorer to embed dataset relationships at the point of search and to reflect landing page organization Sara Studwell Department.
GISELA & CHAIN Workshop Digital Cultural Heritage Network
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
CARER Proposal Writing Workshop November 2004
EOSC MODEL Pasquale Pagano CNR - ISTI
Joslynn Lee – Data Science Educator
Jarek Nabrzyski Director, Center for Research Computing
INTAROS WP5 Data integration and management
Data access and sharing
Donatella Castelli CNR-ISTI
Summit 2017 Breakout Group 2: Data Management (DM)
Building a CMMI Data Infrastructure
KNOWLEDGE MANAGEMENT (KM) Session # 30
Welcome slide.
Jay Bhatt Drexel University Libraries
Standards for success in city IT and construction projects
CIS 333 Competitive Success/snaptutorial.com
CIS 333Competitive Success/tutorialrank.com
CIS 333 Education for Service-- snaptutorial.com.
CIS 333 Education for Service-- tutorialrank.com.
CIS 333 Teaching Effectively-- snaptutorial.com
Access  Discovery  Compliance  Identification  Preservation
Chapter 6 Foundations of Business Intelligence: Databases and Information Management.
EOSC Governance Development Forum
E-Science Life-Cycle A. D. Smith – September 26, 2011.
One Language. One Enterprise.™
Emerging Information Technologies I
Research Infrastructures: Ensuring trust and quality of data
Open Archival Information System
Scott Thorne & Chuck Shubert
Incentivizing data sharing
Chapter 6 Foundations of Business Intelligence: Databases and Information Management.
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Bird of Feather Session
Anatomy of a modern data-driven content product
Wrap-Up – NSF Site Visit 8 February 2010
Open Access to scientific publications
The Digital Library for Earth System Education (DLESE):
  1-A) How would Arctic science benefit from an improved GIS?
Sustaining Repositories
Building a CMMI Data Infrastructure
Successful Data Curation for Large Data Archives
Draft Charter Community of Practice for Direct Access Entities
Presentation transcript:

Building a CMMI Data Infrastructure DAY 2 – Breakout Summaries Arlington, VA February 6-7, 2017

Sustaining Repositories Define the key benefits / value to users Visualizations, citations, curation, etc. Describe the organizational entity ideal to manage the data repository and the governance structure Non-profit, university, etc. Board of directors, volunteer steering committee, government group Explain the funding model to sustain operations Individual - Pay to put data in, pay to take data out, per use, subscription Organization – long-term funding?

Incentivizing Data Sharing Need to develop, implement, and refine rules aimed at enforcing data sharing on federally funded work wherever possible using a combination of requirements from federal agencies, requirements from publishers/editors, and community (e.g., professional societies) developed and agreed practices. Need to develop, implement, and refine incentives aimed at encouraging data sharing that might include improved productivity, enhanced data longevity and utility, more citations compared to traditional publishing, rewards/recognition from peers, and more funding from federal agencies. Portfolio of case studies illustrating the benefits of data sharing.

Innovative Data Creation and Data Fusion Approaches The main discussion of this group were on new research topics and approaches needed to tackle many of the challenges with emerging forms of data, and in building repositories and data services using such data. New approaches to doing science – crowdsourcing, grand challenges, multidisciplinary research – for problem-understanding, generating solutions and knowledge discovery Research and applications around capturing, discovering emerging forms of data particularly geospatial data and the privacy and information security issues around sharing such data

Metadata, Vocabulary, & Workflow Tools for Discovery Address need for commonly used words and describe approach for building and evolving terms as well as their relationships. Incorporate electronic lab notebooks with complementary tools to capture and associate the full workflow with data and publications. Employ approaches to search and query for specific parameters across multiple distributed data.

Using Data Management Plans and existing NSF data centers Can (or should) existing NSF data centers, or other data repositories, be used in this regard? There are possibilities, but currently limited Not really a NSF based center solution for material sciences Perhaps something like EarthCube is the solution here NHERI based center (DesignSafe) working with XSEDE (UT-TACC) is a potential solution Address some data issues (storage and work space; potentially (?) addressing integration, meta data standards, confidentiality, proprietary data) May not be a solution for all areas of infrastructure Are such data centers a necessary condition for formulating a reasonable data management plan? No, but clearly would be helpful Range of possibilities: Repository function Data preparation and processing for the community Data integration and linkage