Transition to Practice. We define “Transition to Practice” as making privacy tools and systems operational.

Slides:



Advertisements
Similar presentations
Pulan Yu School of Informatics Indiana University Bloomington Web service based Varuna.Net.
Advertisements

The Data Curation Profile IASSIST 2010 Jake Carlson Data Research Scientist Purdue University Libraries.
Current Developments in Differential Privacy Salil Vadhan Center for Research on Computation & Society School of Engineering & Applied Sciences Harvard.
MAHI Research Database Project Status Report August 9, 2001.
Web Development Process Description
SAKAI February What is SAKAI? Sakai ≠ Course Management System Sakai = Collaboration & Learning Environment.
DATAVERSE FOR JOURNALS Mercè Crosas, Ph.D. Director of Data Science IQSS, Harvard Society for Scholarly Publishing 37 th Meeting,
I.Boulder Community GSI Code Repository II.GSI Regression Test Suite III.GSI Code Release IV.GSI Community Support V.Collaborative Efforts.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Configuration Management (CM)
Informing a data revolution right data, to the right people, at the right time, in the right format The IDR road map Bangkok Regional Workshop July 17-18,2014.
PDS4 and Build 5a Update Dan Crichton, Emily Law November
NMI End-to-End Diagnostic Advisory Group BoF Fall 2003 Internet2 Member Meeting.
Jamie Hall (ILL). SciencePAD Persistent Identifiers Workshop PANData Software Catalogue January 30th 2013 Jamie Hall Developer IT Services, Institut Laue-Langevin.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
Orientation Webinar July 10, 2013 University of California Libraries Systemwide Advisory Structure.
Data Publishing with Dataverse Mercè Crosas, Ph.D. Director of Data Science Institute for Quantitative Social Science, Harvard University.
Community Codes Free and shared resource Ongoing distributed development by both research and operational communities – Maintained under version control.
PRIVACY TOOLS FOR SHARING RESEARCH DATA NSF site visit October 19, 2015 Salil Vadhan Supported by the NSF Secure & Trustworthy Cyberspace (SaTC) program,
Contract Year 1 Review IMT Tilt Thompkins MOS - NCSA 15 May 2002.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
GraDS MacroGrid Carl Kesselman USC/Information Sciences Institute.
Data Citation Dataverse Mercè Crosas Chief Data Science and Technology Officer, IQSS, Harvard Workshop: Data Citation.
Tools Report Engineering Node March 2007
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
Project Execution Methodology
Jan 2016 Solar Lunar Data.
An Overview of Data-PASS Shared Catalog
AGILE PRODUCT ROADMAP AGILE PRODUCT ROADMAP
Average Monthly Temperature and Rainfall
Apr-Jun Jan-Mar Jul-Sep Oct-Dec

Introduction to D4Science
RC Ohio Valley Region Strategic and Annual Planning Process January – September 2016 Notes Fr John says greeting and reflects on 75 anniversary and then.





How to Design and Implement Research Outputs Repositories

Gantt Chart Enter Year Here Activities Jan Feb Mar Apr May Jun Jul Aug
Q1 Q2 Q3 Q4 PRODUCT ROADMAP TITLE Roadmap Tagline MILESTONE MILESTONE

FUTURE PLANS NSF site visit October 19, 2015 Salil Vadhan

FUTURE PLANS NSF site visit October 19, 2015 Salil Vadhan

Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Q1 Q2 Q3 Q4 PRODUCT ROADMAP TITLE Roadmap Tagline MILESTONE MILESTONE
DEEDS A Platform for Sharing Data, Computing & Scientific Workflows
Dataverse for citing and sharing research data
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
  1-A) How would Arctic science benefit from an improved GIS?
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3



Core 5: Training Randy Gollub, MD PhD Guido Gerig, PhD
Q1 Q2 Q3 Q4 PRODUCT ROADMAP TITLE Roadmap Tagline MILESTONE MILESTONE
OpenDP: A Pitch for a Community Effort
Presentation transcript:

Transition to Practice

We define “Transition to Practice” as making privacy tools and systems operational

What do we mean by “operational”? Make the tools and systems available and usable to social scientists, legal scholars, and practitioners. Move from a research prototype to a production system that is maintained and supported beyond the duration of this project. Distribute the software to allow others use the tools and systems for their needs, with their own environments.

Parallel Efforts on Systems and Tools DataTags Repositories Repositories that allow sharing or publishing sensitive data and support two or more datatags. Technology Science Dataverse Codified Legal Knowledge Scalable ways to digest privacy regulations to assign a datatag to a dataset and generate a license Rule-based model Decision-tree model DataTagging interview License generator Crowd-sourcing wiki Applied Differential Privacy Tools that enable to learn about a sensitive dataset while preserving its privacy. Curator Tools for: Descriptive Stats Causal Inference Regression Queries

In : Towards Transition to Practice Operational: Technology Science supports six datags, from blue to crimson. Dataverse supports four datatags, from blue to orange. Prototyping: Differential Privacy Curator Tool: Privacy Budget, Descriptive Statistics, Causal Inference Rule-Based model Decision-tree model DataTagging Interview Crowd-sourcing wiki

End-to-End Systems Decision-Tree Language Generator Logic Programming/ Legal Model DataTags Interview Tool Access Control Knowledge Acquisition Data Ingestion In : Attempt to Make All Tools Operational Secure Infrastructure Knowledge Codification Data Retrieval Legal Scholars IRB License Generator Tool Privacy Budget Tool Privacy Analysis Tool

“Transition to Practice” proposal for Year 5 ( ) Dataverse Repository Full DataTags support in Harvard Dataverse (implement the rest of the tags for high-sensitive data: red, crimson). Software distribution for other Dataverse installations (outside Harvard). Technology Science Repository Software distribution for standalone uses. Knowledge base compiled from Technology Science data sharing decisions. DataTagging Tool and License Generator Tool Integration with Dataverse (works with Harvard Dataverse repository). Privacy Curator Tool Integration with Dataverse (works with Harvard Dataverse repository).

Transition to Practice: Dataverse with DataTags Oct - Dec, 2016Jan - Mar, 2017 Apr - Jun, 2017 Jul - Sep, 2017 Requirements for Dataverse compliance with red datatag. Collaboration with IRB and research groups to test DataTagging and Licenses with real data. Implementation of red compliant Dataverse. Integrate DataTagging Interview and License Generator with test Dataverse Requirements for Dataverse compliance with crimson datatag. User testing with real data and practitioners, Usability testing. Implementation of crimson compliant Dataverse. Incorporate testing feedback and integrate DataTagging Interview and License Generator with production Dataverse. What’s expected by the end of Sept 2017? End-to-end support in Dataverse to share sensitive data, including high- sensitive data Integrate with DataTagging Interview and License Generator tools. Year 5

Transition to Practice: Technology Science Oct - Dec, 2016Jan - Mar, 2017 Apr - Jun, 2017 Jul - Sep, 2017 Technology Science implementing and using all levels. Software standalone distribution, alpha version, usability testing. Software standalone distribution, beta version, usability testing. Knowledgebase with software tools, alpha version. Software used at two independent sites. Knowledgebase software with software tools, beta version, released. What’s expected by the end of Sept 2017? Standalone software to download and use for independent repositories. Knowledgebase of Technology Science data sharing decisions with tools. Year 5

Transition to Practice: Differential Privacy Curator Integrating with Dataverse test and preliminary user testing Defining Relationships between Tags and Curator Access Interactive Query UI Development Security Analysis and Architecture Integration with Dataverse production for low sensitive data. Interactive Query UI In Production Community Outreach including Architecture for Extensibility Integration with Dataverse for high sensitive data. Extensive user testing feedback of workflow Continued Community Outreach and Extensibility Engineering Incorporate user testing feedback. Tutorials and Webinars Oct - Dec, 2016Jan - Mar, 2017 Apr - Jun, 2017 Jul - Sept, 2017 What’s expected by the end of Sept 2017? Complete coupled workflow across all tools for privacy preservation. Year 5