Transition to Practice
We define “Transition to Practice” as making privacy tools and systems operational
What do we mean by “operational”? Make the tools and systems available and usable to social scientists, legal scholars, and practitioners. Move from a research prototype to a production system that is maintained and supported beyond the duration of this project. Distribute the software to allow others use the tools and systems for their needs, with their own environments.
Parallel Efforts on Systems and Tools DataTags Repositories Repositories that allow sharing or publishing sensitive data and support two or more datatags. Technology Science Dataverse Codified Legal Knowledge Scalable ways to digest privacy regulations to assign a datatag to a dataset and generate a license Rule-based model Decision-tree model DataTagging interview License generator Crowd-sourcing wiki Applied Differential Privacy Tools that enable to learn about a sensitive dataset while preserving its privacy. Curator Tools for: Descriptive Stats Causal Inference Regression Queries
In : Towards Transition to Practice Operational: Technology Science supports six datags, from blue to crimson. Dataverse supports four datatags, from blue to orange. Prototyping: Differential Privacy Curator Tool: Privacy Budget, Descriptive Statistics, Causal Inference Rule-Based model Decision-tree model DataTagging Interview Crowd-sourcing wiki
End-to-End Systems Decision-Tree Language Generator Logic Programming/ Legal Model DataTags Interview Tool Access Control Knowledge Acquisition Data Ingestion In : Attempt to Make All Tools Operational Secure Infrastructure Knowledge Codification Data Retrieval Legal Scholars IRB License Generator Tool Privacy Budget Tool Privacy Analysis Tool
“Transition to Practice” proposal for Year 5 ( ) Dataverse Repository Full DataTags support in Harvard Dataverse (implement the rest of the tags for high-sensitive data: red, crimson). Software distribution for other Dataverse installations (outside Harvard). Technology Science Repository Software distribution for standalone uses. Knowledge base compiled from Technology Science data sharing decisions. DataTagging Tool and License Generator Tool Integration with Dataverse (works with Harvard Dataverse repository). Privacy Curator Tool Integration with Dataverse (works with Harvard Dataverse repository).
Transition to Practice: Dataverse with DataTags Oct - Dec, 2016Jan - Mar, 2017 Apr - Jun, 2017 Jul - Sep, 2017 Requirements for Dataverse compliance with red datatag. Collaboration with IRB and research groups to test DataTagging and Licenses with real data. Implementation of red compliant Dataverse. Integrate DataTagging Interview and License Generator with test Dataverse Requirements for Dataverse compliance with crimson datatag. User testing with real data and practitioners, Usability testing. Implementation of crimson compliant Dataverse. Incorporate testing feedback and integrate DataTagging Interview and License Generator with production Dataverse. What’s expected by the end of Sept 2017? End-to-end support in Dataverse to share sensitive data, including high- sensitive data Integrate with DataTagging Interview and License Generator tools. Year 5
Transition to Practice: Technology Science Oct - Dec, 2016Jan - Mar, 2017 Apr - Jun, 2017 Jul - Sep, 2017 Technology Science implementing and using all levels. Software standalone distribution, alpha version, usability testing. Software standalone distribution, beta version, usability testing. Knowledgebase with software tools, alpha version. Software used at two independent sites. Knowledgebase software with software tools, beta version, released. What’s expected by the end of Sept 2017? Standalone software to download and use for independent repositories. Knowledgebase of Technology Science data sharing decisions with tools. Year 5
Transition to Practice: Differential Privacy Curator Integrating with Dataverse test and preliminary user testing Defining Relationships between Tags and Curator Access Interactive Query UI Development Security Analysis and Architecture Integration with Dataverse production for low sensitive data. Interactive Query UI In Production Community Outreach including Architecture for Extensibility Integration with Dataverse for high sensitive data. Extensive user testing feedback of workflow Continued Community Outreach and Extensibility Engineering Incorporate user testing feedback. Tutorials and Webinars Oct - Dec, 2016Jan - Mar, 2017 Apr - Jun, 2017 Jul - Sept, 2017 What’s expected by the end of Sept 2017? Complete coupled workflow across all tools for privacy preservation. Year 5