Presentation is loading. Please wait.

Presentation is loading. Please wait.

Synthetic Data within the Risk – Utility Framework Keith Spicer Office for National Statistics.

Similar presentations


Presentation on theme: "Synthetic Data within the Risk – Utility Framework Keith Spicer Office for National Statistics."— Presentation transcript:

1 Synthetic Data within the Risk – Utility Framework Keith Spicer Office for National Statistics

2 Microdata Product Range Disclosure Risk of Dataset Data Utility High Low AR Desktop Access Safeguarded Licensing OGL / Public Use NOT PERSONAL INFORMATION AR On Site Access (VML or SDS) Level at which data become Personal Information PERSONAL INFORMATION Access for Approved Researchers only 2

3 Microdata Product Range Disclosure Risk of Dataset Data Utility High Low AR Desktop Access Safeguarded Licensing OGL / Public Use NOT PERSONAL INFORMATION AR On Site Access (VML or SDS) Level at which data become Personal Information PERSONAL INFORMATION Access for Approved Researchers only 3 Target Area for Synthetic Data

4 Utility Framework only considers the utility of the data in a research context Synthesis creates microdata that are not personal information (key assumption) Ease of Access Training Testing code (prior to access of Personal Information)

5 Synthesised data for research Goal to retain research utility while reducing disclosure risk At what point do real data become synthetic? What methods are specifically ‘synthesising’ methods? Is synthesising just doing lots of SDC – but in a smart “utility retaining” way?


Download ppt "Synthetic Data within the Risk – Utility Framework Keith Spicer Office for National Statistics."

Similar presentations


Ads by Google