Download presentation
Presentation is loading. Please wait.
Published byAlvin Jacobs Modified over 9 years ago
1
Operationalising ‘safe statistics’ the case of linear regression Felix Ritchie Bristol Business School, University of the West of England, Bristol
2
Background: output SDC Safe statistics in principle Making it work: regression and totals What does ‘safe’ mean? Plan
3
Researchers increasingly using very sensitive data ‘Traditional’ SDC research (tables and anonymisation) of limited relevance Need rules for generalised output ‘Output SDC’ Ideally, principles-based Background: output SDC
4
How do you devise guidelines for output when everything is possible? The ‘research zoo’ –Separate lions from rabbits –Focus on the lions –Forget about the rabbits Making output SDC manageable
5
Define a statistic (sum, regression, odds ratio, index etc) as ‘safe’ or ‘unsafe’ –safe: release unless there’s a reason –unsafe: don’t release unless shown to be safe in the specific context SDC efforts concentrated on problematic output ‘Safe statistics’
6
1.Define the functional form 2.Identify the disclosure potential 1.Can it directly reveal a single data point? 2.Can it be differenced? 3.Anything else? 3.If provisionally ‘safe’, identify special cases 4.Draft guidelines Categorising ‘safe statistics’
7
Example: regression coefficients
8
Example: total
9
Safe statistics decision chart
10
We can’t –nor are we trying to ‘safe’ = ‘for all practical purposes posing no significant disclosure risk’ ‘unsafe’ = ‘high risk; spend time on this’ How can we say X is unconditionally ‘safe’?
11
Everything has a theoretical risk Resources are limited Overall risk protection is maximised by concentrating on real risks …and also the non-experienced Risk assessment for grown-ups
12
Questions
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.