EFGS Vienna, 11 November 2015, Mr. Niek van Leeuwen, Non disclosure rules applied on non-hierarchical small areas.

Slides:



Advertisements
Similar presentations
Statistical Disclosure Control (SDC) at SURS Andreja Smukavec General Methodology and Standards Sector.
Advertisements

Chapter 3 L3-4 Notes: Estimating Sums & Differences.
STATISTICS ELEMENTARY MARIO F. TRIOLA
Eurostat Statistical Disclosure Control. Presented by Peter-Paul de Wolf, Statistics Netherlands (CBS)
1 How to make a selection table like the one below from SIENA output. The table shows contributions to the objective function for highest / lowest possible.
IMPROVING CONFIDENTIALITY WITH tau-ARGUS BY FOCUSSING ON CLEVER USAGE OF MICRODATA Roland van der Meijden MSc. ± 10 minutes.
Statistics 1: Introduction to Probability and Statistics Section 3-3.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 9/6/12 Describing Data: One Variable SECTIONS 2.1, 2.2, 2.3, 2.4 One categorical.
GRIPS Mask Design Rationale (1) Grid Pitch Parameters Minimum pitch, P min = 1.0 mm (set by detector resolution) Want a finite number, N v, of discrete.
Intersection of Graphs. Example (1) Why we get two answers, when actually the graphs intersect at only one point?
MR. REY BELEN PRICE ELASTICITY OF SUPPLY. Supply Price and Quantity Demanded are directly related to each other. An increase in price causes the quantity.
5-Minute Check Round to the nearest whole number
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall. Chapter 1 The Whole Numbers.
Statistics Class 4 February 11th , 2012.
12. Feb.2010 | Christian Müller Distributed Resource Allocation in OFDMA-Based Relay Networks Christian Müller.
Chapter 3 Descriptive Measures
The process of replacing a number by another number that is close to the same value. What’s Our Rule for the Boss?
Basic Statistics. Scales of measurement Nominal The one that has names Ordinal Rank ordered Interval Equal differences in the scores Ratio Has a true.
1.1 EXPLORING STATISTICAL QUESTIONS Unit 1 Data Displays and Number Systems.
The Group Lasso for Logistic Regression Lukas Meier, Sara van de Geer and Peter Bühlmann Presenter: Lu Ren ECE Dept., Duke University Sept. 19, 2008.
Today Ensemble Methods. Recap of the course. Classifier Fusion
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
1 Duplicate Analyzer Exercises. 2 Installation and Initial Configuration: Exercises Exercises 1.Install Duplicate Analyzer on your local PC. 2.Configure.
Chapter 5: Measures of Variability  The Importance of Measuring Variability  IQV (Index of Qualitative Variation)  The Range  IQR (Inter-Quartile Range)
Extent and Mask Extent of original data Extent of analysis area Mask – areas of interest Remember all rasters are rectangles.
Agenda Descriptive Statistics Measures of Spread - Variability.
Delineating population clusters Applying grid methods Niek van Leeuwen Statistics Netherlands
1-5 Adding Integers There are three parts to today's lesson: Adding positive integers Adding negative integers Adding a positive integer and a negative.
MCAS Math Terms. Add The process of finding the total number of items when two or more groups of items are joined; the opposite operation of subtraction.
Unit 2 Estimation and Computation. Use the following number 5, to answer questions 1 and What digit is in the hundreds place? a. 5 b. 6.
The Mean : A measure of centre The mean (often referred to as the average). Mean = sum of values total number of values.
5-Minute Check Round to the nearest whole number Round to the nearest tenth
EFGS Vienna, 12 november 2015, Mr. Niek van Leeuwen, Natural Capital Accounting Ecosystem Unit map of the Netherlands.
Place your chosen image here. The four corners must just cover the arrow tips. For covers, the three pictures should be the same size and in a straight.
Round any whole number to a required degree of accuracy.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
Estimating Decimals Textbook page 96. What is clustering? Clustering means rounding the numbers to the same value. The numbers 18, 21, and 19 are all.
Foundations of Math I: Unit 3 - Statistics Arithmetic average Median: Middle of the data listed in ascending order (use if there is an outlier) Mode: Most.
Uncertainty2 Types of Uncertainties Random Uncertainties: result from the randomness of measuring instruments. They can be dealt with by making repeated.
Unit 2 Estimation and Computation. Use the following number 5, to answer questions 1 and What digit is in the hundreds place? a. 5 b. 6.
 A standardized value  A number of standard deviations a given value, x, is above or below the mean  z = (score (x) – mean)/s (standard deviation)
9d. Pie Charts CSCI N207 Data Analysis Using Spreadsheet Department of Computer and Information Science, IUPUI Lingma Acheson
STATISTICS Chapter 2 and and 2.2: Review of Basic Statistics Topics covered today:  Mean, Median, Mode  5 number summary and box plot  Interquartile.
Estimation Whole numbers, Sums, Differences, Products, and Quotients.
11 Measuring Disclosure Risk and Data Utility for Flexible Table Generators Natalie Shlomo, Laszlo Antal, Mark Elliot University of Manchester
Presentation Outline What is microinsurance Importance of data
Statistical Reasoning for everyday life
Explanation Answer Code Question A J M X D H P S K L B C X 4
Frequency Distributions
Session 7 – Data aggregation and visualisation
Definite Integrals and Antiderivatives
Establishing an Automated Confidentiality Service in Stats NZ
Treatment of statistical confidentiality Table protection using Excel and tau-Argus Practical course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER.
Treatment of statistical confidentiality Table protection using Excel and tau-Argus Practical course Trainer: Felix Ritchie CONTRACTOR IS ACTING UNDER.
21a. Temporary workers as a percentage of the workforce, construction versus non-construction,
Lucas in NL Harmonised Land Use/ Land Cover Statistics, Part A
Service- and business areas
Estimating Sums and Differences of Fractions and Mixed Numbers
Exhibit 8.5 Percentage of Covered Workers Enrolled in an HDHP/HRA or an HSA-Qualified HDHP, * Estimate is statistically different from estimate.
Math Journal Notes Unit 5.
Exhibit 7.8 Percentage of Covered Workers Enrolled in a Plan with a High General Annual Deductible for Single Coverage, by Firm Size, 2016 * Estimate is.
Exhibit 7.12 Percentage of Covered Workers Enrolled in a Plan with a General Annual Deductible of $2,000 or More for Single Coverage, by Firm Size,
Numerical Descriptive Measures
Prepared by: C.Cichanowicz, March 2011
Other urban data collections
Exhibit 7.9 Percentage of Covered Workers Enrolled in a Plan with a General Annual Deductible of $1,000 or More for Single Coverage, By Firm Size,
PCS Day – Feedback Results
Discuss with a partner and record your thoughts.
Workpackage Infrastructure
Presentation transcript:

EFGS Vienna, 11 November 2015, Mr. Niek van Leeuwen, Non disclosure rules applied on non-hierarchical small areas

Regular non disclosure rules Minimum number. N>= 3. Dominance rule on total sum: Size is identifying. ( Total sum – Max -1 – Max ) X% Average: Equality is identifying. abs(Max – Max (estimate) ) > p * Max / 100, p > X% Max (estimate) = Total sum – (N -1 ) * Max -1 2

Regular non disclosure rules 3 Secondary non-disclosure. Combination with larger, hierarchical areas

Different small areas 4 Disclosure when combined with publically available sources. NL: The base register on addresses (PC) and buildings (coordinates, grid). But there is a larger demand for small area statistics for different types of areas. -Grids -Postal codes

Types of intersection 5 Differencing whole coverage Differencing partly covered Z-type R-type

Types of intersection 6

Results so far 7 Iterative approach, initially: -Frequencies: Rounding on 5. -Percentages; Rounding on 10%, minimum 10 persons/dwelling. -No non disclosure of groups applied.

Results so far 8 Grid 100 meter Postal code Data after non disclosure Combination Z-type

Optimising, different scenario’s 9

Optimising 10 Promising scenario’s so far -Suppress small numbers less than 5. -Rounding frequencies to 5. -Percentages rounding to 10%, minimum = 10 persons. -Small numbers? 10 persons total and 10% score is 1 person! -Diminishing number of categories. Introducing 5 to 7 categories.

Future Generic rules disclosure controle for combinations of non hierarchical areas Report 2016, Q2/Q3. 11

12