Census Data for Transportation Planning—Some Thoughts

Slides:



Advertisements
Similar presentations
STRENGTHENING FINANCING FOR DEVELOPMENT: PROPOSALS FROM THE PRIVATE SECTOR Compiled by the UN-Sanctioned Business Interlocutors to the International Conference.
Advertisements

Characterization and Management of Multiple Components of Cost and Risk in Disclosure Protection for Establishment Surveys Discussion of Advances in Disclosure.
Statistical Disclosure Control (SDC) at SURS Andreja Smukavec General Methodology and Standards Sector.
Business microdata dissemination at Istat Daniela Ichim Luisa Franconi
29e CONFÉRENCE INTERNATIONALE DES COMMISSAIRES À LA PROTECTION DES DONNÉES ET DE LA VIE PRIVÉE 29 th INTERNATIONAL DATA PROTECTION AND PRIVACY COMMISSIONERS.
Introduction to Clinical Ethics Ben Faneye, OP, DHCE West African Bioethics Training Program.
Statistical database security Special purpose: used only for statistical computations. General purpose: used with normal queries (and updates) as well.
Information Systems Controls for System Reliability -Information Security-
Session 16: Distribution of Geospatial Data 1 Distribution of Geospatial Data in the Public Environment Hazard Mapping and Modeling.
1. “Software for Tabular Data Protection” Joe Fred Gonzalez, Jr. Lawrence H. Cox National Center for Health Statistics NCHS Data Users Conference July.
Confidentiality Issues with “Small Cell” Data Michael C. Samuel, DrPH STD Control Branch California Department of Public Health 2008 National STD Prevention.
Disclosure risk when responding to queries with deterministic guarantees Krish Muralidhar University of Kentucky Rathindra Sarathy Oklahoma State University.
ITU CoE/ARB 11 th Annual Meeting of the Arab Network for Human Resources 16 – 18 December 2003; Khartoum - Sudan 1 The content is based on New OECD Guidelines.
Privacy vs. Utility Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
The views expressed herein are those of the author and should not necessarily be attributed to the IMF, its Executive Board, or its management Data Confidentiality,
Disclosure Risk and Grid Computing Mark Elliot, Kingsley Purdam, Duncan Smith and Stephan Pickles CCSR, University of Manchester
Paper III Qualitative research methodology. Objective 1.4 Discuss ethical considerations in qualitative research.
Ethical Guidelines in Research Ethics refers to doing what is morally and legally right in conducting research. Research ethics deals primarily with the.
The Review of the Dissemination of Health Statistics Carole Abrahams Office for National Statistics.
Census 2011 – A Question of Confidentiality Statistical Disclosure control for the 2011 Census Carole Abrahams ONS Methodology BSPS – York, September 2011.
Big Data Analytics Are we at risk? Dr. Csilla Farkas Director Center for Information Assurance Engineering (CIAE) Department of Computer Science and Engineering.
The London Health Observatory: monitoring health and health care in the capital, supporting practitioners and informing decision-makers Disclosure control.
ISO17799 / BS ISO / BS Introduction Information security has always been a major challenge to most organizations. Computer infections.
Expanding the Role of Synthetic Data at the U.S. Census Bureau 59 th ISI World Statistics Congress August 28 th, 2013 By Ron S. Jarmin U.S. Census Bureau.
Principles of Good Governance
Chapter 7: Modifiability
Chapter 7 Group Counseling
Introduction paragraph – what looking to investigate.
Decade of Roma Inclusion Progress monitoring
Development of Strategies for Census Data Dissemination
“Software for Tabular Data Protection”
Data Accessibility, Confidentiality and Copyright United Nations Statistics Division Demographic Statistics Section.
Julia Lane New York University
Issues regarding effective use of ICT resources
Security.
of Land and Natural Resource Conflict
Dissemination Workshop for African countries on the Implementation of International Recommendations for Distributive Trade Statistics May 2008,
Jamaica’s Drought tool
"Development of Strategies for Census Data Dissemination".
CIS 333Competitive Success/tutorialrank.com
CIS 333 Education for Service-- tutorialrank.com.
Writing Ethics/ Writing Rights
Introduction to Cyber Security
Looking at Data - Relationships Data analysis for two-way tables
Training of Trainers Workshop
Confidentiality in data dissemination
Research Ethics and Integrity Officer
Data Quality By Suparna Kansakar.
Why are you collecting data in the first place
Ethical questions on the use of big data in official statistics
Improving and Using Family Survey Data
Questions for break-out sessions GROUP 2 messages Participants : state administrations in charge of MSFD and/or WFD, ESA and GES experts, shipping industry,
Enhancing statistical practices to improve data sharing
Information technologies/NBIC and Big data
The activity of Art. 29. Working Party György Halmos
Quality, efficiency and productivity: a challenge for official statistics EFTA/CROSTAT/EUROSTAT Strategic Management Seminar, Split, November 2007.
Introduction to Health Privacy
On data accessibility and confidentiality……..
International Statistics
Assessment Literacy: Test Purpose and Use
Federal Statistical Office Germany Research Data Centre
Jan Byfuglien Statistics Norway
Item 2.2 Use of waivers in business statistics
Item 5 Wim Kloek, Eurostat
Open Data Sharing and its Statistical Limitations
Some contents are borrowed from Adam Smith’s slides
Shasta CCD Board Retreat CEO Search, Accreditation & Student Success
Differential Privacy (1)
Imputation as a Practical Alternative to Data Swapping
Presentation transcript:

Census Data for Transportation Planning—Some Thoughts George Duncan 11/15/2018

Three Themes Confidentiality Disclosure limitation methods Multiple sources of Information 11/15/2018

Confidentiality Identified by Jay Waite of Census More restrictions? Concern of George Schoener of DOT Repeated theme throughout conference 11/15/2018

Privacy “Privacy is dead, deal with it,” Sun MicroSystems CEO Scott McNealy But like the rattlesnake, it can bite you … 11/15/2018

Why Confidentiality Matters Ethical: Keeping promises; basic value tied to privacy concerns of solitude, autonomy and individuality Pragmatic: Without confidentiality, respondent may not provide data; worse, may provide inaccurate data Legal: Required under law Fundamental to the workings of a democratic system and its respect for the individual. Pragmatically, patients fear not getting it. Without assurances of confidentiality will some patients shy from providing information about potentially embarrassing conditions or activities? 11/15/2018

Restricted Access Restricted Data There is an inverse relationship between restricted data and restricted access: as data restrictions increase, fewer restrictions on access are needed, and vice versa. Some user needs cannot be met with restricted data, however, because the data transformation required to ensure data confidentiality is so extreme that the restricted data are useless for inference purposes. In response, an agency may allow more restricted access to less restricted data. Conversely, to ensure confidentiality, access may have to be so restricted that a legitimate user cannot, as a practical matter, obtain the data. Again in response, an agency may allow less restricted access to more restricted data. Confidentiality protection shields must be both effective and faithful to the original data. 11/15/2018

R-U Confidentiality Map Data Utility U Disclosure Risk R Original Data Maximum Tolerable Risk Released Data Disclosure Risk: Information about Confidential Items Data Utility: Information about legitimate objects (DATA) No Data 11/15/2018

R-U Confidentiality Map Attention from Census Data Utility U Disclosure Risk R Original Data Maximum Tolerable Risk Attention from Transportation Community Disclosure Risk: Information about Confidential Items Data Utility: Information about legitimate objects (DATA) No Data 11/15/2018

Questions for Joint Consideration Does old data decay and so pose less disclosure risk? Can providing seasonal data increase data utility without increasing disclosure risk? How can the transportation community be educated about the impact of disclosure limitation? What specifically causes increased disclosure risk? How to improve access to Research Data Centers? 11/15/2018

Disclosure Limitation for Tables Coarsening Aggregate attributes Suppress some cells Publish only the marginal totals Suppress the sensitive cells, plus others as necessary Perturb some cells Round Fuzz 11/15/2018

Perturbation Methods Controlled rounding (Cox) Cyclic perturbation (Duncan & Roehrig) Stochastically modify cell values in a known way, allowing a Bayesian analysis of cell value distributions 11/15/2018

Multiple Sources of Information Alan Pisarski, Thomas Palumbo Research and Academic Breakout Group Session 8 “Mixing apples and oranges produces fruit salad” “Institutionalize creative quilting” (Nancy McGuckin) 11/15/2018

“Institutionalize Creative Quilting” Quilting is the systematic use of multiple data sources Creative means drawing on the available resources to address a particular problem Institutionalize means establishing an apparatus whereby creative quilting can consistently happen 11/15/2018

Quilting ACS can’t provide all transportation planning data needs—national data not be all and end all Integrate with other data sets to maximize combined utility to support decision making Complement with NHTS, private sector databases, public domain property data, satellite imagery, etc Build in dynamic databases that assess changing world—close to real-time data Need new methodologies to produce a high-quality data quilt (data stitching) 11/15/2018

Creative Constrain focus to particular decision making and policy needs Search broadly and be imaginative in use of data products 11/15/2018

Institutionalize Not left to individual planner/researcher Identify commonalities and differences among data users, say MPOs Find representation for users with common needs, say AASHTO Establish ongoing links with information organizations, say Census 11/15/2018

Scenario of Potential Several MPOs have similar data needs AASHTO solicits data needs and formulates a “quilt” to cover them AASHTO negotiates with Census for way to develop disclosure limited data product Access to Census Research Data Center to validate statistical inferences from original data (“red light”/”green light”) 11/15/2018

Opportunities and Threats Opportunity: Together, Census and Transportation support respective institutional needs for confidentiality and access to quality data Threat: Divided, Census and Transportation fall to privacy appeals or inability to cope with methodological challenges of disclosure limitation and complex, new data sources 11/15/2018