1 Sarah Franklin October 30 th, 2013 Disclosure Control for Tables of Frequency Counts using Administrative Justice Data.

Slides:



Advertisements
Similar presentations
Defining and Measuring Crime Chapter 3. To teach the social expectations of society To protect citizens from criminal harm and punish wrong doers To express.
Advertisements

Statistical Disclosure Control (SDC) at SURS Andreja Smukavec General Methodology and Standards Sector.
Class Name, Instructor Name Date, Semester Chapter 2 The Crime Picture Criminal Justice Today.
Campus Security Authority Training “University police and campus security authorities must report crimes in the annual Uniform Campus Crime & Fire.
The Evolution of Measuring Violence Against Women at Statistics Canada UN Global Forum on Gender Statistics December 10-12, 2007 Presented by Heather Dryburgh.
Copyright 2010, The World Bank Group. All Rights Reserved. Police Statistics, Crime, Criminals and Resources Part 1 Crime, Justice & Security Statistics.
What is a Crime? Part 1 Offenses The Major Crimes.
MILWAUKEE POLICE DEPARTMENT Chief Edward A. Flynn MILWAUKEE POLICE DEPARTMENT Chief Edward A. Flynn.
Law III Chapter Two: The nature and extent of crime.
CONTROVERSY: Will gun control make us safe? Debunking the myths.
Crime & Deviance Part 2: Crime & Capital Punishment.
Crime Chapter 8 Section 2. Crime Prohibited by law Punishable by the government.
Chuck Humphrey Data Library University of Alberta.
Study of Virginia’s Parole- Eligible Inmate Population.
Chapter 2 Crime and Criminals Irwin/McGraw-Hill
Copyright © 2006 Pearson Education Canada Inc Chapter 8 Violent Crimes “To all of us crime seems like violence” K. Menninger, ’68:157.
Offences against the person
S AFE C ITY M ISSISSAUGA C RIME SEVERITY INDEX  New method of quantifying crime  Attaches higher weights to major crimes and lower weights to.
Geo-referenced data and DLI aggregate data sources Chuck Humphrey University of Alberta September 29, 2008.
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 26, 2009.
Chuck Humphrey & Lynne Robinson University of Alberta Surviving Statistics Strategies for dealing with statistical questions on the reference desk.
Violent Crimes “To all of us crime seems like violence” K. Menninger, ’68:157.
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library March 6, 2009.
Statistics and Data for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 27, 2008.
EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008.
Crime Victims: An Introduction to Victimology Seventh Edition
1 Book Cover Here Chapter 18 ROBBERY Criminal Investigation: A Method for Reconstructing the Past, 7 th Edition Copyright © 2014, Elsevier Inc. All Rights.
Geo-referenced data and DLI aggregate data sources Chuck Humphrey University of Alberta ACCOLEDS 2007.
1 Measuring violence against women: The Canadian experience François Nault Director, Statistics Canada November 2013.
The Crime Scene Justice Data and the Case of Multiple Files in GSS 18 Chuck Humphrey University of Alberta Atlantic DLI Workshop April 20-21, 2006.
Justice “Canada's criminal justice system is a complex network of independent but procedurally connected police, prosecutors, courts, correctional agencies,
Ontario DLI Training April 12, 2005 Jillian Oderkirk and Shelley Crego.
Canadian Health Measures Survey (CHMS) Biobank Joanne Boisjoli Health Statistics Division Statistics Canada 1 Statistics Canada Statistique Canada.
An Estimation of the Economic Impact of Spousal Violence in Canada, 2009 Research and Statistics Division Department of Justice Canada October 2013.
The Crime Picture Chapter 2 Frank Schmalleger Criminal Justice Today 13 th Edition.
Screening Data for Disclosure Risk and the Research behind One Possible Tool Kristine Witkowski Research support from the National Institute of Child Health.
Copyright © 2008 Pearson Education Canada Inc Crime Statistics Chapter 2.
Disclosure Avoidance: An Overview Irene Wong ACCOLEDS/DLI Training December 8, 2003.
Health Statistics Information on STC website Calgary–DLI training–Dec 2003 Michel B. Séguin, Statistics Canada,
Data and Social Research Chuck Humphrey Data Library Rutherford North Library.
Snohomish County Sheriff’s Office Special Investigations Unit n 98% of our investigations involve crimes where the victim has been assaulted by someone.
1 Access to Justice Data Research Data Centres & Real Time Remote Access Kathy AuCoin Chief Data Access and Data Development August 2013.
Criminal Justice Today Twelfth Edition CHAPTER Criminal Justice Today: An Introductory Text for the 21st Century, 12e Frank Schmalleger Copyright © 2014.
The Census of Canada and Immigration & Ethno-cultural Data Chuck Humphrey University of Alberta February 10, 2006.
Challenges in Collecting Police-Reported Crime Data Colin Babyak Household Survey Methods Division ICES III - Montreal – June 20, 2007.
Learning from MAPPA Significant Case Reviews Bob Thomson.
1 Methods of Measuring Crime Uniform Crime Reports Self- Report Surveys Victim Surveys.
Copyright © 2005 Pearson Education Canada Inc Chapter 2 Crime Statistics.
Chapter 2 Adapted from: Frank Schmalleger’s CRIMINAL JUSTICE TODAY, 9E.PRENTICE HALL, Education Inc. ©2007 Pearson Education, Inc.
Creating Something from Nothing: Synthetic and Dummy files Bo Wandschneider University of Guelph Chuck Humphrey University of Alberta DLI Training: Ottawa,
Catherine Millington Scottish Crime and Justice Survey,
WP 19 Assessment of Statistical Disclosure Control Methods for the 2001 UK Census Natalie Shlomo University of Southampton Office for National Statistics.
Disclosure Avoidance at Statistics Canada INFO747 Session on Confidentiality Protection April 19, 2007 Jean-Louis Tambay, Statistics Canada
Using Targeted Perturbation of Microdata to Protect Against Intelligent Linkage Mark Elliot, University of Manchester Cathie.
DEVELOPMENTS IN AUSTRALIAN CRIME VICTIMISATION SURVEYS.
Women and the Criminal Justice System Women and men have similar overall risks of victimization According to the 1999 General Social Survey (GSS) approximately.
Chapter Two Measurement of Crime and Its Effects.
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
Proposed Recommendations for Guidelines Revisions.
Disclosure Analysis: What do RDC Analysts do? Research Data Centre Program, Statistics Canada James Chowhan Ontario DLI Training, Queen's University
Copyright 2010, The World Bank Group. All Rights Reserved. Prison Statistics Part 1 Crime, Justice & Security Statistics Produced in Collaboration between.
Mapping for the Next Millennium How CrimeRisk™ scores are formed.
Chapter Two CRIME AWARENESS Uniform Crime Reporting System (UCRS) The FBI’s Uniform Crime Reporting System began in U.S. Attorney General authorized.
Soc 332.6: Principles of research design Finding statistics.
Reporting Requirements Under Title IX and The Clery Act
Navigating Your Way Through the EFT, Nesstar and Beyond 20/20 (WDS)
PART 1 UNIFORM CRIME REPORT
Part F-I The Economic Theory of Crime and Punishment
Disclosure Avoidance: An Overview
Gun Control Telling the Emperor he has no clothes Political marketing Academic freedom The NRA and methodology September 11th opened eyes Terrorists,
Presentation transcript:

1 Sarah Franklin October 30 th, 2013 Disclosure Control for Tables of Frequency Counts using Administrative Justice Data

Overview of presentation  In 2013, the Canadian Centre for Justice Statistics (CCJS) placed two administrative crime surveys in the Research Data Centres (RDCs)  Methodologists and subject matter experts developed a scoring approach for tables of frequency counts to identify ‘safe’ tables  Each variable in a dataset is assigned a sensitivity score. A table’s overall score is the sum of the variable scores. If the score is below a given threshold, the table is safe. 2

Uniform Crime Reporting Incident- based Survey (UCR) and the Homicide Survey  Administrative datasets  Mandatory reporting by all police services  Criminal incidents substantiated by the police  UCR is a sample of crime data not all crime comes to the attention of the police over 2 million incidents of crime annually  Homicide Survey data more sensitive than UCR All homicides; 543 homicides in 2012  Information on incident, victim(s), accused(s) 3

UCR, Homicide variables available to researchers  most serious violation for the incident of crime (e.g., homicide, robbery, mischief)  geography (region, province, CMA)  location (e.g., residential home, convenience store)  weapon causing injury (e.g., handgun, knife)  relationship between victim and accused  age and sex of victim and/or accused  clearance status (accused charged vs cleared otherwise) 4

Publicly available STC police- reported crime data UCR and homicide data available to all Canadians:  CANSIM tables (very aggregate)  Tables and graphs appearing in Juristat articles  Custom tabulations upon request 5

Homicides by CMA victimsrate per 100,000 victimsrate per 100,000 Edmonton Toronto St. John’s Montréal Ottawa Kingston0000 Saguenay Trois-Rivières Sherbrooke10.51 Moncton0000 Québec Brantford Canada

2009 RDC Pilot - Homicide Survey  Homicide Survey was available through RDCs  Results positive, 4 proposals submitted and 3 research reports completed  Researchers commented on the ease of use of data file, documentation and wealth of data/information  Researchers noted that vetting of data tables too long  RDC analysts noted that data disclosure rules difficult to implement and required additional work 7

 Statistics Act, paragraph 17(1)(b): No person […] shall disclose […] any information obtained under this Act in such a manner that it is possible from the disclosure to relate the particulars obtained from any individual return to any identifiable individual person, business or organization.  Main disclosure issues: Identity disclosure: can identify an individual Attribute disclosure: learn something new  Group attribute disclosure: learn something about a group Residual disclosure: disclosure by combining results 8 Disclosure Issues : What are we concerned about?

RDC disclosure control rules for tabular administrative justice data  Scoring approach developed by the Institut de la Statistique du Québec and is used by all STC administrative datasets in the RDCs assign a sensitivity score to each variable table’s score = sum of variables’ scores if table score greater than a threshold value, cannot release table  Go back and use more aggregated variables with lower scores Or perform controlled rounding 9

10 Reviewed all variables to appear on the RDC files Identified variables to be excluded due to: unique identifiers name of victim/accused, date of birth of victim/ accused, fingerprint of accused, incident file identifier data quality issues aboriginal variable, firearm information (registered, licensed) too sensitive homicide victim was pregnant, blood alcohol level of homicide victim, person accused of homicide has suspected mental or developmental disorder

11 Aggregated sensitive codes of variables Incident clearance status (UCR, Homicide Survey) suicide of accused → cleared otherwise Most serious violation aggregations Homicide Survey 1 st degree murder, 2 nd degree murder → murder manslaughter, infanticide → other homicides UCR sexual violations against children → other sexual assaults

12 Scored all UCR variables to appear on the RDC files 0 = not sensitive region=national; sex of victim/accused; vehicle type; target vehicle; motor vehicle recovered; fraud type; property stolen; location of incident; attempted vs completed violation; most serious weapon status 1-7 = sensitive but can be used in a table 8 = sensitive, cannot appear in a table police service id, exact date of incident Table threshold: ≤7 pass; ≥ 8 fail

13 Sensitive variables on the UCR, Homicide surveys Variables deemed sensitive (score 1-7)  geography (region, province, CMA)  age of victim/accused (aggregated, detailed)  most serious violation (aggregated)  most serious weapon (aggregated, detailed)  clearance status (aggregated, detailed)  level of injury (aggregate, detailed)  relationship of victim and accused (detailed, aggregated)

Detailed relationship between victim and accused (score=4) 14 Homicide victim was killed by: Husband/wifeSeparated husband/wife Common-law husband/wifeSeparated common-law h/w Divorced husband/wifeExtra-marital lover Same-sex spouseex same-sex spouse Father/motherStep-father/mother Son/daughterStep-Son/step-daughter Sister/brotherOther family Close friendOther intimate relationship Authority figureNeighbour Criminal relationshipBusiness relationship strangerCasual acquaintance unknownOther

Aggregated relationship between victim and accused (score=3) Homicide victim was killed by:  Family – spouse  Family – parent  Family – other  Other intimate relationship  Casual acquaintance  Criminal relationship  Stranger  Other  Unknown, n/a 15

Factors considered when scoring a variable  Scores, thresholds consistent across surveys  Maximum number of dimensions for RDC tables 8 dimensions for UCR; 3 for Homicide  Homicide data: single year vs 10 year data  Wanted scores to work for all CCJS tables: UCR scores: passed all CANSIM and Juristat tables Homicide scores: passed all CANSIM tables but not all Juristat tables 16

Factors considered when scoring a variable  Principle behind scoring approach: table is safe as long as sensitive characteristics cannot be attributed to a person or a group  Scrutinized tables with scores < 8 for sensitive characteristics revealed through: Identity disclosure Examined cells with counts of 1 or 2 Attribute disclosure Examined full cells, zero cells 17

extract of UCR table with score=7 Sexual violation incidents, victim=female age 25-34, accused=male, Canada, 2011 relationshipWeapon causing injury Unknownphysical force Firearmknifeothern/aTotal friend Business Criminal Casual Stranger Step-parent Step-child Other intimate Neighbour Total ,306 18

Status of UCR, Homicide RDC pilots UCR:  Crime data for available in RDCs  7 research proposals submitted and accepted  Disclosure control vetting committee for the pilot ensure disclosure control rules applied correctly evaluate/fine tune disclosure control approaches Homicide:  Homicide data for available in RDCs 19

Pros and cons of scoring  Pros easy for RDC researchers and CCJS to apply rules rules are consistently applied no distortion of data  Cons determining scores and thresholds is time-consuming difficult to determine scores if lots of variables or variables have lots of categories for Homicide, the pass/fail scoring approach for RDCs is very restrictive not immune to residual disclosure 20

Conclusion The scoring approach for frequency counts works well:  for crime-reported data and effectively mimics subject matter experts’ judgement when vetting  for census administrative data with an extensive history of published tables that set the standard for releasing tables  when there are a manageable number of variables and categories within variables Once developed, the scoring approach is easy to apply 21

For more information, please contact / Pour plus de renseignements, veuillez contacter: Sarah Franklin 22