Using Name Change and Non-Education Administrative Data to Assist in Identity Matching 26th Annual Management Information Systems (MIS) Conference February.

Presentation transcript:

Using Name Change and Non-Education Administrative Data to Assist in Identity Matching
26th Annual Management Information Systems (MIS) Conference
February 14, 2013
John Sabel and Carol Jenner
Washington Education Research & Data Center

Overview
Background
Identity Resolution Challenges
Non-Education Data Sources
How to Apply to Identity Resolution
Value Added
State Sources of Name-Change Data
Contact Information

Washington’s P20W Data System
Based in Education Research & Data Center in the state Office of Financial Management
o Forecasting & Research Division – specialists in education, economics, human services and demography with experience in management and analysis of large administrative data sets
o Since 1999, home of state’s unit-record public baccalaureate enrollment data system
P20W data system
o Centralized, research-oriented
o Comprehensive data from early learning, K-12, public postsecondary, workforce
o Also apprenticeship, corrections, GED completers, National Student Clearinghouse and selected non-education sources

Washington’s P20W Data Warehouse
All PII data is isolated within the Informatica MDM (Master Data Management) ORS (Operational Reference Store), where a P20_ID token is assigned to each unique individual. In addition, a Token_ID is created using a combination of Source System Identifier and Source System Person Identifier and attached to all data received from a system, to allow for identity merging and identity unmerging at the P20 level and at the detailed data level.
[Diagram: PII data with P20_ID token in the MDM ORS; PERSON (P20_ID token), ROLE (ROLE_ID token) and ORGANIZATION (ORG_ID token); PRO Enrollment, PRO Achievement and PRO Event (each + Source ID token); P20 Data Warehouse keyed on P20_ID, ROLE_ID, ORG_ID.]
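The token arrangement described on this slide can be illustrated with a small sketch. This is not the Informatica MDM implementation or its API; the class names `TokenID` and `IdentityStore`, the integer P20_ID values, and the source system labels are all hypothetical. It only shows why keeping a per-source token attached to each record makes both merging and unmerging possible.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenID:
    """Hypothetical token: source system plus that system's person identifier."""
    source_system: str
    source_person_id: str


class IdentityStore:
    """Illustrative mapping from per-source tokens to a shared P20_ID."""

    def __init__(self):
        self._p20_of = {}   # TokenID -> P20_ID (an int here, by assumption)
        self._next_id = 1

    def _new_p20(self):
        p20 = self._next_id
        self._next_id += 1
        return p20

    def register(self, token):
        # A token seen for the first time gets its own P20_ID.
        if token not in self._p20_of:
            self._p20_of[token] = self._new_p20()
        return self._p20_of[token]

    def p20_id(self, token):
        return self._p20_of[token]

    def merge(self, keep, other):
        # Every token that shared other's P20_ID now points at keep's P20_ID.
        target, old = self._p20_of[keep], self._p20_of[other]
        for token, p20 in self._p20_of.items():
            if p20 == old:
                self._p20_of[token] = target

    def unmerge(self, token):
        # Source tokens are never destroyed, so one can be split back out.
        self._p20_of[token] = self._new_p20()
        return self._p20_of[token]
```

Because the source-level token survives a merge, a mistaken merge can be reversed without reloading the source data.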

Names: Challenges in administrative records
Actual name changes – some “official” and some not
o Marriage, Divorce, Adoption
o Personal decision
Different expression of same name
o Use of nicknames
o Missing middle names or middle initial only
o Switched first and middle names
o Cultural name conventions
Universal problems
o High frequency surnames (Smith, Anderson, Nguyen)
o Twins

Some name changes are easy to determine.
Within a single sector:

K-12:
LastName  FirstName  MiddleName  BirthDate  School  K12StateID  SSN
Wilson    John       Edward
Anderson  John       Edward

Postsecondary:
LastName  FirstName  MiddleName  BirthDate  College  CollegeID  SSN
Smith     Mary       Elizabeth
Jones     Mary       Elizabeth

Workforce (Unemployment Insurance Wage):
LastName  FirstName  MiddleName  YYYYQ  EmployerID  SSN
Gregg     P          J           20011  A5326B
Brown     P          J           20012  A5326B

Note: Information presented here has been fabricated to provide illustrative examples. As of June 24, 2011, SSNs beginning with and had not been issued by the Social Security Administration.

Cross-sector linking provides resolution
Cross-sector:

K-12:
LastName  FirstName  MiddleName  BirthDate  School  StudentID
Smith     James      Edward
Smith     Jim        E
Smith     Bubblegum

Postsecondary:
LastName  FirstName  MiddleName       BirthDate  College  SSN
Smith     James      E “Bubblegum”

Note: Information presented here has been fabricated to provide illustrative examples. As of June 24, 2011, SSNs beginning with had not been issued by the Social Security Administration.

Non-education data source provides resolution
Cross-sector plus additional non-education information:

K-12:
LastName  FirstName  MiddleName  BirthDate  School  StudentID
Smith     James      Edward
Smith     Jim        E
Smith     Bubblegum

Postsecondary:
LastName  FirstName  MiddleName       BirthDate  College  SSN
Smith     James      E “Bubblegum”

Driver license:
LastName  FirstName  MiddleName  BirthDate  SSN (last 4)
Smith     James      Edward
(no other James E Smiths – any birthdate – in driver license data)

Note: Information presented here has been fabricated to provide illustrative examples. As of June 24, 2011, SSNs beginning with had not been issued by the Social Security Administration.

Two people or one?

K-12:
LastName  FirstName  MiddleName  BirthDate  SSN
Anderson  Brittney   Janice
Anderson  Brittney   T

Driver License:
LastName  FirstName  MiddleName  BirthDate  SSN (last 4)
Anderson  Brittney   Janice
Anderson  Brittney   Theresa

Note: Information presented here has been fabricated to provide illustrative examples.

First-Middle-Last format doesn’t fit all
María Theresa Garcia López (birth date same in all records)

K-12:
LastName      FirstName      MiddleName      School  StudentID
Lopez         Maria          Theresa Garcia
Garcia        Ma Theresa
Lopez         Theresa        Garcia

Postsecondary:
LastName      FirstName      MiddleName      College  SSN
Garcia Lopez  Maria          Theresa
Garcia        Lopez          M

Driver License:
LastName      FirstName      MiddleName      SSN (last 4)
Garcia Lopez  Maria Theresa                  1234

Note: Information presented here has been fabricated to provide illustrative examples. As of June 24, 2011, SSNs beginning with had not been issued by the Social Security Administration.
For discussion of cultural naming conventions, see Marcus, N., Adger, C.T., & Arteagoitia, I. (2007). Registering students from language backgrounds other than English (Issues & Answers Report, REL 2007-No. 025). Washington, DC: U.S. Department of Education, Institute of Education Sciences, National Center for Education Evaluation and Regional Assistance, Regional Education Laboratory Appalachia. Retrieved from

Name Change Data: Old Names / New Names
Four sources of non-education name change data:
1. WA State court system name changes
2. WA State Department of Licensing data
3. WA State marriage data, for women only
4. WA State divorce data, for women only
With all four sources, raw data are massaged into old name / new name pairs.
For divorce data, the potential old last name is inferred from the husband’s last name.
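As a rough illustration of turning a raw record into an old name / new name pair, the sketch below handles the divorce case only. The record layout, the `Name` and `NamePair` types, the ISO date string, and the function `pair_from_divorce` are assumptions made for the example, not the actual ERDC process; in particular it assumes the wife's name on the divorce record is the "new" name and that a birth date is available.

```python
from typing import NamedTuple, Optional


class Name(NamedTuple):
    first: str
    middle: str
    last: str


class NamePair(NamedTuple):
    old: Name
    new: Name
    birth_date: str   # assumed ISO format, e.g. "1980-03-02"
    source: str       # which feed produced the pair


def pair_from_divorce(wife: Name, husband_last: str,
                      birth_date: str) -> Optional[NamePair]:
    """Infer a potential old name by swapping in the husband's last name."""
    # If the last names already agree, the record implies no name change.
    if wife.last.casefold() == husband_last.casefold():
        return None
    old = wife._replace(last=husband_last)
    return NamePair(old=old, new=wife, birth_date=birth_date, source="divorce")
```

The old name here is only a candidate: the inference can be wrong if the wife never used the husband's surname, which is one reason the resulting matches are reviewed rather than accepted automatically.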

Using Old Name / New Name Pairs
The old name / new name pairs act as a bridge:
Used to create tuples of data where one name matches an “old name” and the “new name” matches a different name.*
In practice, an exact match is done on the first and last names only in the tuples.
Example:
Name 1A  = Joy V. Chuit
Old Name = Joy Volanda Chuit
New Name = Roberta S. Almeida
Name 1B  = Roberta Almeida
Then the resulting data set is organized into “classes” based on similarities in the middle names.
* Subject to the birth dates being the same
Note: Information presented here has been fabricated to provide illustrative examples.
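The bridging step can be sketched as a pair of exact lookups. The `Person` type, the `bridge` function, the case-insensitive comparison, and the birth dates used in any example are assumptions for illustration; the slide specifies only that first and last names are matched exactly and that birth dates must agree.

```python
from collections import defaultdict
from typing import NamedTuple


class Person(NamedTuple):
    first: str
    middle: str
    last: str
    birth_date: str   # assumed ISO date string


def match_key(p):
    # Exact match on first and last name only, plus birth date.
    return (p.first.casefold(), p.last.casefold(), p.birth_date)


def bridge(records, pairs):
    """Return (name_1a, old_name, new_name, name_1b) tuples.

    records: Person rows from the education data.
    pairs:   (old, new) Person tuples from a name-change source.
    """
    index = defaultdict(list)
    for rec in records:
        index[match_key(rec)].append(rec)

    tuples = []
    for old, new in pairs:
        if old.birth_date != new.birth_date:
            continue  # the bridge is only valid when birth dates agree
        for a in index.get(match_key(old), []):
            for b in index.get(match_key(new), []):
                tuples.append((a, old, new, b))
    return tuples
```

Middle names are deliberately ignored at this stage; they are carried along in each tuple so that the candidates can be graded afterwards.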

Using “classes” to organize potential matches
Potential matches are organized by middle name based classes:
Class 1: The middle names in tuple match perfectly.
– Class 1b: As above, but the day and month of birth is Jan. 1st
Class 2: Somewhere in tuple a full middle name matches a middle initial where only a middle initial is available.
– Class 2b: As above, but the day and month of birth is Jan. 1st
Class 3: Somewhere in tuple, a null middle name matches a non-null middle name.
– Class 3b: As above, but the month and day of birth is Jan. 1st
These potential matches are then reviewed in a spreadsheet format.
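One way to express these classes in code is sketched below. Several details are assumptions rather than part of the slide: the function names, the ISO birth date format, treating two empty middle names as a perfect match, assigning the weakest class found anywhere in the tuple, and returning `None` when two full middle names conflict.

```python
def compare_middles(a, b):
    """Grade one middle-name comparison: exact, initial, null or conflict."""
    a = a.replace(".", "").strip().casefold()
    b = b.replace(".", "").strip().casefold()
    if a == b:
        return "exact"        # assumption: two empty middle names count as equal
    if not a or not b:
        return "null"         # null middle name against a non-null one
    shorter, longer = sorted((a, b), key=len)
    if len(shorter) == 1 and longer.startswith(shorter):
        return "initial"      # middle initial against a full middle name
    return "conflict"


def classify(middle_pairs, birth_date):
    """Return '1', '2' or '3', with 'b' appended for Jan. 1 births.

    middle_pairs: (middle_a, middle_b) comparisons within one tuple,
                  e.g. Name 1A vs. old name and new name vs. Name 1B.
    birth_date:   assumed ISO string 'YYYY-MM-DD'.
    """
    kinds = [compare_middles(a, b) for a, b in middle_pairs]
    if "conflict" in kinds:
        return None           # not covered by the classes on the slide
    if "null" in kinds:
        cls = "3"
    elif "initial" in kinds:
        cls = "2"
    else:
        cls = "1"
    suffix = "b" if birth_date[5:] == "01-01" else ""
    return cls + suffix
```

Under these assumptions the Joy Chuit / Roberta Almeida example would land in Class 3, because "S." is compared against a missing middle name. The "b" variants presumably flag Jan. 1 birth dates because that date is often used as a placeholder, so a birth date match carries less weight there.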

Value added by use of non-education sources
Enhances accuracy of longitudinal tracking → more accurate calculation of graduation rates, postsecondary enrollment rates, etc.
– Reduced undercount of numerators
– Reduced overcount of denominators
Reduces bias
– More complete and accurate information for certain subgroups (name changes after marriage/divorce, blending of families)
– Improves matching and linking of names from a variety of cultural backgrounds

State Level Sources of Name Change Data
Marriage and divorce data
– All states have a Vital Records Office and/or a Center for Health Statistics. These agencies should maintain each state’s marriage and divorce data.
Court-sanctioned name change data
– All states have an office that is responsible for providing administrative, business and technology support services to their courts. Common names for such an office include “Administrative Office of the Courts” and “Office of the State Courts Administrator.” If a state maintains court-sanctioned name change data, this office will have it.
Driver license data

CONTACT US
John Sabel
Carol Jenner
Washington Education Research & Data Center