Automating the Production of Descriptive Tables at Statistics Canada mog.ado, a user-written program with quality controls Questions and comments may be.

Slides:



Advertisements
Similar presentations
Take another look Alison Hayman Search Solutions Unit Dissemination Divison February 2011 Statistics Canada site search.
Advertisements

January OLA Super Conference 2009 S.Giles, D.Jakubek Ryerson University Census? Statistics? E-Stat can give you the answers! Ontario Library Association.
Joe Wilkinson Public Sector Employee Pension Liabilities and Assets: Implication for Canadas Total Government Debt Measures.
Collecting data on Care-giving and Unpaid work Heather Dryburgh Statistics Canada.
THE 2004 LIVING CONDITIONS MONITORING SURVEY : ZAMBIA EXTENT TO WHICH GENDER WAS INCORPORATED presented at the Global Forum on Gender Statistics, Accra.
Collection of international migration data from official statistics: opportunities and challenges Adriana Skenderi Editor, UN Demographic Yearbook
Data Quality Assurance and Dissemination International Workshop on Energy Statistics Aguascalientes, Mexico.
The Scope of Energy Statistics in Canada International Workshop on Energy Statistics Aguascalientes, Mexico.
Arthur Berger Regional Products and Income Accounts, Beijing, China, March 2010 Canadas Provincial and Territorial Economic Accounts.
Presented by: Denise Sjahkit SURINAME. Introduction Overview of the main policy issues Scope Current compilation practices Data-sources Requirements for.
International Telecommunication Union Stocktaking – metadata collection Household survey results Market, Economics and Finance Unit.
The UK Census: future directions Peter J Fullerton Administrative Sources and Integration Division.
1 Questionnaire design Module 3 Session 3. 2 Overview (of Session) This session starts by introducing some aspects that need to be considered when designing.
Collecting data for informed decision-making
2011 Census Can Succeed: Census Makes Sense to Most Canadians Jack Jedwab for the Association for Canadian Studies and the Canadian Race Relations Foundation.
Sampling in Marketing Research
Attributing Monetary Values to Volunteer Contributions Jack Quarter Laurie Mook October 18, 2004.
Name of presenter(s) or subtitle Canadian Netizens February 2004.
Unit 3: Changing Populations Get to Mississippi Mills…STAT!
2011 WINNISQUAM COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=1021.
2011 FRANKLIN COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=332.
Statistical Disclosure Control (SDC) at SURS Andreja Smukavec General Methodology and Standards Sector.
Static Equilibrium; Elasticity and Fracture
Discrepancies Between National and International Data By Shelton Kanyanda (Chief Statistician Malawi)
Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.
Preparing Data for Quantitative Analysis
Implementation of the CoP in SLOVENIA Cooperation with data users Genovefa RUŽIĆ Deputy Director-General.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences CHAPTER.
TAJSTAT: Strengthening the National Statistical System Project Mustafa Dinc TLSS and MICS Conference Dushanbe, Tajikistan July 1, 2008.
CMNS 261 Statistical Sources Sylvia Roberts Liaison Librarian for CMNS September 2009.
The Dutch Censuses of 1960, 1971 and 2001 Producing public use files in the IPUMS project Wijnand Advokaat Statistics Netherlands Division Social and Spatial.
EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008.
An Integrated Approach to Economic Statistics “ The Canadian Experience” UNSD – IBGE Workshop on Manufacturing Statistics Kevin Roberts Rio de Janeiro,
1 Human resources management in NSOs Training workshop for SADC member states. Luanda, 2-6 Dec 2006 Olav Ljones, Deputy Director General, Statistics Norway.
POLICIES AND PROCEDURES FOR ARCHIVING DATA IN BURUNDI.
National Household Survey: collection, quality and dissemination Laurent Roy Statistics Canada March 20, 2013 National Household Survey 1.
Administrative Data at Statistics Canada – Current Uses and the Way Forward 27 th Voorburg Group Meeting Warsaw, Poland André Loranger October 4, 2012.
Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September 2011 Overview of Archiving of Microdata Session 4 United Nations.
Overview of 2002 CIPSEA: Methods to Protect Confidential Tabular Data Amrut Champaneri, Ph.D. U.S. Department of Transportation Bureau of Transportation.
DATA VS STATISTICS. Data Facts or figures* from which conclusions can be drawn Numeric files created and organized – for analysis, or to create a new.
The Statistical Business Register of Macao SAR Government of Macao SAR Statistics and Census Service.
Modernization and Reengineering of the Census of Governments A focus on the Quarterly Tax Survey June 4, 2010.
Copyright 2010, The World Bank Group. All Rights Reserved. Planning and programming Planning and prioritizing Part 1 Strengthening Statistics Produced.
Copyright 2010, The World Bank Group. All Rights Reserved. Part 2 Labor Market Information Produced in Collaboration between World Bank Institute and the.
1 1 Best practice template Introduction Prepared for the 6 th Oslo Group meeting in Canberra 2 – 5 May 2011 Elisabeth Isaksen Senior Executive Officer.
The MDGs - Country experience Republic of Mauritius Meera Ganoo Central Statistcis Office Mauritius May 2008.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
United Nations Regional Seminar on Census Data Dissemination and Spatial Analysis for Arabic Speaking Countries, Amman, Jordan May 2011 Identification.
By Darlene Fichter and Harpreet Aulakh Data Library Services October 2, 2007 Overview of Government Statistics Sociology Principles of Research Design.
Chapter Twelve Copyright © 2006 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences.
Quality Assurance Programme of the Canadian Census of Population Expert Group Meeting on Population and Housing Censuses Geneva July 7-9, 2010.
Legal and institutional foundation of economic statistics Overview of international experience Regional Workshop for African Countries on Compilation of.
JOINT UN-ECE/EUROSTAT MEETING ON POPULATION AND HOUSING CENSUSES GENEVA, MAY 2009 DETERMINING USER NEEDS FOR THE 2011 UK CENSUS IAN WHITE, Office.
National Boot camp Vancouver Heather Dryburgh and Michel B. Séguin May 31 st, 2011 Survey Life cycle.
United Nations Regional Seminar on Census Data Dissemination and Spatial Analysis for Arabic Speaking Countries, Amman, Jordan May 2011 Identification.
Marketing Information System A Marketing Information System is the structure of people, equipment, and procedures used to gather, analyze, and distribute.
Data in context Chapter 1 of Data Basics. Frameworks Today, we will be presenting two frameworks for thinking about the content of data services. A.Statistics.
Methods of Statistical Analysis and Dissemination of Census Results in Guyana MORGAN CLITUS DIAS SENIOR CARTOGRAPHER BUREAU OF STATISTICS GEORGEOWN,GUYANA.
Use of Standardized Metadata to Find, Select and Access Statistical Data - Experience of Statistics Canada - Joint UNECE/Eurostat/OECD Work Session on.
Marketing Information System A Marketing Information System is the structure of people, equipment, and procedures used to gather, analyze, and distribute.
A Training Course for the Analysis and Reporting of Data from Education Management Information Systems (EMIS)
Presented: 2009 Canadian Users Stata Group Meeting
Camilla Stoltenberg IANPHI Annual Meeting Roma, 24 October 2017
Business Research Methods
Idendification of and Consultation with Census Data Users
Albania 2021 Population and Housing Census - Plans
Data collection.
Overview Results Director of various statistical and IT projects
Dissemination of data and metadata…
Data collection.
Presentation transcript:

Automating the Production of Descriptive Tables at Statistics Canada mog.ado, a user-written program with quality controls Questions and comments may be sent to the author at

Statistics Canada Statistique Canada Contents Environment of where mog was developed Statistics Canada Purpose of mog Examples Options: present and future

Statistics Canada Statistique Canada Statistics Canada Statistics Canada produces statistics that help Canadians better understand their countryits population, resources, economy, society and culture Objective statistical information is vital to an open and democratic society. It provides a solid foundation for informed decisions by elected representatives, businesses, unions and non-profit organizations, as well as individual Canadiansinformed decisions As Canadas central statistical agency, Statistics Canada is legislated to serve this function for the whole of Canada and each of the provinceslegislated In addition to conducting a Census every five years, there are about 350 active surveys on virtually all aspects of Canadian life Data uses include: GDP, CPI, unemployment rate; health, social and education statistics We at Statistics Canada are committed to protecting the confidentiality of all information entrusted to us and to ensuring that the information we deliver is timely and relevant to Canadiansprotecting the confidentialityrelevant to Canadians Visit us at for more informationwww.statcan.gc.ca Source:

Statistics Canada Statistique Canada Collection and Dissemination Collecting data (census, administrative data and surveys) Questionnaire development, testing, collection, and data processing Check data Verification (errors in processing, coding mistakes) Certification (compare estimates to other data sources) Preparations for dissemination (e.g. for an analysis made on the data) Reliability of the estimates is acceptable Suppression (confidentiality of respondents is being protected) Significance testing between estimates

Statistics Canada Statistique Canada Purpose of mog mog designed to automate the dissemination quality control steps of: reliability, suppression, and significance testing As well, it displays estimates by up to two other classification variables in tabular form Result: a table giving estimates (mean or total) of one variable over one or two other categorical variables Useful for simple, descriptive statistics

Statistics Canada Statistique Canada Example I Make a table showing the mean of retired by age and education categories (similar to table education age, c(m retired)), but with quality control checks mog retired education age, nodetail survey dec(0) Means of retired by education and age Estimation technique for standard errors: linearized Table 45 to to 75 Over 75 doctorate/maste~ 20 87^ 88^ diploma/certifi~ 16 86^ 97^ some university~ 18 83^ 92^ high school dip~ 18 88^ 79^ some secondary/~ 26 76*^ 76*^ Notes * significantly different from the reference group of the variable educ5, category number 1, p <.05 ^ significantly different from the reference group of the variable age3, category number 1, p <.05 The data in the table is not real.

Statistics Canada Statistique Canada Example II Same as example I with additional options mog retired education age, nodetail /// survey dec(0) ref2(2) pubs pubdichot underscores varwidth(40) Means of retired by education and age Estimation technique for standard errors: linearized Table 45_to_65 66_to_75 Over_75 doctorate/masters/bachelor's_degree 20^E 87X 88X diploma/certificate_from_community_colle~ 16^ 86 97^X some_university/community_college 18^E 83X 92X high_school_diploma 18^ 88X 79 some_secondary/elementary/no_schooling 26^ 76* 76* Notes * significantly different from the reference group of the variable educ5, category number 1, p <.05 ^ significantly different from the reference group of the variable age3, category number 2, p <.05 The data in the table is not real.

Statistics Canada Statistique Canada Example I: the Long Way At Statistics Canada, to create the table in our example that meets key confidentiality and quality requirements (there are others) would need the following commands to be run: One table command to create a table of estimates One mean command and one estimates table command to examine individual significance of the 15 estimates 22 test or lincom commands requiring visual inspection of results One tabulate command and a visual inspection of 15 cell counts In total, 26 lines of code and 52 numbers that need to be visually inspected, as opposed to 1 line of code to run mog and inspecting the 15 estimates it produces, all in one place The work multiplies for each table you have All of the above needs to be done again if the sample changes

Statistics Canada Statistique Canada Copying Process Select the table rows from the mog output Right click and select: copy table if copying to a spreadsheet or word processor (in a Word table, select enough rows and columns in the table into which you are copying) Other options include: copy text if copying to a word processor where you will use a fixed width font copy html if copying to a location where you want a table to be automatically generated mogs underscore option useful when value labels have spacesensures the correct number of columns are created

Statistics Canada Statistique Canada Other Options Display Options: Number of decimal places displayed; number rounding Control of column width (although columns will automatically enlarge if large numbers/many decimal places are to be displayed) Reshow table by typing mog with no arguments Reshow table with different reference groups (or other display options) without re-estimating the variances (time saver when bootstrapping) Can show quality control symbols that indicate: individual statistical significance of results at two user-defined thresholds (e.g. F = do not publish if cv > 1/3, E = publish with warning if 1/3 >= cv >= 1/6); and whether the estimate is based on enough observations (e.g. X if too few) The cut-offs and symbols can be changed as per the users needs Statistics Canada surveys have User Guides that indicate these values Analysis Significance level used for tests between classification levels can be changed (.05,.01, …) mog is byable Will use svyset information in variance estimation via survey option (not through svy prefix)

Statistics Canada Statistique Canada Future Options Save table as a csv file Show standard errors/t-ratios under estimates Harmonize syntax with Statause over() option to specify classification variables Use estimates based on different populations by one classification variable Use with proportion command Find alternative to the underscores option

Statistics Canada Statistique Canada Requests for the Program Contact me directly at and I will send you the Please provide me with any comments you may have on bugs, wording, inconsistencies, etc. After receiving enough feedback, I will update the program and make it available online at one of the stata program archive sites