BIG Data and OFFICIAL Statistics The Council of Professional Associations on Federal Statistics (COPAFS) Michael W. Horrigan Associate Commissioner Office.

Slides:



Advertisements
Similar presentations
ICP 7-th Regional Coordinators Meeting World Bank, Washington D.C.
Advertisements

Introduction: the New Price Index Manuals Presentation Points IMF Statistics Department.
Sampling: Theory and Methods
Keeping an Eye on Your World Economic Slide Library … Will grow continuously as updates continue to get added.
Cost Management ACCOUNTING AND CONTROL
WP2 Labour input in the National Account: Italy Consortium meeting Helsinki 9-11 June 2005.
BEAs Fixed Assets Accounts : An Overview Dave Wasshausen The First World KLEMS Conference Harvard University August 19-20, 2010.
BEA’s KLEMS Statistics: Measuring Outputs and Intermediate Inputs
Re-design of the trade in commercial services program in Canada October 2010 OECD Working Party on Trade in Goods and Services.
Treatment of social insurance schemes in the 2008 SNA Regional Seminar on Developing a Programme for the Implementation of the 2008 SNA and Supporting.
Statistics NZs experience in using Administrative Data in an Integrated Programme of Economic Vince Galvin General Manager Strategy & Communications.
Arthur Berger Regional Products and Income Accounts, Beijing, China, March 2010 Canadas Provincial and Territorial Economic Accounts.
Republic of the Philippines NATIONAL STATISTICAL COORDINATION BOARD 1 International Workshop From Data to Accounts: Measuring Production in National Accounting.
A Statistical Architecture for Economic Statistics Ron McKenzie ICES III.
1 ESTIMATION IN THE PRESENCE OF TAX DATA IN BUSINESS SURVEYS David Haziza, Gordon Kuromi and Joana Bérubé Université de Montréal & Statistics Canada ICESIII.
ENTREPRENEURSHIP (Ms. Hawkins)
1 WTO Statistics Division Trends in Services Trade under GATS Recent Developments Symposium on Assessment of Trade in Services World.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Monitoring School District Human Resource Cost Pressures Presented by Tom Gallagher October 30, 2012 Research & Planning Wyoming Department of Workforce.
Research into an alternative sampling frame for the FRS Antonia Simon, Development Team, DWP.
Outline of talk The ONS surveys Why should we weight?
1 WATER AUTHORITY Dr. Or Goldfarb CENTRAL BUREAU of STATISTICS Zaur Ibragimov Water Accounts in Israel Vienna January 2009.
Company Name PRESENTATION NAME Compilation and Dissemination of Energy Statistics International Workshop on Energy Statistics, Beijing, Sep 2012 International.
Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.
1 Wyomings Labor Market: A Brief Overview Doug Leonard, Principal Economist Wyoming Department of Workforce Services, Research & Planning
Measuring Quality in the BLS Business Register Richard Clayton David Talan Joint UNECE/OECD/Eurostat Meeting of the Group of Experts on Business Registers.
Micro Data: Collecting and Integrating Them in a Central Banks Research and Policy Kasper Roszbach World Congress on National Accounts and Economic Performance.
Information Systems Today: Managing in the Digital World
Secondary Data, Literature Reviews, and Hypotheses
GSA Federal Supply Service DOING BUSINESS WITH GSA.
ABC Technology Project
MARKETING INFORMATION AND RESEARCH
1 Third Workshop on ICP Western Asia Beirut, October 2004 Design of ICP price survey Sultan Ahmad, Consultant Based on Keith.
1 Wyoming Labor Market Information – Theres a Website for That! Presented by Sara Saulcy, Senior Economist Wyoming Department of Workforce Services Research.
Research Department 1 Global Economic Crisis and the Israeli Economy Herzliya conference Dr. Karnit Flug Research Director, Bank of Israel February 2009.
Labour Force Historical Review Sandra Keys, University of Waterloo DLI OntarioTraining University of Guelph, Guelph, ON April 12, 2006.
1 Longitudinal Employer- Household Dynamics (LEHD) Program Jeremy S. Wu U.S. Census Bureau May 11, 2005 Jeremy S. Wu U.S. Census Bureau May 11, 2005.
Research & Planning: Your Source for Labor Market Information Presented to SHRM, Gillette, WY March 12, 2014 Research & Planning Wyoming Department of.
25 seconds left…...
Annual Industry Accounts Overview George Smith & Nicole Mayerhauser Current Industry Analysis Division Bureau of Economic Analysis Industry Accounts Users’
Impact of Globalization on the US Statistical System Maureen Doherty All views expressed in this presentation are those of the author and do not necessarily.
Historical Changes in Stay-at-Home Mothers: 1969 to 2009 American Sociological Association Annual Meeting Atlanta, GA August 14-17, 2010 Rose M. Kreider,
12 Financial Management 12-1 Financial Planning
Local Employment Dynamics Training October 2014 Earlene Dowell Longitudinal Employer-Household Dynamics U.S. Census Bureau 1.
1 Volume measures and Rebasing of National Accounts Training Workshop on System of National Accounts for ECO Member Countries October 2012, Tehran,
Measuring the US Economy Economic Indicators. Understanding the Lingo Annualized Rates Example: GDP Q3 (Final) = $11,814.9B (5.5%) Q2: GDP = $2,
Institutional Framework for Economic Statistics in Decentralized System J. Steven Landefeld Director Friends of the Chair Group on Integrated Economic.
Data Collection in a Decentralized Statistical System – The U.S. Perspective Friends of the Chair Group on Integrated Economic Statistics, Work Group Meeting.
What are Wage Records? Wage records are an administrative database used to calculate Unemployment Insurance benefits for employees who have been laid-off.
U. S. Bureau of Labor Statistics The Trials and Tribulations of Developing International Services Price Indexes The 2008 World Congress on National Accounts.
Economic Indicators. Concepts  Variables that provide information about the state of the economy.  Every economic indicator has a story to tell.  Need.
Service Sector Improvements in the National Economic Accounts J. Steven Landefeld, Director Measuring Up in a Changing Economy: A Look at New.
The Importance of Economic Census Data for Federal Policy Katharine G. Abraham Member, Council of Economic Advisers Hi-Beams for the Economic Road Ahead.
A “Soup to Nuts” Guide for Modernizing and Integrating the Production and Dissemination of Statistics J. Steven Landefeld, Director High-Level.
Improvements in the BLS Business Register Richard Clayton David Talan 12th Meeting of the Group of Experts on Business Registers Paris, France September.
Data Sharing to Reduce Respondent Burden for the U.S. Census Bureau’s Business Register Presented to 12 th Meeting of the Group of Experts on Business.
Labor Market Information in the Americas: the United States Workshop On Labor Migration and Labor Market Information Systems Inter-American Network for.
Big Data activities at the U.S. Census Bureau Cavan Capps Big Data Lead U.S. Census Bureau February 13, 2014 Prepared for MIT Libraries Program on Information.
1 Business Register: Quality Practices Eddie Salyers
12th Meeting of the Group of Experts on Business Registers
1 Presentation to OG6 Canberra, Australia May 2011 Statistical Uses of Administrative Data in Canada.
1 “The Future of Data Collection, Access, and Dissemination: Uses of Administrative Data and Data Matching” J. Steven Landefeld, Director Population.
Expanding Business Employment Dynamics Industry and Survival 18 th International Roundtable on Business Survey Frames Beijing, China 10/22/04 Richard L.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Inflation Report May Output and supply Chart 3.1 Whole-economy GDP (a) (a) Chained volume measures. Annual growth of GDP at basic prices for 2005.
PowerPoint Presentation by Charlie Cook Copyright © 2004 South-Western. All rights reserved. Chapter 21 The Macroeconomic Environment.
Census quality evaluation: Considerations from an international perspective Bernard Baffour and Paolo Valente UNECE Statistical Division Joint UNECE/Eurostat.
Using administrative data to produce official social statistics New Zealand’s experience.
Measuring Data Quality in the BLS Business Register Richard Clayton Sherry Konigsberg David Talan WiesbadenGroup on Business Registers Tallin, Estonia.
Presentation transcript:

BIG Data and OFFICIAL Statistics The Council of Professional Associations on Federal Statistics (COPAFS) Michael W. Horrigan Associate Commissioner Office of Prices and Living Conditions March 1, 2013

Big Data and Official Statistics What are big data? How big data are already being used. The future of using big data by statistical agencies – perspective from a quality framework. 2

Big Data and Official Statistics 3 Big Data Admin Data Sampled survey data Non-sampled data

Bureau of Economic Analysis 4 Big Data Admin Data Sampled survey data Non-sampled data GDP

How are Big Data being used? Webscraping - Billion Prices Project Webscraping – BLS CPI Create data base of product characteristics for use in quality adjustment hedonic models – Televisions – Camcorders – Camera – Washing Machines Research to expand use to collect prices for cable TV plans and airline prices 5

How are Big Data being used? Google Tools to create large data files that combine publicly available data on social and economic activity stratified by geography, and social- demographic characteristics – Flu outbreaks, social unrest, job search, unemployment, etc. Modelling form combines google search index data in the current period with past values of an economic measure from the statistical system to predict a future value of the same concept. 6

How are Big Data being used? Tweets University of Michigan Study database Case study of job loss related tweets that examines the correlation with unemployment data to predict initial claims Intuit Time series of employment, compensation, hours worked, hourly rates of pay, % full time, new hire rate Stratified by size, industries ADP Payroll Over the month change in payroll employment 7

How are Big Data being used? Scanner data: Homescan, Nielson Actual sales transactions Comparison of national distribution of selected products with results from CPI disaggregation process JD Power Used car frame for CPI Researching use for CPI production of new car price indexes 8

How are Big Data being used? Medicare part B PPI and CPI use reimbursements to doctors by procedure code in indexes Claims data Validation of MEPS and CPI inflation rates Note: CPI constructs experimental disease based price indexes using annual weights from the MEPS household survey data 9

How are Big Data being used? Stock Exchange Security Trades PPI receives a monthly census of all bid and ask prices and trading volume for all traded securities as of market close for 3 selected days of the month. These data are used for index estimation 10

How are Big Data being used? Company provided data – Corp X Research by CPI to use company provided data on all register transactions for sampled outlets Challenges: – Can the matched model requirement be satisfied – Accounting for substitutes – IT production requirements – Risk of losing access 11

How are Big Data being used? Administrative data Published data using universe counts Sampled surveys Drawing samples Frame refinement Development of weights Imputation 12 Estimation

How are Big Data being used? BLS Quarterly Census of Employment and Wages: Some examples of uses: BLS sampling: PPI, NCS, CES, OES, OSH, JOLTS, Green Jobs Imputation: State based estimates use QCEW data to impute for key non-respondents Use of QCEW data to develop forecasts that are used in the CES birth death model Census of establishments by industry Census of the Population Customs Bureau trade flow data 13

How are Big Data being used? Administrative data Used directly in estimation IPP uses EIA data on crude petroleum for their import indexes PPI uses Department of Transportation data on baggage fees CPI uses SABRE data for airline prices 14

How are Big Data being used? Administrative data Linking Census Bureaus Longitudinal Establishment…. BLS Business Employment Dynamics Linking within agencies Sharing across agencies: CIPSEA 15

Assessing Big Data through the lens of Quality frameworks Statistical agencies use a variety of quality dimensions to judge the efficacy of their direct data collection programs. It is reasonable to ask how the use of Big Data by Billion Prices, Google, Intuit and others fare along the same dimensions The use of external data sets (Big, Administrative, Other surveys) by statistical agencies to produce blended estimates should come under the same scrutiny 16

Quality as a three-level concept Product QualityProcess Quality Organizational Quality 17

Product Quality TimelinessThe two primary quality features of Billion, Google, Intuit Relevance Objectivity Clear, unbiased Accuracy – sampling errors Calculated, published, used in analysis 18

Product Quality Accuracy – non sampling errors Coverage – Primary challenge to statistical systems – Often an advantage of Big Data Non response bias – Significant concern of statistical systems about their own data and for Big Data Classification/specification – Lack of cross walks across different classification systems across statistical systems, administrative data, firm data, big data 19

Product Quality Timeliness Relevance Objectivity Integrity Accuracy – sampling and non-sampling errors VariancesCoverage Standard ErrorsNonresponse bias Specification errors Data processing errors Measurement error 20

Product Quality Timeliness Relevance Objectivity Integrity Accuracy – sampling and non-sampling errors VariancesCoverage Standard ErrorsNonresponse bias Specification errors Data processing errors Measurement error 21

Product Quality Metadata/transparency/interpretability Coherence / comparability Accessibility Serviceability 22

The Future of Using Big Data by the U.S. Statistical System Here to stay but quality assessment is lacking Groves, Washington Post, August 7, 2012 Costs and declining budgets make using big data in constructing blended estimates a reality Assumes time more valuable than privacy, respondents willing to give permission to access bank records, credit card reports, taxes, etc. 23

The Future of Using Big Data by the U.S. Statistical System Will households cooperate? Asking respondent permission is key Concerned about impact on both response rates and non-respondent bias. More likely greater progress will be made using big data from businesses than households What about integrating private sources of data such as Google, Intuit and Billion Prices? Without transparency, not likely Comparability more likely 24

Contact Information Michael Horrigan Associate Commissioner Office of Prices and Living Conditions

What are Big Data? 26