R: Innovating at the Bureau of Labor Statistics


R: Innovating at the Bureau of Labor Statistics Arcenis Rojas Economist Division of Consumer Expenditure Surveys EPA R Users’ Group Workshop Monday, September 11, 2017

Overview: Automation (International Prices); Quality control (Industrial Prices); Data visualization (Consumer Expenditures); Maps (Survey Methods – OES); R package development (Survey Methods); Other applications and programs; Education.

International Prices sample refinement. The International Prices Program (IPP) receives data from Census and Customs. The sample team must verify each Establishment ID (EIN), name, and address to provide to field economists. There are 1,700 export collection units per sample and 2,400 import collection units per sample, reviewed by 6 IPP sample team members. The manual process takes 16 copies, 20 pastes, and 46 clicks per unit. Developed by Ara Khatchadourian, Mike Murphy, and Rob Sutton.
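To put the manual workload in perspective, the per-unit counts can be multiplied out. The sketch below is illustrative only: it is written in Python rather than the R used by the team, and it assumes the same copy, paste, and click counts apply to every export and import collection unit.

```python
# Manual actions per collection unit, as stated on the slide.
ACTIONS_PER_UNIT = {"copies": 16, "pastes": 20, "clicks": 46}

# Collection units per sample, as stated on the slide.
UNITS_PER_SAMPLE = {"export": 1700, "import": 2400}


def manual_actions(units: int) -> int:
    """Total copies, pastes, and clicks needed to review `units` collection units."""
    return units * sum(ACTIONS_PER_UNIT.values())


if __name__ == "__main__":
    for kind, units in UNITS_PER_SAMPLE.items():
        print(f"{kind}: {manual_actions(units):,} manual actions per sample")
```

At 82 actions per unit, that is 139,400 manual actions for an export sample and 196,800 for an import sample.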

Data sources. Upper left: ACE, the Automated Commercial Environment, is the portal where exporters and importers enter all of their mandatory customs documentation. Right: the Longitudinal Database (LDB) is used to verify the EIN after getting it from ACE. Lower left: Google is used to verify the establishment name and address. If the name and address could not be verified on Google, the team member would have to use the Secretary of State website for the firm's respective state … and repeat the process hundreds of times.

Automating Sample Refinement. Now you just have to enter the collection unit identifier. The program populates the company name and address. The user can change the company name and street to update the search of the master list in real time. The program queries a data dump from ACE to verify the EIN. If the EIN's mailing and physical addresses do not match, the program highlights them. This prompts the sampling team member to investigate further and note it in the processing comments. The program checks the RTS master list by name and, separately, by address. The user can search within these fields as well to narrow down the results. The program highlights current reporters in green and also highlights old reporter addresses. Seeing a current reporter quickly allows us to verify the sampled address. Clicking "Search on SOS website" opens a web browser tab with the correct State SOS page. Clicking "create future note" puts an appropriate future note on the clipboard, ready to be pasted into the sampling system.
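The matching steps described above can be sketched roughly as follows. This is not the team's program (which was written in R and is not shown in the slides): the `Reporter` record, the `normalize` rule, and the function names are all hypothetical, and the real tool presumably uses more forgiving matching than this simple substring comparison.

```python
from dataclasses import dataclass


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in text.lower())
    return " ".join(cleaned.split())


@dataclass
class Reporter:
    """Hypothetical master-list record; the field names are assumptions."""
    name: str
    address: str
    current: bool  # True if the establishment is a current reporter


def addresses_differ(mailing: str, physical: str) -> bool:
    """Flag a record for review when the two addresses do not match."""
    return normalize(mailing) != normalize(physical)


def search_master_list(master, name_query="", address_query=""):
    """Search the master list by name and, separately, by address."""
    nq, aq = normalize(name_query), normalize(address_query)
    by_name = [r for r in master if nq and nq in normalize(r.name)]
    by_address = [r for r in master if aq and aq in normalize(r.address)]
    return by_name, by_address


if __name__ == "__main__":
    # Made-up records for illustration only.
    master = [
        Reporter("Acme Exports LLC", "12 Main St., Springfield", current=True),
        Reporter("Beta Imports", "99 Dock Rd", current=False),
    ]
    by_name, by_address = search_master_list(master, "acme", "12 main st")
    for r in by_name:
        status = "current reporter" if r.current else "old reporter"
        print(f"name match: {r.name} ({status})")
    print("address mismatch:", addresses_differ("12 Main St.", "PO Box 5"))
```

Keeping the name and address searches separate mirrors the slide: a hit on either field alone is enough to surface a candidate for the team member to review.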

Benefits of automation: time savings of 80-100 hours per sample; much less clicking; better and more thorough sample review; more time to review more problematic collection units. Other scripts have been created to improve efficiency.

Industrial Prices Visualization Dashboard Developed by Steven York and Neil Wagner

Index comparisons * Simulated data used due to confidentiality.

Index review and revision

Consumer Expenditure Survey (CE). Developed by Arcenis Rojas. The Interview Survey is a 3-month recall interview focusing on large expenditures and consists of 4 interviews per Consumer Unit (CU). The Diary Survey is a record of smaller-scale expenditures that are more difficult to recall and consists of up to 2 weeks of expenditure records per CU. We collect data on family and member characteristics, expenditures, income, taxes, assets, and liabilities.

CE Public-Use Microdata (PUMD): family-level characteristics; expenditures by Universal Classification Code (UCC); member-level characteristics; expenditures and their characteristics by type of expenditure (EXPN… > 50 files each year!); and more! CE PUMD provides all this cell-level data as a package of many spreadsheets. Some files are quarterly and others annual. The 2015 Microdata Release contained 95 files of 2015 data.

Interactive CE Visualization Tool

Mapping unemployment data. OEUS JavaScript map: uses pre-made tables to generate maps.

Mapping unemployment data. OSMR is working on generating R choropleth maps, which would allow more dynamic map generation. CPI is also working on generating ArcGIS maps for its own data.

Other apps and programs: EIA Analyzer (Industrial Prices); text analysis Shiny app (Survey Methods); rpms: Recursive Partitioning for Modeling Survey Data package (Survey Methods); growfunctions: Bayesian Non-Parametric Dependent Models for Time-Indexed Functional Data package (Survey Methods). EIA Analyzer is a Shiny app that takes web-scraped data (also scraped via R) from EIA's Weekly Petroleum Status Report and lets the user visualize the data and easily calculate percent changes between any two data points. Developed by Stephen York and Neil Wagner. The text analysis Shiny app is a general app. Developed by Brandon Kopp, Randall Powers, and Wendy Martinez.
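The percent-change calculation between two data points is simple enough to sketch. The example below is in Python rather than R, and the series values and week labels are invented placeholders, not EIA data; the function name is also hypothetical.

```python
def percent_change(old: float, new: float) -> float:
    """Percent change from `old` to `new`, relative to `old`."""
    if old == 0:
        raise ValueError("percent change is undefined when the starting value is 0")
    return (new - old) / old * 100.0


if __name__ == "__main__":
    # Placeholder weekly series keyed by a week label (values are made up).
    series = {"week 1": 200.0, "week 2": 220.0, "week 3": 250.0}
    start, end = "week 1", "week 3"
    change = percent_change(series[start], series[end])
    print(f"{start} to {end}: {change:+.1f}%")
```

Because the user can pick any two points, the function takes the two values directly rather than assuming they are adjacent in the series.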

Challenges: data confidentiality; need for an R server to make apps/programs public. Shiny apps can only be put on a webpage via iframes or by setting up an account on a cloud server (e.g., DigitalOcean).

Division of Consumer Expenditure Surveys Arcenis Rojas Economist Division of Consumer Expenditure Surveys www.bls.gov/cex 202-691-6884 rojas.arcenis@bls.gov