Using Shiny to Efficiently Process Survey Data Carl Ganz, Akbar Akbari Esfahani, Hongjian Yu & Ninez Ponce UCLA Center for Health Policy Research Company.

Slides:



Advertisements
Similar presentations
April, 2004 Lars Thygesen International Trade Expert meeting Whats going on at OECD: statistical information management.
Advertisements

© RightNow Technologies, Inc. RightNow Connect Web Services for SOAP Chris Omland.
WEB DESIGN TABLES, PAGE LAYOUT AND FORMS. Page Layout Page Layout is an important part of web design Why do you think your page layout is important?
Key Considerations for Report Generation & Customization Richard Wzorek Director, Production IT Confidential © Almac Group 2012.
What’s New in Office Visio 2007 Microsoft Office Visio 2007 drawing and diagramming software makes it easy for IT and business professionals to.
Chapter 7 UNDERSTANDING AND DESIGNING FORMS. Input Forms: Content and Organization Need for forms Event analysis and forms Relationship between input.
These courseware materials are to be used in conjunction with Software Engineering: A Practitioner’s Approach, 6/e and are provided with permission by.
Dr. Kalpakis CMSC 461, Database Management Systems Introduction.
Managing Master Data with MDS and Microsoft Excel
CRYSTAL REPORTS Jacob Grogan. CRYSTAL REPORTS AND WHY IT’S USEFUL? “ Crystal Reports is a popular Windows-based report generation program that allows.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Data Transformation for Analysis Purposes Presented By: Gregg Ravenscroft Khulisa Management Services
ISOWare Presentation January 2009 ISOWARE is a management tool, that simple and efficient describes and communicates Business Processes. ISOWARE is also.
Introduction to the Enterprise Library. Sounds familiar? Writing a component to encapsulate data access Building a component that allows you to log errors.
CS110/CS119 Introduction to Computing (Java)
Systems Analysis And Design © Systems Analysis And Design © V. Rajaraman MODULE 14 CASE TOOLS Learning Units 14.1 CASE tools and their importance 14.2.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
SednaSpace A software development platform for all delivers SOA and BPM.
>> Building a PPT from the ActiveInterface web pages Chris Harrington Active Interface, Inc.
Mastering Char to ASCII AND DOING MORE RELATED STRING MANIPULATION Why VB.Net ?  The Language resembles Pseudocode - good for teaching and learning fundamentals.
The Network Performance Advisor J. W. Ferguson NLANR/DAST & NCSA.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Introduction – Addressing Business Challenges Microsoft® Business Intelligence Solutions.
240-Current Research Easily Extensible Systems, Octave, Input Formats, SOA.
Mark Brady 11/19/2012 Southwest Florida Water Management District Data Analyst Interview.
Foundations of Technology The Systems Model
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
Using MS Excel to validate & load your data into Oracle EBS.
ETERE NUNZIO The ultimate end-to-end solution for your NewsRoom.
Scripting Just Enough SSIS to be Dangerous. 6/13/2015 Visit the Sponsor tables to enter their end of day raffles. Turn in your completed Event Evaluation.
Getting the Most outof EPM Converting FDM to FDMEE – What’s it all about? March 16, 2016 Joe Mizerk
AM18 DATA INTEGRATION TODAY AND TOMORROW Henning Lund, RapidiOnline.
Proposal of Satellite Data Center India Meteorological Department A.K.Sharma (Chairman), Virendera Singh (Member), R.K.Giri (Member) and N.Puviarasan (Member.
University of Colorado at Denver and Health Sciences Center Department of Preventive Medicine and Biometrics Contact:
SAP MDG (Master Data Governance) online training Online | classroom| Corporate Training | certifications | placements| support CONTACT US: MAGNIFIC TRAINING.
WEB API AND CLOUD DEVELOPMENT BY TRAWEX TECHNOLOGIES.
Take Your Data Analysis and Reporting to the Next Level by Combining SAS Office Analytics, SAS Visual Analytics, and SAS Studio David Bailey Tim Beese.
Advanced Higher Computing Science
Creo Spec-Driven Piping
Investigating System Requirements
Wallpaper only – on screen during welcome and chat
Understanding SPSS II Workshop Series July 19, 2017.
Creo Spec-Driven Piping
A very brief introduction to R
Principles of Information Systems Eighth Edition
Software Specification Tools
Center-wide strategy and plans Clark Judy Julia Collins
Excel-to-PowerPoint Document Automation
Accessing Spatial Information from MaineDOT
Business Intelligence Design and Development Michael A. Fudge, Jr.
Enterprise Content Management, Shared Services, & Contract Management
PRG 421 MART Lessons in Excellence-- prg421mart.com.
PRG 421 GUIDE Lessons in Excellence -- prg421guide.com.
PRG 421 GUIDE Education for Service-- prg421guide.com.
What’s New in Colectica 5.3 Part 1
Data Collection in MTM Choosing the right method for survey data collection.
Chapter 12: Automated data collection methods
SDMX: A brief introduction
The 2nd Generation Live Database
YTY − an integrated production system for business statistics
Experience with XML – based production of publications Case of « Statistical yearbook 2005 and 2006  » Guy Zacharias Centralisation et Diffusion STATEC.
Performance and Scalability Issues of Multimedia Digital Library
Agenda Why go Mobile? Why Navara? Challenges to going Mobile?
Development Goals for Year 2
Sample Assessment & Governance Results
Data compilation and pre-validation
Just Enough SSIS Scripting to be Dangerous.
Integrated Statistical Production System WITH GSBPM
Presentation transcript:

Using Shiny to Efficiently Process Survey Data Carl Ganz, Akbar Akbari Esfahani, Hongjian Yu & Ninez Ponce UCLA Center for Health Policy Research Company or University Logo Problem Example 1: Public Use File Example 2: Upcoding Example 3: Metadata Content experts must thoroughly review our data for sensitive information before public use files (PUF) are created. This requires creating new, less sensitive variables by collapsing, top/bottom coding, and grouping variables. These new variables need to be generated in SAS, but content experts are not comfortable with SAS. In the past, programmers would generate frequencies for content expert who in turn would recommend new variables via email. With Shiny we have custom GUI that allows content experts to interactively generate new variables, and the SAS code required to create them. At the UCLA Center for Health Policy Research, we continuously process the annual California Health Interview Survey (CHIS). As is the case with many large surveys, CHIS relies on the work of statisticians along with a variety of content experts from public health, sociology, epidemiology, and other areas. Many tasks require extensive back and fourth between the statisticians, and the content experts. The majority of this work is done via Excel, because it is accessible to a variety of stakeholders. Typically, any input required from the content expert (i.e. the names of the requested variables) is inputted to excel. A Statistician then writes a SAS program to read the excel table, and generate the required output. This non-interactive workflow is slow, and error-prone, because the content experts get delayed feedback. Many questions from our survey are open response meaning there are unaccountably many possible answers. Content experts must review the open responses, and categorize the answers into one of finitely many categories. Many responses can be matched to similar responses in the past, but others require human review. In the past, content experts would manually review, and categorize in Excel. With Shiny we created a custom interface where content experts can harness R’s text-mining capabilities to suggest similar upcodes. For each year’s survey, we generate metadata for each variable, including label, formats, variable type, etc. In the past, the metadata was managed in Excel because it is accessible to a wide audience. Excel presented problems because data was untidy, and there were referential integrity issues across years. SQL is a logical alternative for solving these issues, but it is much less user friendly for non-programmers. With Shiny, we developed an interface that allows content experts to download SQL data to Excel, make changes in Excel, and then upload those changes back to SQL. App A App B App C Solution Ideally, content experts would work in an interactive environment that validates their work and gives them the results immediately. Luckily, R’s web-framework Shiny makes it easy to create such tools. To see examples visit: www.github.com/carlganz/CSP2017