datalibweb – Stata module to access micro data

Slides:



Advertisements
Similar presentations
EBSCO Discovery Service
Advertisements

DDI for the Uninitiated ACCOLEDS /DLI Training: December 2003 Ernie Boyko Statistics Canada Chuck Humphrey University of Alberta.
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
New Release Announcements and Product Roadmap Chris DiPierro, Director of Software Development April 9-11, 2014
Unveiling ProjectWise V8 XM Edition. ProjectWise V8 XM Edition An integrated system of collaboration servers that enable your AEC project teams, your.
1 SDMX Reference Infrastructure (SDMX-RI) Work in progress, status and plans Bengt-Åke Lindblad, Adam Wroński Eurostat Eurostat Unit B3 – IT and standards.
11 CONFIGURING AND MANAGING SHARED FOLDER SECURITY Chapter 8.
Michael Donovan, River Campus Libraries – 12/03 DocuShare Overview and Training.
1 Computing for Todays Lecture 22 Yumei Huo Fall 2006.
1 of 6 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
MCDST : Supporting Users and Troubleshooting a Microsoft Windows XP Operating System Chapter 5: User Environment and Multiple Languages.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
Installing software on personal computer
What is so good about Archie and RevMan 5
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 7 Configuring File Services in Windows Server 2008.
Working with SharePoint Document Libraries. What are document libraries? Document libraries are collections of files that you can share with team members.
REDCap Overview Institute for Clinical and Translational Science Heath Davis Fred McClurg Brian Finley.
World Bank: Microdata Library Development Data Group.
6/1/2001 Supplementing Aleph Reports Using The Crystal Reports Web Component Server Presented by Bob Gerrity Head.
SOFTWARE.
©Kwan Sai Kit, All Rights Reserved Windows Small Business Server 2003 Features.
Understanding the Web Site Development Process. Understanding the Web Site Development You need a good project plan Larger projects need a project manager.
Module 7: Fundamentals of Administering Windows Server 2008.
Using CIITS to Create Common School & District Assessments Copyright © 2011 Schoolnet, Inc. All rights reserved.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
What’s new in Kentico CMS 5.0 Michal Neuwirth Product Manager Kentico Software.
A Networked Machine Management System 16, 1999.
Module 3 Configuring File Access and Printers on Windows 7 Clients.
REDCap Overview Institute for Clinical and Translational Science Fred McClurg Neil Nuehring.
REDCap Overview Institute for Clinical and Translational Science Heath Davis Fred McClurg Brian Finley.
Gateway to Global Aging Data September 17 th, 2014 APRU Data Workshop Drystan Phillips.
6/1/2001 Supplementing Aleph Reports Using The Crystal Reports Web Component Server Presented by Bob Gerrity Head.
Module 8 : Configuration II Jong S. Bok
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Sitecore.net Training, Oct ECM 2.1 UPDATE 2 PART 1 CRAWL BEFORE YOU WALK.
XP Creating Web Pages with Microsoft Office
Reference Management Module I: Introduction By Rehema Chande-Mallya(PhD)
Network and Server Basics. Learning Objectives After viewing this presentation, you will be able to: Understand the benefits of a client/server network.
Knowledge Hub Walkthrough August
Knowledge Hub Walkthrough August
The Ultimate SharePoint Admin Tool
REDCap General Overview
Architecture Review 10/11/2004
SharePoint 101 – An Overview of SharePoint 2010, 2013 and Office 365
SmartCenter for Pointsec - MI
Installation The Intercompany Integration Solution for SAP Business One Version 2.0 for SAP Business One 9.1 Welcome to the course on the installation.
Internet Made Easy! Make sure all your information is always up to date and instantly available to all your clients.
Session
Getting Started with... Business Partner Express
Presenter: Chris Blake, Associate Director
IST 220 – Intro to Databases
MANAGEMENT OF STATISTICAL PRODUCTION PROCESS METADATA IN ISIS
Using E-Business Suite Attachments
CARA 3.10 Major New Features
Get to know SQL Manager SQL Server administration done right 
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 6: Planning, Configuring, And Troubleshooting WINS.
Section 15.1 Section 15.2 Identify Webmastering tasks
Installation The Intercompany Integration Solution for SAP Business One Version 2.0 for SAP Business One 9.1 Welcome to the course on the installation.
Increased Efficiency and Effectiveness
Getting Started.
Getting Started.
Microsoft Office Access 2003
Training course Part 2: Administration tasks
Unit4 Customer Portal Knowledge User Access.
A Guide for Getting Started
Microsoft Azure Data Catalog
Contract Management Software 100% Cloud-Based ContraxAware provides you with a deep set of easy to use contract management features.
SysKit Security Manager
Palestinian Central Bureau of Statistics
Presentation transcript:

datalibweb – Stata module to access micro data The World Bank The Poverty and Equity Global Practice Global Solutions Group on Welfare Measurement and Statistical Capacity July 2017

Motivation Improve access to data databases version control Organize micro-data in a logical and structured way version control Keep track of corrections, modifications or updates of the micro-data Ability to replicate numbers calculated with different versions from the current one. knowledge sharing Ensure knowledge is sharing and results can be replicated users working with micro-data at the same time Ensure that all the users are using the same data source

What is datalibweb? It is a global data system and improved version of datalib/datalib2 WEB: User registration and subscription and Management System, Web API Stata: Core encrypted functions to authenticate registered WB users, identifies and grants access to the microdata from the server based on users’ profiles of survey subscriptions Stata: Support to modular plugins (i.e. other related functions to process the retrieved data within user’s local machine [such as metadata on the surveys, convert to PPP, merge with different modules and raw data, etc])

Benefits of datalibweb Accessibility All users have access to the same micro-data and use by default the latest versions When available, quick access to metadata such as questionnaires, interviewers manual, reports, among others (through Microdata Library) Institutional memory Ensures that all results can be replicated Saves hard-drive space Nobody needs to have the micro-data on their computer’s hard-drive It is easy for quality control programing All do-files that call –datalibweb- might be shared and run among users directly No changes or modifications needed No micro-data is lost Faster communication between different units within the WB

Benefits of datalibweb It is IT-independent IT does not have to provide permissions to new users. Users need to register and subscribe to raw/original and harmonized collections they want (one time) Increase level of security Users can access the microdata based on their registration and subscriptions to surveys from multiple regional and GP network drives (must access through Stata) It works in a modular way Each region, Global Practice is able to contribute their own collections/modules that adapt to specific needs of use Easy management and reporting Better management of users (monitoring the usages, and informing users about new and updated database). Better integration of the system with other Bank applications/databases such as Microdata Library, the IW portal Regions and GPs maintain their own collection of databases Each region maintains and curates their databases, and ensure that users always get the latest data (real time).

Adoption of datalibweb is growing

Adoption of datalibweb is growing

How it works Go to FURL: datalibweb Follow instruction to install the Datalibweb package Subscribe to the surveys/collections you want to have access to. Poverty GP colleagues by default have access to the Global Poverty Working Group Database (GPWG-DB). GPWG-DB is the first global database of 1100 microdata that was used to produce the international poverty indicators and Shared Prosperity, it is maintained by the GPWG with contributions. Raw/original and harmonized data: Currently we have about 18 collections (both raw and harmonized), maintained by each regional statistical teams and the global team. Highly interactive (no code) or deeply nested code Users can use interactive interface with adaptive guided information or go straight to the code. Working with vintages/version controls Better control of vintages for users based on unique Survey Identifier. Survey identification enables the user to call a specific version or vintage of the survey of choice. For example, TZA_2011_HBS_v01_M refers to “Tanzania - Household Budget Survey 2011“

Installing datalibweb Manual installation: Get the file from this link: http://eca/povdata/datalibweb/_ado/datalibweb.zip  Copy with replacement all the files into c:/ado/plus, without changing the folder structure Enter this line in Stata net install datalibweb, all replace force from("http://eca/povdata/datalibweb/_ado/") datalibweb, update(ado)

Adaptive guided information for datalibweb users

datalibweb users can also use script to access data Typing multiple entries in “country” option produces an appended dataset of selected countries and year(s) of the chosen collection. datalibweb, country(ALB ARM) year(2008) type(GPWG) clear Survey ID: ALB_2008_LSMS_v01_M Title: Living Standards Measurement Survey 2008 Link to DDI: Survey metadata (microdata portal)   Survey ID: ARM_2008_ILCS_v01_M Title: Integrated Living Conditions Survey 2008 When raw data is requested, user has an option to choose amongst individual files within the survey. datalibweb, country(ALB) year(2012) type(ECARAW) clear Survey ID: ALB_2012_LSMS_v01_M Title: Living Standards Measurement Survey 2012 Link to DDI: Survey metadata (microdata portal) 1. [datalibweb, country(ALB) year(2012) type(ECARAW) surveyid(ALB_2012_LSMS_v01_M) filename(bookbread.dta)] ...   41. [datalibweb, country(ALB) year(2012) type(ECARAW) surveyid(ALB_2012_LSMS_v01_M) filename(weights_identification.dta)]

Future features of datalibweb User web interface for subscriptions and usage Regional admin can upload subscription and maintain their data repositories Much better performance Local option/getfile to save and reuse the data in your machine while on mission Interactive explore of the data availability (raw and harmonized data, documentations and dofile) API for querying different types of files (data, dofile, questionnaire) Graphical User Interface

datalibweb performance We care a lot about performance of the system and the secured environment of your data! Getting the catalog and a specific file each might take about 3-7 seconds, conditional on the network location, traffic usage and machine. We are trying to reduce the time with some caching listings at the cost of real-time listings.

Flexible subscription mechanism User subscriptions will be based on “Types”, which can be: Data Documentation Dofile Or anything Within each Type, for example, you can have access to different folders of data. For example under the Data type, you can have “Data\Stata” or “Data\Base” folders, and each folder contains different stages of data harmonization. Similarly, within Type “Documentation”, users can have access to Doc/Questionnaire, Doc/Technical, or Doc/Reports Or you can create your own “Type”

Web interface for user’s subscription and usage

datalibweb from your laptop Users can run datalibweb while on mission or on planes (no need to have WB intranet network) Users need to use datalibweb to save the files into your machines – option “getfile” Users use the “local” option – just add the “local” to the end of your normal code The system can remember the query “getfile” and update or remove the files at the user’s request

datalibweb interactivity Users can explore the availability of data and documentation for both Raw/original and harmonization, and their own subscription.

datalibweb interactivity – view by vintage Users can explore the availability of data and documentation for both Raw/original and harmonization, and their own subscription.

datalibweb interactivity – view by available modules Users can explore the availability of data and documentation for both Raw/original and harmonization, and their own subscription.

datalibweb interactivity – view by available modules Users can explore the availability of data and documentation for both Raw/original and harmonization, and their own subscription.

datalibweb interactivity – view by available modules Users can explore the availability of data and documentation for both Raw/original and harmonization, and their own subscription.

datalibweb Stata GUI – Country view The graphical user interface will be replacement of Windows Explorer to see what surveys we have across surveys and what type of documentations we can access organized by Types. It also links with user subscriptions and let users know what they subscribed to. Users can filter the surveys by various options such as “latest version”, “data only”, “subscription status” or by keywords matched.

datalibweb Stata GUI – collection view The collection view shows all the harmonized collection within a server. It is organized by the collection type and allow you to see all the countries and vintages within that collection.

datalibweb Stata GUI – export survey lists Users can export the list of surveys in the view to text or clipboard for sharing or to prepare the list of non-subscribed surveys by sub-menu options.

datalibweb Stata GUI – Subscription request email

SOL – Statistics online (future)

Please let us know if you have any question/suggestion. datalibweb@worldbank.org FURL: datalibweb