Project? Microdata? Say what? TRY Conference May 5, 2008 Suzette Giles, Ryerson University Laine Ruus, University of Toronto.

Slides:



Advertisements
Similar presentations
Aggregate Data and Statistics
Advertisements

EQUINOX DATA DELIVERY SYSTEM May 31, 2011 –Elizabeth Hill Equinox.uwo.ca.
DDI for the Uninitiated ACCOLEDS /DLI Training: December 2003 Ernie Boyko Statistics Canada Chuck Humphrey University of Alberta.
The Economic and Social Data Service (ESDS) Kevin Schürer ESDS/UKDA ESDS Awareness Day 5 December 2003.
The Economic and Social Data Service (ESDS) Karen Dennison UK Data Archive Improving access to government datasets 18 January 2007.
Labour Force Historical Review Sandra Keys, University of Waterloo DLI OntarioTraining University of Guelph, Guelph, ON April 12, 2006.
SPSS Session 1: Levels of Measurement and Frequency Distributions
Chuck Humphrey Data Library University of Alberta.
First Year in Focus at Canadian Colleges and Universities.
11 ACS Public Use Microdata Samples of 2005 and 2006 – How to Use the Replicate Weights B. Dale Garrett and Michael Starsinic U.S. Census Bureau AAPOR.
Working with Taxfiler (T1FF) Data Wayne Chu Planning Analyst Social Development, Finance & Administration, City of Toronto Toronto CDP Face-to-Face, September.
Introducing Statistics and Data Geographic, Statistical and Government Information Centre, Susan Mowers.
LSP 121 Week 2 Intro to Statistics and SPSS/PASW.
Unlocking Public Opinion Poll Data in Canada May 27, 2009 IASSIST 2009 Michelle Edwards, PhD, University of Guelph Jane Fry, Carleton University.
Chuck Humphrey, Leah Vanderjagt and Anna Bombak University of Alberta The Winter Institute on Statistical Literacy for Librarians Demystifying statistics.
Chuck Humphrey & Lynne Robinson University of Alberta Surviving Statistics Strategies for dealing with statistical questions on the reference desk.
Searching the University of Alberta Library’s Statistics Canada-based Websites 2001 Census of Canada Canadian Centre for Justice Statistics Canadian Business.
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library March 6, 2009.
Statistics and Data for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 27, 2008.
EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008.
IT Student Survey 2009 Your name here. Overview Over 1,400 responses were received. Students across all faculties, nationalities and years were represented.
ISR Training February 12, 2010 Data Retrieval from Statistics Canada Surveys.
Basic Concept of Data Coding Codes, Variables, and File Structures.
 Definition of HTML Definition of HTML  Tags in HTML Tags in HTML  Creation of HTML document Creation of HTML document  Structure of HTML Structure.
Merging census aggregate statistics with postal code-based microdata Laine Ruus University of Toronto. Data Library Service ,
The Field (California) Poll. What is the Field Poll? The Field Poll was established in 1947 by Mervin Field. An independent non-partisan survey of California.
U.S. Decennial Census Finding and Accessing Data Summer Durrant October 20, 2014 Data & Geographical Information Librarian Research Data Services
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
SDA: a tool for teaching and research with microdata Laine Ruus University of Toronto. Data Library Service.
Next on OPRAH – Bringing Data Out of the Closet Walter Giesbrecht, Data Librarian York University Jeff Moon, Head, Documents Unit Queen’s University OLA.
Finding Data & GIS Files at the U of S Library Darlene Fichter & Elise Pietroniro
MADGIC Workshop Series, 2009 MADGIC Presents: How to make the most of your data resources (and have fun in the process)
Searching for Statistics Why can’t we find the data we need? Where should we even start?
EScience in Action Paula Hurtubise, Anna Laurence, Jeff Moon.
Nesstar: A Web-based Data Extraction and Analysis System Richard Pinnell & Sandra Keys, University of Waterloo Libraries.
DLI Training April 2004 Kingston Ontario. DDI What, Why, How?
Michel Séguin DLI Chief December 2006 The Need to Liberate The Data.
How women drivers compare to men? MALE (Column %) FEMALE (Column %) Better Drivers As Good Drivers Worse Drivers Don’t Know
Data and Social Research Chuck Humphrey Data Library Rutherford North Library.
Collaborative Markup of Library and Research Data Examples from Ontario Council of University Libraries (OCUL)
A Journey in Data Discovery Wendy Watkins TSES October, 2007.
DLI Workshop -- Mar Hosted by Dalhousie University March 2000 DLI Training Workshop.
Ontario Data Documentation, Extraction Service and Infrastructure IASSIST 2008 Palo Alto, California.
The Census of Canada and Immigration & Ethno-cultural Data Chuck Humphrey University of Alberta February 10, 2006.
DLI Boot Camp 2011 Finding Statistics: Tools and Techniques Jean Blackburn Vancouver Island University Library SDA.
Panel Study of Entrepreneurial Dynamics Richard Curtin University of Michigan.
Soc : Principles of Research Design LONGITUDINAL DATA Sunny Kaniyathu, Data Services Librarian.
Data Reference The data reference interview And… Cool tools and strategies.
A Few IQSS Quasi-Research Projects Gary King Institute for Quantitative Social Science Harvard University (talk at the Harvard Academic Computing Committee,
Editing of linked micro files for statistics and research.
Jeff Moon Data Librarian & Academic Director, Queen’s Research Data Centre Statistics & Data& Data An OverviewAn Overview
Ontario Data Documentation, Extraction Service and Infrastructure.
Handling Reference Questions DLI Orientation Session Kingston, Ontario April 5, 2004.
DLI and EQUINOX Question 1 How do I find out what survey datasets are available from Statistics Canada ?
How to be a happy data back up, impress your users and keep learning. Erin Alcock Memorial University of Newfoundland.
Hosted by the University of Regina Library December 1999 DLI Training Workshop Chuck Humphrey.
Data and Statistics: As easy as 1-2-3? Carolyn DeLorey, MLIS St. Francis Xavier University Atlantic DLI Workshop UNB Fredericton April 28, 2015.
Laine Ruus University of Toronto.Data Library Service
OVERVIEW OF THE DATA LIBERATION: Licence, Products, & Services Mike Sivyer Ontario DLI Training, April 5, 2004.
Real Time Remote Access: Educational resources Susan Mowers, University of Ottawa.
New Web Tools from NCCS Linda Lampkin & Tom Pollak Center on Nonprofits and Philanthropy at the Urban Institute ARNOVA Annual Conference Denver November.
NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. REDCap:
ESTAT & CANSIM DLI Equinox <odesi> ICPSR
The Institute of Quantitative Social Science
DDI for the Uninitiated
From information to data, and back again
University of Regina Library
From information to data, and back again
Data Liberation Initiative (DLI)
Exploring the DLI Product line
Presentation transcript:

project? Microdata? Say what? TRY Conference May 5, 2008 Suzette Giles, Ryerson University Laine Ruus, University of Toronto

Acronym of “Ontario Data Documentation, Extraction Service and Infrastructure Initiative” A new product delivered through Scholars Portal Collaboration between OCUL and Ontario Buys Web-based resource discovery to a growing collection of Canadian data 3 rd generation statistics and data extraction system Equalizes access to selected statistics and data for all Ontario universities

A web-based extraction system Provides equal access to these resources for all Ontario universities Contains diverse, quality, numeric (microdata) data sets Allows data resource discovery, extraction and analysis

First some background about Why is this resource so important that Ontario Buys and OCUL (Ontario Council of Ontario Libraries) are investing over $1.4 million? How will it support teaching and research in quantitative methods and contribute to statistical literacy? How will it help me at the Reference Desk?

Why is this resource so important that Ontario Buys and OCUL (Ontario Council of Ontario Libraries) are investing over $1.4 million? 23 universities & colleges in Ontario belong to DLI, only 7 have full-time staff ‘doing’ data therefore, large differences in data access by faculty and students Goes some way to making access to data and statistics more easily findable.

How will it support teaching and research in quantitative methods and contribute to statistical literacy? As a resource discovery tool, ability to search across collections at a more finely-grained level than Stats Can provides Access to resources that have not previously been readily accessible, eg Canadian Gallup polls Ability to quickly and easily display descriptive statistics, regardless of whether the resource is aggregate statistics or microdata. Web-based access supports use in classroom as well as 24x7 access for research.

How will it help me at the Reference Desk? Easy access to a collection of statistics and data in a uniform interface Blurs the distinctions between aggregate statistics and microdata Enable the creation, on the fly, of aggregate statistics that have not been published elsewhere

What are microdata and why do I need to know about them? Microdata are the actual responses that survey or census respondents give to a questionnaire, Usually translated into a numeric format so that one can do arithmetic with them For example: Income: (1) high, (2) medium, (3) low –Average = ???? Income: (1) $13,725 (2) $118,297 (3) $63,958 –Average = $65,327

What are microdata? A person’s sex or gender could be recorded as Male or Female, but instead it will be coded 1 or 2. 1 = male 2 = female or the other way 1 and 2 are the VALUE given to the VARIABLE in this case sex (or gender) A microdata file consists of numbers

From microdata are generated descriptive statistics Microdata –A person is working or not working Aggregate statistics –A count of the number of persons not working (in a geographic area) –A count of the number of persons not working divided by the number of persons in the labour force = the unemployment rate

…and more descriptive statistics Microdata –A family has a gross annual income in year 2005 Aggegregate (descriptive) statistics –Families in a geographic area have an average income –50% of families in a geographic area have an income above (or below) the median income –LICO is the % of families in a geographic area that have an income below the low income cut-off for that geographic area and family size

Two systems Available to all Ontario universities Searching metadata across files Descriptive statistics only Download system files (users needs to have software) SDA 10 universities subscribe, incl Ryerson & York More advanced statistical analysis functions Download raw data & syntax files (user needs to create system files)

What’s changed? In the ‘bad old days’ –Statistics were published in books/periodicals, data were ‘published’ mainly as files of microdata or very extensive aggregate statistics –You needed access to a mainframe or PC.Mac –You needed special software (SAS, SPSS, Stata) –You needed training in the production of descriptive statistics (weighting, types of data and appropriate types of descriptive statistics)

But nowadays! Statistics are published as Excel files, Beyond 20/20 files, and microdata are available in 3 rd generation interfaces All you need is a computer and a web browser Generate descriptive statistics with a few mouse clicks – tho’ you still need to know how to interpret them (and knowing about weighting is a good idea too!) Users can download data 24x7 for further processing on their own workstations with appropriate software