Data and Social Research Chuck Humphrey Data Library Rutherford North Library.

Slides:



Advertisements
Similar presentations
DLI Orientation: Concepts
Advertisements

January OLA Super Conference 2009 S.Giles, D.Jakubek Ryerson University Census? Statistics? E-Stat can give you the answers! Ontario Library Association.
Aggregate Data and Statistics
Elizabeth Hamilton Atlantic DLI Training April 29, 2005.
Public Use Microdata File (PUMF) 1. Change factors 2. Scenarios : characteristics 3. Analytic Content: additions and losses Outline DLI Ontario.
DLI Orientation: Concepts A Framework for Thinking about Statistical Information Train the Trainers Montreal, March 9, 2004 Chuck Humphrey Data Library.
Using American FactFinder John DeWitt Project Manager Social Science Data Analysis Network Lisa Neidert Data Services Population Studies Center.
Chuck Humphrey Data Library University of Alberta.
SOCI 272 Library Instruction Class: Locating and Using Census Data Brenda Smith, MA, MLIS 18 November 2009.
Geo-referenced data and DLI aggregate data sources Chuck Humphrey University of Alberta September 29, 2008.
Using American FactFinder John DeWitt Project Manager Social Science Data Analysis Network Lisa Neidert Data Services Population Studies Center.
SOCI 272 Library Instruction Class: Locating and Using Census Data November 2010.
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 26, 2009.
Chuck Humphrey University of Alberta Digital Reference: Statistics & Data LIS 536 March 4, 2009.
Chuck Humphrey, Leah Vanderjagt and Anna Bombak University of Alberta The Winter Institute on Statistical Literacy for Librarians Demystifying statistics.
Anna Bombak, Chuck Humphrey, Lindsay Johnston and Leah Vanderjagt University of Alberta The Winter Institute on Statistical Literacy for Librarians Demystifying.
Chuck Humphrey & Lynne Robinson University of Alberta Surviving Statistics Strategies for dealing with statistical questions on the reference desk.
Searching the University of Alberta Library’s Statistics Canada-based Websites 2001 Census of Canada Canadian Centre for Justice Statistics Canadian Business.
Anna Bombak, Chuck Humphrey, Lindsay Johnston, Angie Mandeville and Leah Vanderjagt Winter Institute on Statistical Literacy for Librarians, February 18-20,
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library March 6, 2009.
Statistics and Data for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 27, 2008.
EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008.
ISR Training February 12, 2010 Data Retrieval from Statistics Canada Surveys.
Geo-referenced data and DLI aggregate data sources Chuck Humphrey University of Alberta ACCOLEDS 2007.
NAICS? YIKES! (North American industry classification system (NAICS)? Yearly index of constant (k) dollar estimates (YIKES)!) Jeff Moon, Queens
The Crime Scene Justice Data and the Case of Multiple Files in GSS 18 Chuck Humphrey University of Alberta Atlantic DLI Workshop April 20-21, 2006.
Anna Bombak, Chuck Humphrey, Angie Mandeville, Leah Vanderjagt and Amanda Wakaruk Winter Institute on Statistical Literacy for Librarians, February 23-25,
11 The American Community Survey Steve Murdock, Ph.D. Director, Hobby Center for the Study of Texas Rice University.
The Winter Institute on Statistical Literacy for Librarians Demystifying statistics for the practitioner Anna Bombak, Chuck Humphrey, Larry Laliberte,
Next on OPRAH – Bringing Data Out of the Closet Walter Giesbrecht, Data Librarian York University Jeff Moon, Head, Documents Unit Queen’s University OLA.
Finding Data & GIS Files at the U of S Library Kiran Doranalli Lucy Li
Searching for Statistics Why can’t we find the data we need? Where should we even start?
Statistics are ubiquitous “Statistics are generated today about nearly every activity on the planet. Never before have we had so much statistical information.
Packaged Serendipity: Preserving Context through Metadata Robert Cole Sharon Farnel Chuck Humphrey Digital Preservation Seminar University of Alberta 5.
Units of Analysis The Basics. Outline Definitions Elements of the unit of analysis Data structure.
 According to UN recommendations and the Statistical Law of Cambodia, the Royal Government of Cambodia is Committed to conducting a general population.
American Community Survey Overview September 4, 2013 Tim Gilbert American Community Survey Office.
Chuck Humphrey, University of Alberta Atlantic DLI Training, 2008 DLI Orientation: Concepts A Framework for Thinking about Data and Statistics.
1 The 2001 Census PUMFS Odyssey Sponsored by HAL and PALS Presented by Chuck Humphrey.
DLI Workshop -- Mar Hosted by Dalhousie University March 2000 DLI Training Workshop.
NAICS? YIKES! Or North American industry classification system (NAICS)? Yearly index of constant (k) dollar estimates (YIKES)!
Chuck Humphrey, University of Alberta Digital Reference: Statistics and Data LIS 536 March 5, 2008.
The Census of Canada and Immigration & Ethno-cultural Data Chuck Humphrey University of Alberta February 10, 2006.
Framework of Statistical Information. This is a typology of the categories or classes of statistical information. Remember the relationship between statistics.
Finding Time Series Data U of S Library Harpreet Aulakh Darlene Fichter
October 2008 Getting to Know Data Sources SOC 3140 Prof. Sylvie Lafrenière Susan Mowers, GSG / Library.
ISR Training February 12,  Types of information you’ll find  Searching the website  Finding statistics using... ◦ Browse By Subject (Summary.
Soc : Principles of Research Design LONGITUDINAL DATA Sunny Kaniyathu, Data Services Librarian.
ISR Training Jan. 21,  Canada’s largest survey  Complete population count  Gathers information on the demographic, social and economic conditions.
January 20089SOC4112 Getting to Know Data Sources Geographic, Statistical and Government Information Centre GSG Team Susan Mowers.
Data on the Foreign Born in 2010: Accessing Information on Immigrants and Immigration from the U.S. Census Bureau’s American Community Survey Thomas A.
Beyond 20/20 for Beginners. Plan Who needs Beyond 20/20 anyway? ◦ What is Beyond 20/20, and what can we do with it? Pros and cons of using 20/20 How to.
Units of Analysis The Basics. Outline An illustration Definitions Elements of the unit of analysis Complexity Data structure.
DATA and STATISTICS … at your service! S.Mowers & the GSG team ©2009, University of Ottawa.
RRM : Resource Data and Environmental Modeling DATA SOURCES Sunny Kaniyathu, Data Services Librarian.
Sociology 343 Chuck Humphrey Data Library University of Alberta.
CTPP in TranStats The One-Stop Shop of Transportation Data
Finding Data: Vital Statistics Geography 342.3; Community planning in Canada Kiran Doranalli Lucy Li Data & GIS Library Services, U of S Library
DLI Training - Ontario 16 April, 2015 Elizabeth Hill, Western University Survey of Household Spending.
Stretching Your Data Management Skills Chuck Humphrey University of Alberta Atlantic DLI Workshop 2003.
Data in context Chapter 1 of Data Basics. Frameworks Today, we will be presenting two frameworks for thinking about the content of data services. A.Statistics.
Anticipating Great Things: A 2006 Census Preview June, 2006 DLI, Ottawa, ON Paul Schwets // Stuart Fyffe.
Hosted by the University of Regina Library December 1999 DLI Training Workshop Chuck Humphrey.
Data and Statistics: As easy as 1-2-3? Carolyn DeLorey, MLIS St. Francis Xavier University Atlantic DLI Workshop UNB Fredericton April 28, 2015.
User Services Focus, value and attitude Vocabulary stories: wash & wear, circ & dingo Statistics and data.
Rural Development Finding data and statistics.  Statistics Canada: Federal statistical agency  Data released under the Data Liberation Initiative (DLI)
Geo-referenced data and DLI aggregate data sources
Navigating Your Way Through the EFT, Nesstar and Beyond 20/20 (WDS)
DLI Orientation: Concepts
University of Regina Library
Presentation transcript:

Data and Social Research Chuck Humphrey Data Library Rutherford North Library

Outline Research data Connect how social research uses quantitative evidence with data Discuss how statistics are related to data Statistics are about definitions and classifications Being a critical user of statistics Understanding the Census Aggregate data and microdata

Uses of quantitative evidence Providing a description of social phenomena This typically entails answering the question about the scale or scope of some social group or characteristics of the group. Making a comparison among social entities This typically involves establishing the degree of similarity or dissimilarity among social entities. Identifying relationships among social variables This approach looks at the correlation among social phenomena. How are things related?

How statistics and data differ Statistics numeric facts & figures derived from data, i.e, already processed presentation-ready needs definitions published Data numeric files created and organized for analysis or processing requires processing not display-ready needs detailed documentation disseminated, not published

Six dimensions or variables in this table The cells in the table are the number of estimated smokers. Geography Region Time Periods Social Content Smokers Education Age Sex Statistics and data

Statistics are about definitions! Statistics are dependent on definitions. You may think of statistics as numbers, but the numbers represent measurements or observations based on specific definitions.definitions Tables are structured around geography, time and social content based on attributes of the unit of observation. These properties all need definitions.

Statistics are about definitions! Consider the following example from the 2006 Canadian Census on the data behind some statistics about visible minorities. Visible Minority Groups (15), Generation Status (4), Age Groups (9) and Sex (3) for the Population 15 Years and Over of Canada, Provinces, Territories, Census Metropolitan Areas and Census Agglomerations, 2006 Census - 20% Sample Data

Statistics are about definitions! How is visible minority status identified in the Census? Are aboriginals among the visible minority in Canada? What is the definition of visible minority?

Classifications Sex Total Male Female Periods Statistics involve classifications

Some classifications are based on standards while others are based on convention or practice. For example, Standard Geography classificationsStandard Geography classifications Statistics involve classifications

Statistics are derived from observational, experimental and computational data. A table is a format for displaying statistics and presents a summary or one view of the data. Tables are structured around geography, time and attributes of the unit of observation. Statistics are dependent on definitions. Working with data requires some computing skills with analytic software. Facts about statistics and data

Who published this statistic?  Can you name the producer or distributor of the data?  You need this information to provide a citation for each statistic.  You should ask yourself what motive is behind this published statistic. What view of the data is shown in this statistic?  What level of geography is shown?  What time period is shown?  What social characteristics are shown? Questions to ask about statistics

What concepts are represented in this statistic?  Are definitions provided with the statistic for geography, time or the social characteristics?  Was a standard classification system used for the categories of the statistic? Can you identify a data source for the statistic?  If there isn’t a data source, the statistic isn’t real.  Is there enough information that you could find the data?  Can you name the data source itself? Questions to ask about statistics

Statistics are presentation ready Tables and charts (or graphs) are typically used to display many statistics at once. You will find statistics sprinkled in text as part of a narrative describing some phenomenon; but tables and charts are the primary methods of organizing and presenting statistics.

Population and demographics The Census is one of the most important sources of statistical information about Canada. It is the largest survey conducted in Canada and, consequently, is the primary source for small area statistics. To use data from the Census, you must know:  The characteristics collected in the Census that are available for the spatial units used to disseminate results;  The variety of spatial units used to disseminate Census results.

Census of Population Two forms are used to collect the Census: 2A, which goes to 80% of the households, and 2B, which goes to the other 20%. In 2006, the 2A form contained 8 questions while the 2B form had these 8 plus 53 additional questions. Long history of specific questions (see the Census Dictionary.)history of specific questions You need to understand the content of the Census to know what statistics are possible from the Census.

Post- Censal Post- Censal PALS EDS APS PALS EDS APS PUMF RDC DATA Public Use Microdata Aggregate STATS STC Website Custom Tabulations DLI CENSUS 2006 CENSUS 2006

Microdata and aggregate data Microdata from observational methods created from the respondents in a survey Aggregate Data statistics organized in a data file structure derived from microdata sources used in GIS & time series analysis

Spatial Unit Geo-code

Geo-referenced data The unit analysis makes up the rows in the data file and is the object being described by the other variables the file. The values for this variable are geo-codes for Census tracts.

Geo-referenced data This case in the data file represents Census Tract , which was shown in the image two slides earlier.

The variety of spatial units Statistics Canada groups the variety of spatial units associated with the Census into two groups: Source for the graphics: Illustrated Glossary, 2006 Census Geography, Statistics Canada Source: Illustrated Glossary, 2006 Census Geography, Statistics Canada

Urban small area statistics Census Metropolitan Areas Source for the graphic: Illustrated Glossary, 2006 Census Geography, Statistics Canada Metropolitan Areas 2006Map of Edmonton CMA

Census results for 2006 Standard Census data products Highlight tables Profiles Census trends Topic-based tabulations For smaller areas outside CMAs or for dissemination areas, need to retrieve from the Data Library Data Library Public use microdata files for individuals

CANSIM CANSIM is a very large database containing socio-economic statistics for Canada. There are currently over 38 million time series organized in approximately 2,800 tables. The statistics in CANSIM come from surveys (e.g., the Labour Force Survey), administrative data (e.g., crime and justice) and simulations or models (e.g., population projections).surveyscrime and justicepopulation projections Geography, content and time are basic to retrieving time series from CANSIM.

Online copies of questionnaires and data documentation from DLI.   Online catalogues, such as the Statistics Canada DLI title list, the ICPSR catalogue, the CESSDA portal and the ASSDA Nesstar server.     Tools for working with data

Online access to data through IDLS  Off line access through the Data Library  Rutherford North, 1 st Floor ( ) Statistical software, such as SPSS  Tools for working with data