EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008.

Slides:



Advertisements
Similar presentations
DLI Orientation: Concepts
Advertisements

January OLA Super Conference 2009 S.Giles, D.Jakubek Ryerson University Census? Statistics? E-Stat can give you the answers! Ontario Library Association.
Aggregate Data and Statistics
DLI Orientation: Concepts A Framework for Thinking about Statistical Information Train the Trainers Montreal, March 9, 2004 Chuck Humphrey Data Library.
Using American FactFinder John DeWitt Project Manager Social Science Data Analysis Network Lisa Neidert Data Services Population Studies Center.
Chuck Humphrey Data Library University of Alberta.
2004 OLA - E-STAT Census and CANSIM data: Comparison of providers Presentation for OLA Conference 2004 “Discovering the World of Numbers: Statistics Canada’s.
SOCI 272 Library Instruction Class: Locating and Using Census Data Brenda Smith, MA, MLIS 18 November 2009.
Geo-referenced data and DLI aggregate data sources Chuck Humphrey University of Alberta September 29, 2008.
Using American FactFinder John DeWitt Project Manager Social Science Data Analysis Network Lisa Neidert Data Services Population Studies Center.
SOCI 272 Library Instruction Class: Locating and Using Census Data November 2010.
GEOG 3P47 Mapping Stats Colleen Beard Map & Geography Librarian Room MC C306
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 26, 2009.
Chuck Humphrey University of Alberta Digital Reference: Statistics & Data LIS 536 March 4, 2009.
Chuck Humphrey, Leah Vanderjagt and Anna Bombak University of Alberta The Winter Institute on Statistical Literacy for Librarians Demystifying statistics.
ECON3610 E-STAT Presented by: Christina Nilsen, Data Services Librarian Thompson Rivers University.
Anna Bombak, Chuck Humphrey, Lindsay Johnston and Leah Vanderjagt University of Alberta The Winter Institute on Statistical Literacy for Librarians Demystifying.
Chuck Humphrey & Lynne Robinson University of Alberta Surviving Statistics Strategies for dealing with statistical questions on the reference desk.
Searching the University of Alberta Library’s Statistics Canada-based Websites 2001 Census of Canada Canadian Centre for Justice Statistics Canadian Business.
Anna Bombak, Chuck Humphrey, Lindsay Johnston, Angie Mandeville and Leah Vanderjagt Winter Institute on Statistical Literacy for Librarians, February 18-20,
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library March 6, 2009.
Statistics and Data for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 27, 2008.
Small Area Statistics Standard Census Geography and Locating Small-Area Statistics.
Introduction to the Canadian Census of Population With Peter Peller Maps, Academic Data, Geographic Information Centre (MADGIC)
Geo-referenced data and DLI aggregate data sources Chuck Humphrey University of Alberta ACCOLEDS 2007.
Joint Canada/U.S. Health Survey Catherine Simile, National Center for Health Statistics Patrice Mathieu, Statistics Canada Ed Rama, Statistics Canada NCHS.
NAICS? YIKES! (North American industry classification system (NAICS)? Yearly index of constant (k) dollar estimates (YIKES)!) Jeff Moon, Queens
The Crime Scene Justice Data and the Case of Multiple Files in GSS 18 Chuck Humphrey University of Alberta Atlantic DLI Workshop April 20-21, 2006.
Anna Bombak, Chuck Humphrey, Angie Mandeville, Leah Vanderjagt and Amanda Wakaruk Winter Institute on Statistical Literacy for Librarians, February 23-25,
Merging census aggregate statistics with postal code-based microdata Laine Ruus University of Toronto. Data Library Service ,
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
Finding Data & GIS Files at the U of S Library Kiran Doranalli Lucy Li
Statistics are ubiquitous “Statistics are generated today about nearly every activity on the planet. Never before have we had so much statistical information.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
Units of Analysis The Basics. Outline Definitions Elements of the unit of analysis Data structure.
Doing data & statistics at the reference desk (some of) what you’ll need to know OLA Super Conference Walter W. Giesbrecht Data Librarian,
Data and Social Research Chuck Humphrey Data Library Rutherford North Library.
American Community Survey Overview September 4, 2013 Tim Gilbert American Community Survey Office.
Chuck Humphrey, University of Alberta Atlantic DLI Training, 2008 DLI Orientation: Concepts A Framework for Thinking about Data and Statistics.
1 The 2001 Census PUMFS Odyssey Sponsored by HAL and PALS Presented by Chuck Humphrey.
DLI Workshop -- Mar Hosted by Dalhousie University March 2000 DLI Training Workshop.
NAICS? YIKES! Or North American industry classification system (NAICS)? Yearly index of constant (k) dollar estimates (YIKES)!
Chuck Humphrey, University of Alberta Digital Reference: Statistics and Data LIS 536 March 5, 2008.
The Census of Canada and Immigration & Ethno-cultural Data Chuck Humphrey University of Alberta February 10, 2006.
Framework of Statistical Information. This is a typology of the categories or classes of statistical information. Remember the relationship between statistics.
Finding Time Series Data U of S Library Harpreet Aulakh Darlene Fichter
October 2008 Getting to Know Data Sources SOC 3140 Prof. Sylvie Lafrenière Susan Mowers, GSG / Library.
ISR Training February 12,  Types of information you’ll find  Searching the website  Finding statistics using... ◦ Browse By Subject (Summary.
Soc : Principles of Research Design LONGITUDINAL DATA Sunny Kaniyathu, Data Services Librarian.
Beyond 20/20 for Beginners. Plan Who needs Beyond 20/20 anyway? ◦ What is Beyond 20/20, and what can we do with it? Pros and cons of using 20/20 How to.
Units of Analysis The Basics. Outline An illustration Definitions Elements of the unit of analysis Complexity Data structure.
ACCESSING DATA Geog 342: Community Planning in Canada.
RRM : Resource Data and Environmental Modeling DATA SOURCES Sunny Kaniyathu, Data Services Librarian.
Sociology 343 Chuck Humphrey Data Library University of Alberta.
Finding Data: Vital Statistics Geography 342.3; Community planning in Canada Kiran Doranalli Lucy Li Data & GIS Library Services, U of S Library
Stretching Your Data Management Skills Chuck Humphrey University of Alberta Atlantic DLI Workshop 2003.
Data in context Chapter 1 of Data Basics. Frameworks Today, we will be presenting two frameworks for thinking about the content of data services. A.Statistics.
Anticipating Great Things: A 2006 Census Preview June, 2006 DLI, Ottawa, ON Paul Schwets // Stuart Fyffe.
Use of Standardized Metadata to Find, Select and Access Statistical Data - Experience of Statistics Canada - Joint UNECE/Eurostat/OECD Work Session on.
Hosted by the University of Regina Library December 1999 DLI Training Workshop Chuck Humphrey.
Data and Statistics: As easy as 1-2-3? Carolyn DeLorey, MLIS St. Francis Xavier University Atlantic DLI Workshop UNB Fredericton April 28, 2015.
User Services Focus, value and attitude Vocabulary stories: wash & wear, circ & dingo Statistics and data.
Soc 332.6: Principles of research design Finding statistics.
Rural Development Finding data and statistics.  Statistics Canada: Federal statistical agency  Data released under the Data Liberation Initiative (DLI)
Geo-referenced data and DLI aggregate data sources
Navigating Your Way Through the EFT, Nesstar and Beyond 20/20 (WDS)
DLI Orientation: Concepts
Units of Analysis The Basics.
University of Regina Library
Telling Canada’s story in numbers Marie-Josée Major
Presentation transcript:

EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008

Outline  Statistics and data  Distinction between statistics and data  Statistics are derived from data  Statistics are about definitions  Census characteristics  E-STAT access  Online demonstration of access to CANSIM and the 2006 Census of the Population

Numeric Information Statistics  numeric facts/figures  created from data, i.e, already processed  presentation-ready Data  numeric files created and organized for analysis/processing  requires processing  not display-ready

Numeric Information Six dimensions or variables in this table The cells in the table are the number of estimated smokers. Geography Region Time Periods Unit of Observation Attributes Smokers Education Age Sex

Statistics are about definitions! Statistics are dependent on definitions. You may think of statistics as numbers, but the numbers represent measurements or observations based on specific definitions.definitions Tables are structured around geography, time and content based on attributes of the unit of observation. These properties all need definitions.

Statistics involve classifications! Classifications Sex Total Male Female Periods

Some classifications are based on standards while others are based on convention or practice. For example, Standard Geography classifications Geography classifications Statistics involve classifications!

WHERE ARE THE DATA!

Microdata

Stories are told through statistics The National Population Survey had over 80,000 respondents in sample and the Canadian Community Health Survey in 2005 had over 130,000 respondents. How do we tell the stories about these people? We use statistics to create summaries of these life experiences. Data enable us to construct the tables or analyses to tell these summarized stories.

Methods producing data Observational Methods Experimental Methods Computational Methods Focus is on developing observational instruments to collect data Focus is on manipulating causal agents to measure change in a response agent Focus is on modeling phenomena through mathematical equations CorrelationCausationPrediction Replicate the analysis (same data or similar) Replicate the experiment Replicate the simulation Statistics summarize observations Statistics summarize experiment results Statistics summarize simulation results

Summary Statistics are derived from observational, experimental or simulated data. A table is a format for displaying statistics and presents a summary or one view of the data. Tables are structured around geography, time and attributes of the unit of observation. Statistics are dependent on definitions and classifications. Statistics summarize individual stories into common or general stories.

The Census The Census is one of the most important sources of statistical information about Canada. It is the largest survey conducted in Canada and, consequently, is the primary source for small area statistics. To use data from the Census, you must know:  The aggregate characteristics from the Census available for the various spatial units;  The variety of spatial units used to disseminate Census results; and  The codes used to represent the various Census spatial units.

Census of Population Two forms are used to collect the Census: 2A, which goes to 80% of the households, and 2B, which goes to the other 20%. In 2006, the 2A form contained 8 questions while the 2B form had these 8 and 53 additional questions. Long history of specific questions (see the Census Handbook.)history of specific questions You need to understand the content of the Census to know what statistics are possible from the Census.

Post- Censal PALS EDS APS PUMF RDC STATS STC Website E-STAT Custom Tabulations DLI CENSUS 2006 DATA Public Use Microdata Aggregate Confidential Microdata

Microdata and aggregate data Microdata from observational methods created from the respondents in a survey Aggregate Data statistics organized in a data file structure derived from microdata sources used in GIS & time series analysis

Spatial Unit Geo-code

Geo-referenced data The unit analysis makes up the rows in the data file and is the object being described by the other variables the file. The values for this variable are geo-codes for Census tracts.

Geo-referenced data This case in the data file represents Census Tract , which was shown in the image two slides earlier.

The variety of spatial units Statistics Canada groups the variety of spatial units associated with the Census into two groups: Source for the graphics: Illustrated Glossary, 2006 Census Geography, Statistics Canada Source: Illustrated Glossary, 2006 Census Geography, Statistics Canada

Administrative areas Source: Illustrated Glossary, 2006 Census Geography, Statistics Canada

Statistical areas

Census geo-codes Statistics Canada has two categories of geo-code systems:  Standard Geographic Classification (SGC)  Other geographic entities Source for the graphic: Illustrated Glossary, 2006 Census Geography, Statistics Canada

Standard geographic classification Source: Illustrated Glossary, 2006 Census Geography, Statistics Canada

Standard geographic classification, 2006 The link to Definitions, data sources and methods on the main page of the Statistics Canada website provides a link to Standard Classifications, which includes Geography. Definitions, data sources and methods Geography

Other geographic entities Census Metropolitan Areas Source for the graphic: Illustrated Glossary, 2006 Census Geography, Statistics Canada Metropolitan Areas 2006Map of Edmonton CMA

CANSIM CANSIM is a very large database containing socio-economic statistics for Canada. There are currently over 38 million time series organized in approximately 2,800 tables. The statistics in CANSIM come from surveys (e.g., the Labour Force Survey), administrative data (e.g., crime and justice) and simulations or models (e.g., population projections).surveyscrime and justicepopulation projections Geography, content and time are basic to retrieving time series from CANSIM.

E-STAT E-STAT is a free portal to retrieve Census results and CANSIM holdings, which is Statistics Canada’s large time series database. You can access more Census results from the Statistics Canada website, but E-STAT provides a wider variety of output formats for Census data. You can also access CANSIM from the Statistics Canada website, but you must pay $3.00 per time series.

E-STAT E-STAT is available from the Library’s homepage: Go to the list of Databases for access

Data Library If you need assistance, the Data Library is located in Rutherford North on the first floor next to the main staircase. Hours: 9:00 to noon and 1:00 to 4:30 M-F Phone: