1 OLA Conference February 2008 Session 1022 Jeff Moon Head, Maps, Data, & Government Information Centre (MADGIC) Queen’s University An Introduction to.

Slides:



Advertisements
Similar presentations
Archiving Trevor Croft MICS3 Data Archiving, Dissemination and Further Analysis Workshop Geneva - November 6th, 2006.
Advertisements

MICS4 Survey Design Workshop Multiple Indicator Cluster Surveys Survey Design Workshop Data Archiving.
EQUINOX DATA DELIVERY SYSTEM May 31, 2011 –Elizabeth Hill Equinox.uwo.ca.
DDI for the Uninitiated ACCOLEDS /DLI Training: December 2003 Ernie Boyko Statistics Canada Chuck Humphrey University of Alberta.
DLI Training Nesstar Workshop
The Economic and Social Data Service (ESDS) Kevin Schürer ESDS/UKDA ESDS Awareness Day 5 December 2003.
Nesstar, ESDS International and ESDS Qualidata online demonstrations ASLIB visit to the UK Data Archive Wednesday 24 November 2004 Louise Corti, Associate.
The Economic and Social Data Service (ESDS) Karen Dennison UK Data Archive Improving access to government datasets 18 January 2007.
1 Adding a statistics package Module 2 Session 7.
Data Made Easy! Away Day MacOdrum Library, Carleton University Jane Fry May 1st, 2008.
Metadata at ICPSR Sanda Ionescu, ICPSR.
An introduction to data entry, data analysis, and graphing using SPSS
ODESI Introduction to Data, Library Data Services, and Why we need ODESI?
Jeff Moon Data Librarian & Academic Director, Queen’s Research Data Centre Statistics & Data& Data An OverviewAn Overview
Table manners GAP Toolkit 5 Training in basic drug abuse data management and analysis Training session 10.
WELCOME TO THE ANALYSIS PLATFORM V4.1. HOME The updated tool has been simplified and developed to be more intuitive and quicker to use: 3 modes for all.
Ontario University Library Consortia Activity Ontario University Library Consortia Activity Gwendolyn Ebbett Dean of the Library University of Windsor.
First Year in Focus at Canadian Colleges and Universities.
NESSTAR Limitedw w w. n e s s t a r. c o m DDI-Publishing Made Easy- the Nesstar Way Jostein Ryssevik Nesstar Ltd.
The Minority Data Resource Center Felicia LeClere, Ph.D. Director, MDRC.
Unlocking Public Opinion Poll Data in Canada May 27, 2009 IASSIST 2009 Michelle Edwards, PhD, University of Guelph Jane Fry, Carleton University.
SADC Course in Statistics Adding a statistics package Module I3, Session 13.
Web of Science: An Introduction Peggy Jobe
1 Introduction to OBIEE: Learning to Access, Navigate, and Find Data in the SWIFT Data Warehouse Lesson 8: Printing and Exporting an OBIEE Analysis This.
eListen is a product of Scantron’s mission is to deliver the most advanced testing and assessment, data collection and systems maintenance products and.
ISR Training February 12, 2010 Data Retrieval from Statistics Canada Surveys.
PubMed/How to Search, Display, Download & (module 4.1)
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
DLI Training – Ontario Region April 3, 2008 Carleton University An Introduction to.
The Field (California) Poll. What is the Field Poll? The Field Poll was established in 1947 by Mervin Field. An independent non-partisan survey of California.
Support.ebsco.com EBSCOhost Basic Searching for Academic Libraries Tutorial.
Tabs to main publication types Links in the orange navigation bar for: News Librarians Users Guide Price List alerts 1. Top Navigation Bar General.
SDA: a tool for teaching and research with microdata Laine Ruus University of Toronto. Data Library Service.
Next on OPRAH – Bringing Data Out of the Closet Walter Giesbrecht, Data Librarian York University Jeff Moon, Head, Documents Unit Queen’s University OLA.
Lecture Four: Steps 3 and 4 INST 250/4.  Does one look for facts, or opinions, or both when conducting a literature search?  What is the difference.
EScience in Action Paula Hurtubise, Anna Laurence, Jeff Moon.
Chuck Humphrey Data Library Co-ordinator University of Alberta May 16, Capitalising on Metadata Tool development plans IASSIST 2007.
Nesstar: A Web-based Data Extraction and Analysis System Richard Pinnell & Sandra Keys, University of Waterloo Libraries.
DLI Training April 2004 Kingston Ontario. DDI What, Why, How?
How women drivers compare to men? MALE (Column %) FEMALE (Column %) Better Drivers As Good Drivers Worse Drivers Don’t Know
Data and Social Research Chuck Humphrey Data Library Rutherford North Library.
Collaborative Markup of Library and Research Data Examples from Ontario Council of University Libraries (OCUL)
A Journey in Data Discovery Wendy Watkins TSES October, 2007.
IASSIST 2008 Collection, Communication, Access and Preservation IASSIST 2008 – session E3 Yesterday, Today and Tomorrow: Data on the Web from Vision to.
About the OECD Why am I here? Why is access to online information important? Libraries and Librarians play a crucial role in the innovation process.
Ontario Data Documentation, Extraction Service and Infrastructure IASSIST 2008 Palo Alto, California.
DLI Boot Camp 2011 Finding Statistics: Tools and Techniques Jean Blackburn Vancouver Island University Library SDA.
Accessing journals by via PubMed Note the link to find articles through HINARI/PubMed. Using this option will be covered in later in the Short Course.
Beyond 20/20 for Beginners. Plan Who needs Beyond 20/20 anyway? ◦ What is Beyond 20/20, and what can we do with it? Pros and cons of using 20/20 How to.
United Nations Regional Seminar on Census Data Dissemination and Spatial Analysis Amman - Jordan 16 – 19 May 2011 Determination of the scope and form of.
Project? Microdata? Say what? TRY Conference May 5, 2008 Suzette Giles, Ryerson University Laine Ruus, University of Toronto.
Jeff Moon Data Librarian & Academic Director, Queen’s Research Data Centre Statistics & Data& Data An OverviewAn Overview
DATA and STATISTICS … at your service! S.Mowers & the GSG team ©2009, University of Ottawa.
Basics of Biostatistics for Health Research Session 1 – February 7 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
Ontario Data Documentation, Extraction Service and Infrastructure.
Ontario Data Documentation, Extraction Service and Infrastructure.
Real Time Remote Access: Educational resources Susan Mowers, University of Ottawa.
Finding Data Files at the U of S Library Sociology 398, Social Inequality and Health Kiran Doranalli Lucy Li Data & GIS Library Services, U of S Library.
Data Access North of the (US) Border
Navigating Your Way Through the EFT, Nesstar and Beyond 20/20 (WDS)
Introduction to OBIEE:
General Social Survey Enquête sociale générale
Ontario Data Documentation, Extraction Service and Infrastructure centralised, standardised web-based data extraction/analysis system.
Beyond 20/20 for Beginners.
What’s New in Colectica 5.3 Part 1
General Social Survey Enquête sociale générale
Ontario Data Documentation, Extraction Service
ICPSR: Resources for Instructors Finding and Analyzing Data 9/26/2012
The role of metadata in census data dissemination
Data Liberation Initiative (DLI)
Presentation transcript:

1 OLA Conference February 2008 Session 1022 Jeff Moon Head, Maps, Data, & Government Information Centre (MADGIC) Queen’s University An Introduction to

No statistics Do I want to use Statistics ? NO Flowchart: ‘Do I want to use statistics?’

What we’ll cover: What is survey data, and what’s the big deal? What’s happening in Ontario on the ‘data front’? Show me the goods… Why is this important at my library?

What is Survey Data and what’s the big deal? Tables, Charts, Graphs (in Books, CD-ROM, the WWW) A ‘number’ Survey Data (machine-readable) Data continuum… (Microdata)

What is Survey Data and what’s the big deal? Percentages Counts Standard Deviations Cross-tabs More advanced AnalysisMeans Statistical Analysis continuum… Descriptive Statistics Inferential Statistics

What is Survey Data and what’s the big deal? Tables, Charts, Graphs (in Books, CD-ROM, the WWW) A ‘number’ Survey Data (machine-readable) Statistics… Percentages Counts Standard Deviations Cross-tabs More advanced AnalysisMeans Statistical Analysis… (Microdata)

Survey DataAggregate Data PostcardCamera “Fixed” “Flexible” What is Survey Data and what’s the big deal?

We’ll look at the flexibility of survey data a bit later on… In the mean time, let’s look at the situation in Ontario right now…

1990’s Home-grown survey data systems - Guelph, Western, Queen’s - No ‘cataloguing’ standard - Varying features/capabilities - Served a purpose at the time 2000’s Emerging data cataloguing standards Data Documentation Initiative -- an international standard for describing survey data. Like ‘MARC’, only for data Mature commercial software solutions Software such as Nesstar, SDA, and others

In 2005, the Data IN Ontario (DINO) working group of OCUL (O ntario Council of University Libraries) started thinking about moving beyond ‘home-grown’ data solutions, adopting the DDI standard, and building a province-wide data solution. A discussion paper followed… In 2007, with funding from OCUL and “Ontario Buys”, a Project Director was hired, and hardware/software purchased through Scholars Portal. OCUL & Ontario Buys Commercial Software Scholars Portal DDI Standard O ntario D ata Documentation, E xtraction S ervice and I nfrastructure Initiative

Lead institutions in are Carleton and Guelph, with in-kind assistance from Queen’s University. First step was developing a Canadian ‘best practices’ document for cataloguing data files using DDI – analogous to AACR2 for MARC. Next, survey files were ‘marked up’ (catalogued) and loaded onto a test server at Guelph. The team at Scholars Portal is working with to establish a data server and load data files.

12 Use of the Data Documentation Initiative standard facilitates: Interoperability. XML-compliant DDI Codebooks can exchanged and transported seamlessly, and applications can be written to work with these homogeneous documents. Richer content. The DDI encourages better description of social science datasets, providing researchers with a better ‘window’ into what is available Single document - multiple purposes. DDI codebook contain all of the information necessary to produce several different types of output, including: a traditional social science codebook, a bibliographic record, and SAS/SPSS/Stata data definition statements. Thus, the document may be repurposed for different needs and applications. On-line subsetting and analysis. Because the DDI markup extends down to the variable level and provides a standard uniform structure and content for variables, DDI documents are easily imported into on-line analysis systems, rendering datasets more readily usable for a wider audience. Precision in searching. Since each of the elements in a DDI-compliant codebook is tagged, searches across documents and studies are possible.

13 SOFTWARE CHOSEN  NESSTAR Developed by the “Norwegian Social Science Data Services” -- Ne tworked S ocial S cience T ools a nd R esources In use internationally (Europe, UK, US, Canada) In Ontario: Queens, Guelph, Carleton, Windsor, Ottawa, U. of T. and Statistics Canada use Nesstar DDI compliant Search by keyword for surveys and survey questions Do basic data exploration and analysis on the web Download full datasets or subsets in popular formats Export tables and charts

15 Nesstar Publisher produces DDI-compliant metadata using a set of structured tags, grouped into ‘tabs’ in Publisher.

Document Description Tab

17 Study Description Tab

18 Other Study Materials Tab

19 File Description Tab

20 Variables Tab

21 Variable Groups Tab

22 Data Entry Tab

23 Other Materials Tab

24 Once ready, a ‘marked up’ survey file is ‘published’ to the Nesstar Server where it becomes available through Nesstar Webview.

Let’s take a look at how can be used to answer a research question. How do men and women differ in perceptions of their health (using weight as an example). Concepts? Health Body Mass Index (BMI) Weight Males/Females

Starting point: A simple search on the Statistics Canada web site…

“Fixed” “Flexible”

29

30

31

32

33

34 Variable ‘groups’ Variables

35 Basic ‘frequencies’ or ‘marginals’ for categorical variables…

36 Descriptive statistics for ‘continuous’ variables…

37 But what if we want to look at more than one variable at a time? Say, for instance, the issue of weight and gender ?

38 Before proceeding, you must log into the Nesstar System

39 OK… now we want to add gender as a variable.

40

41 Opinion of own weight, by sex Proportionally, more women than men had the opinion that they were “Overweight”.

42 OK, but how does this change if we add an ‘objective’ measure of weight, such as ‘Body Mass Index’ (BMI)?

43 Start where we left off… ‘opinion of own weight’, by sex But add another variable as a ‘layer’…

44 Add ‘BMI class’ as a layer…

45 Of respondents who were ‘objectively’ underweight, proportionally more women than men had the ‘subjective’ opinion that they were “Just About Right”. Layer = those with a BMI indicating ‘underweight’

46 Of respondents who were ‘objectively’ normal weight, proportionally more women than men had the ‘subjective’ opinion that they were “Overweight”. Layer = those with a BMI indicating ‘normal weight’

47 Layer = those with a BMI indicating ‘overweight’ Of respondents who were ‘objectively’ overweight, proportionally more MEN than women had the ‘subjective’ opinion that they were “Just About Right”.

OK, I have an confession to make…

Statistical Weight… All the previous slides ignored an important concept… that of weight. Not ‘weight in kilograms’ but rather ‘statistical weight’. We don’t want to describe the sample… we want to describe the population at large (in this case, Canadians 18+). Statistical weights are assigned by statisticians, not surprisingly, to each individual in a sample, based on a variety of demographic and sampling considerations. These weights reflect how many people a given respondent ‘represents’ in the population being studied. Sample count  Population Estimate Statistical weight

Weight ‘off’: Note the sample sizes Weight ‘on’: Note the sample sizes But also note the differences in percentages…

In general, you must apply the Statistical Weight in order to get valid results. It is easy to turn weight ‘on’ in Nesstar ( ), or other statistical packages (e.g. SPSS, SAS, STATA). BUT READ THE DOCUMENTATION

They say a picture is worth a thousand words… If this is true, then a good chart has to be worth at least a couple of hundred… Let’s revisit our data visually using the ‘bar chart’ feature of Nesstar.

Weight is on Barcharts showing weighted results: Proportionally, of those who are objectively underweight, more women than men think they are ‘just about right’

Weight is on Barcharts showing weighted results: Proportionally, of those who are objectively normal weight, more women than men think they are overweight

Weight is on Barcharts showing weighted results: Proportionally, of those who are objectively overweight, more men than women think they are ‘just about right’

Searching for ‘questions’ in Nesstar: Simple Search

Search results – Simple search You get all the surveys that have the ‘keyword’ you searched for… but specific questions (variables) are NOT highlighted.

Searching for ‘questions’ in Nesstar: Advanced Search Advanced Search

Advanced Search Screen

Search results – Advanced search Here, specific variables that meet the search criteria are shown, with the option of “opening in context”

61 Barchart Table Time series graph Map Clear Weight Subset Export to spreadsheet Download Export PDF Print Create bookmark Help Menu options:

OK, so what kind of data can I expect to find using ODESI? 1.Statistics Canada survey files released through the Data Liberation Initiative (Census PUMF’s, Special Surveys, General Social Surveys, and more) 2.Public Opinion Polls (e.g. Gallup) 3.Survey files from other sources (academics) These surveys and polls include questions on all manner of topics (politics, health, work, leisure, education, drug use, aging, spending, internet use, and many more)…

Let’s take a look at some Gallup questions… Dataset: Canadian Gallup Poll, August 1951, #212 In some cities in Canada, horsemeat is now being sold, because of the high price of other meats. If horsemeat were available here, would you be willing to try it? 35.9% of respondents said “Yes” they’d be willing. Of course, this questions begs for a yea or ‘ neigh ’ answer

Dataset: Canadian Gallup Poll, September 1956, #251 WOULD YOU FAVOR REQUIRING EVERY ABLE-BODIED YOUNG MAN IN THIS COUNTRY, WHEN HE REACHES THE AGE OF 18, TO SPEND ONE YEAR IN MILITARY TRAINING AND THEN JOIN THE RESERVES OR MILITIA? 65.7% favoured this.

Dataset: Canadian Gallup Poll, August 1953, #231 HOW MUCH DO YOU THINK A YOUNG MAN SHOULD BE EARNING PER WEEK BEFORE HE GETS MARRIED? $41 - $50 per week equals roughly $ $2600 annually.

Dataset: Canadian Gallup Poll, August 1953, #231 THERE'S AN ATTEMPT BEING MADE BY SOME FASHION LEADERS TO SHORTEN WOMEN'S SKIRTS. DO YOU THINK THAT WOMEN SHOULD FOLLOW THIS LEAD - AND WEAR SKIRTS SHORTER THAN THEY ARE NOW? 13% Shorter 82 % About the same 5 % Longer

DO YOU APPROVE OF THE USE OF BIRTH CONTROL? Tracking Opinions over time

1.Researchers can search across all surveys in a collection. 2.Researchers have the ability to explore surveys in more detail (e.g. looking at questions by gender, province, age group, income, etc.). 3.Tables can be saved in Excel or Adobe format. 4.Researchers can download data for use in more powerful statistical packages (SPSS, SAS, etc.) Key points about survey data in

In conclusion, ODESI will: 1.Provide a more level ‘data’ playing field for Ontario Universities. 2.Provide students and researchers with access to a substantial and growing body of survey and polling data, both current and historical. 3.Provide an easy, yet powerful, search and exploration tool (Nesstar) that will serve both beginners and ‘power users’. 4.Encourage cooperation and sharing of data and metadata in Ontario. 5.Serve as a potential model for other jurisdictions.