Data and Statistics: As easy as 1-2-3? Carolyn DeLorey, MLIS St. Francis Xavier University Atlantic DLI Workshop UNB Fredericton April 28, 2015.

Slides:



Advertisements
Similar presentations
Chapter 1 Getting Started Understandable Statistics Ninth Edition
Advertisements

Research methods – Deductive / quantitative
Chuck Humphrey Data Library University of Alberta.
Canada’s Health by Region By: Jack Wei and Colin McClenaghan.
Chapter 1 The Where, Why, and How of Data Collection
Demystifying Data Reference Helping non-specialists make sense of data.
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 26, 2009.
Causal-Comparative Research
Chuck Humphrey, Leah Vanderjagt and Anna Bombak University of Alberta The Winter Institute on Statistical Literacy for Librarians Demystifying statistics.
Anna Bombak, Chuck Humphrey, Lindsay Johnston and Leah Vanderjagt University of Alberta The Winter Institute on Statistical Literacy for Librarians Demystifying.
Chuck Humphrey & Lynne Robinson University of Alberta Surviving Statistics Strategies for dealing with statistical questions on the reference desk.
Anna Bombak, Chuck Humphrey, Lindsay Johnston, Angie Mandeville and Leah Vanderjagt Winter Institute on Statistical Literacy for Librarians, February 18-20,
Business research methods: data sources
Chuck Humphrey, Leah Vanderjagt and Anna Bombak University of Alberta The Winter Institute on Statistical Literacy for Librarians Demystifying statistics.
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library March 6, 2009.
Statistics and Data for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 27, 2008.
EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008.
Chapter 5 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
THE UNIVERSITY OF MISSISSIPPI The University of Mississippi Institute for Advanced Education in Geospatial Science Census to American Community Survey.
Introduction to Data Analysis *Training Session*
Anna Bombak, Chuck Humphrey, Angie Mandeville, Leah Vanderjagt and Amanda Wakaruk Winter Institute on Statistical Literacy for Librarians, February 23-25,
The Winter Institute on Statistical Literacy for Librarians Demystifying statistics for the practitioner Anna Bombak, Chuck Humphrey, Larry Laliberte,
INTRODUCTION TO STATISTICS MATH0102 Prepared by: Nurazrin Jupri.
© The McGraw-Hill Companies, Inc., by Marc M. Triola & Mario F. Triola SLIDES PREPARED BY LLOYD R. JAISINGH MOREHEAD STATE UNIVERSITY MOREHEAD.
Research Methodology. Refers to search for knowledge. Research is an academic activity.
Statistics The science of collecting, analyzing, and interpreting data. The Statistical Problem Solving Process: 1.Ask a question of interest 2.Produce.
Statistics are ubiquitous “Statistics are generated today about nearly every activity on the planet. Never before have we had so much statistical information.
Outcome Based Evaluation for Digital Library Projects and Services
Statistics, Data, and Statistical Thinking
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys Bangkok,
Data and Social Research Chuck Humphrey Data Library Rutherford North Library.
Chuck Humphrey, University of Alberta Atlantic DLI Training, 2008 DLI Orientation: Concepts A Framework for Thinking about Data and Statistics.
1 The 2001 Census PUMFS Odyssey Sponsored by HAL and PALS Presented by Chuck Humphrey.
DLI Workshop -- Mar Hosted by Dalhousie University March 2000 DLI Training Workshop.
Sociological Research Methods Sociology: Chapter 2, Section 1.
An Overview of Statistics
PSY 307 – Statistics for the Behavioral Sciences Chapter 1.
American Community Survey (ACS) 1 Oregon State Data Center Meeting Portland State University April 14,
LECTURE 2 TUESDAY, 1 September STA 291 Fall
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Agresti/Franklin Statistics, 1 of 33 Chapter 1 Statistics: The Art and Science of Learning from Data Learn …. What Statistics Is Why Statistics Is Important.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 1-4 Critical Thinking.
PROCESSING OF DATA The collected data in research is processed and analyzed to come to some conclusions or to verify the hypothesis made. Processing of.
Psychological Research Methods Psychology: Chapter 2, Section 2.
Introduction to Scientific Research. Science Vs. Belief Belief is knowing something without needing evidence. Eg. The Jewish, Islamic and Christian belief.
1 Statistics Statistics can be found in all aspects of life:
Week 2 The lecture for this week is designed to provide students with a general overview of 1) quantitative/qualitative research strategies and 2) 21st.
Sociology 343 Chuck Humphrey Data Library University of Alberta.
4.4 Marketing Research.
Introduction to Statistics BUSA 2100, Sect. 1.0, Dr. Moore.
If you had a million dollars, what would you spend it on? Think about this question and be prepared to share aloud with the class.
Major Science Project Process A blueprint for experiment success.
1-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
1 Introduction to Statistics. 2 What is Statistics? The gathering, organization, analysis, and presentation of numerical information.
Data in context Chapter 1 of Data Basics. Frameworks Today, we will be presenting two frameworks for thinking about the content of data services. A.Statistics.
BY: ALEJANDRA REYES DALILA OCHOA MARY GARCIA Part A Introduction to Research Methods Topics 1-5.
1 Data Collection and Sampling ST Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results of a statistical.
Hosted by the University of Regina Library December 1999 DLI Training Workshop Chuck Humphrey.
User Services Focus, value and attitude Vocabulary stories: wash & wear, circ & dingo Statistics and data.
Avoiding Bias Chapter 2.5 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 1 Statistics: The Art and Science of Learning from Data Section 1.1 Using Data to Answer.
A Training Course for the Analysis and Reporting of Data from Education Management Information Systems (EMIS)
Chapter 29 Conducting Market Research. Objectives  Explain the steps in designing and conducting market research  Compare primary and secondary data.
Data Coaching Services Types of Data 1. 2 o Qualitative Data vs. o Quantitative Data o Where do student and teacher data fall within these categories?
Chapter 8 Research: Gathering and Using Information.
2.1 Simple Random Sampling
Presentation transcript:

Data and Statistics: As easy as 1-2-3? Carolyn DeLorey, MLIS St. Francis Xavier University Atlantic DLI Workshop UNB Fredericton April 28, 2015

What are we talking about? 1. Data 2. Statistics 3. Statistical Literacy A study or a surveyVariables are measuredDataThe data is analyzedStatistics

1. Data 0 The direct result of research (a study or a survey) 0 Not display-ready 0 Needs detailed documentation 0 Needs to be processed 0 Use data if you need to do an in-depth analysis or research

2. Statistics 0 Numeric facts and figures 0 Produced when data is analyzed 0 Quantitative evidence: Provide a description Make a comparison Identify a relationship 0 Presentation-ready 0 Usually presented as percentages, charts, or graphs 0 Needs definitions and classifications 0 Use statistics if you need quick facts to make a point

Statistics are “real” only if they are derived from data.

3. Statistical Literacy 0 Ability to use numerical information in everyday life 0 Ability to think critically about numbers and statistics used in arguments, media, advertising, reports, etc. 0 Some knowledge of the process 0 Consider the source 0 Be aware of errors or biases 0 Ability to read, interpret, and describe numbers in statements, surveys, tables, and graphs 0 Understand terms and key concepts used 0 Conventions in creating graphs and charts; violations

Key Concepts 0 Location 0 Variables 0 Unit of Observation 0 Universe There are 6 variables in this table.

Unit of Observation 0 The entity for which data are collected and that statistics describe or summarize 0 Business surveys: 0 Company 0 Establishment 0 Social surveys: 0 Census family 0 Economic family 0 Household 0 Individuals

Unit of Observation Unit of observation attributes: Smokers, education, age, sex.

Universe 0 Universe is the group, collection, or population from which a representative sample is drawn. 0 Members of the universe have common or defining characteristics or features. 0 The universe includes all members of the unit of observation, while the sample consists of just those members from whom data are collected. 0 Statistics Canada uses “target population” to describe each survey’s universe. Universe (Target Population) Sample (Unit of Observation)

Variables: Average tuition; Discipline; Academic year; Province Title Statistical Metric: Dollars Footnote Producer Date Universe: Undergraduate; Full-time; Canada

Think Critically 0 Quality of the data 0 Sampling method 0 Non-response bias 0 Leading questions 0 Presentation of the statistics 0 Misleading 0 Percentage vs. absolute numbers 0 Rankings 0 Qualifiers Compared to what? Says who? Since when?

Being a critical user of statistics 0 Currency 0 How recent is the statistic? 0 What time period is shown? 0 Reliability 0 Is the statistic based on quality data? 0 Is the data source provided? 0 Authority 0 Who published this statistic? 0 What is the source of the data? 0 Purpose / Point of View 0 What view of the data is shown in this statistic? 0 Fact, opinion, bias? 0 Are definitions provided (geography, time, social characteristics)?

Consider the Data Source: A “real” statistic may be derived from poor quality data. 1.Bad sampling Too small Sample chosen 2.Non-Response bias 3.Leading questions Image source:

Bad Sampling 0 Sample can be too small 0 Some groups may be over or under represented 75% of UNB students come to the library 5 days per week Based on a sample of 4 UNB students Increase my sample to 200 UNB students Survey was given to students who were studying in the library Universe (Target Population) Sample (Unit of Observation)

Non-Response Bias 0 What makes some people respond to a survey and others not? 0 May be the survey topic 0 May be characteristics of people being surveyed Image source:

Leading Questions 0 The wording of the question may cause one response to be given more often than another “Do you think the government should help people who are unable to find work?” “Do you feel you should be taxed so some people can get paid for staying home and doing nothing?” Image source:

Good Data - Misleading Statistics 0 Numbers can be chosen to make the point you want 0 Percentage vs. Absolute numbers 0 Rankings 0 Qualifiers Image source:

Percentage vs. Absolute Numbers 0 Which is more impressive? 0 Which is more useful? Company A and Company B both layoff 10% of their employees Company A had 200,000 employees. News report says: “Company A lays off 20,000 employees.” Company B had 100 employees. News report says: “Company B lays off 10% of employees.” Image source:

Rankings 0 Based on comparisons rather than specific amounts. 0 Are categories clear? 0 What do we know about the actual amounts? Heart disease is ranked the # 1 cause of death in Canada. 1.Cancer 2.Heart Disease 3.Cerebrovascular Disease 4.Respiratory Disease 5.Accidents 1.Heart Disease 2.Lung Cancer 3.Colon Cancer 4.Cerebrovascular Disease 5.Breast Cancer Cancer is ranked the # 1 cause of death in Canada.

Qualifiers 0 Some words may not seem important … but they are 0 Creating categories “The brown bear is the largest land predator in the world.” “land”“predator”

Statistics are about definitions 0 Statistics are dependent on the concepts they summarize. 0 Numbers represent measurements or observations based on specific definitions. Image source:

Statistics are about definitions 0 How is visible minority defined? 0 Are aboriginals among the visible minority in Canada? Visible Minority Groups (15), Generation Status (4), Age Groups (9) and Sex (3) for the Population 15 Years and Over of Canada, Provinces, Territories, Census Metropolitan Areas and Census Agglomerations, 2006 Census - 20% Sample Data Geography = Canada Age groups (9) = Total - Age groups Sex (3) = Total - Sex Visible minority groups (15)Total - Generation status 1st generation 2nd generation 3rd generation or more Total - Population by visible minority groups Total visible minority population [5] Not a visible minority [11]

Statistics are about definitions

Statistics involve classifications 0 Some classifications are based on standards 0 Standard Geography classifications 0 North American Industrial Classification System (NAICS) 0 International Classification of Diseases (ICD) 0 Others are based on convention or practice 0 Classifications involve categories 0 Sex (Total, Male, Female) 0 Age groups 0 Classifications are shaped by their definitions

Definitions and Metadata 0 Codebook 0 User’s Guide 0 Data Dictionary 0 Survey questionnaires 0 Instructions to interviewers Metadata

Statistical Literacy: Hands-On

Variables: Average tuition; Full-time students; Discipline; Province; Academic year; Columns: 5 academic years Rows: 16 disciplines Cells: Average tuition fees Unit of measure: Canadian dollars Geography: Canada

Write out a statement comparing the number of days spent freshwater fishing versus saltwater fishing. 1. “There were 433 million freshwater fishing days and 86 million saltwater fishing days.” 2. “There were more freshwater fishing days than saltwater fishing days.” 3. “The number of freshwater fishing days are 5 times more than the number of saltwater fishing days.”

What percentage of Great Lakes anglers fished for perch? 1. 2% 2. 36% OR 40% (0.5 million/1.4 million) OR (.02/.05 =.4) 0 The percentages provided in the table do not match the percentage requested in the question. 0 “36% of all Great Lakes anglers fish for perch and 2% of all anglers fish the Great Lakes for perch.”

How many bird watchers participate in that activity both around the home and away from home? 0 Bird watching around the home and bird watching away from home are not mutually exclusive activities. 0 Answer: – 47.7 = 14 million Isaacson, M. (2012). Lost: Assessing student basic survival skills in the statistical wilderness using real data [PDF document ]. Retrieved from

Reference DeskOfficeTotal