Introduction to Quantitative Data Analysis (continued) Reading on Quantitative Data Analysis: Baxter and Babbie, 2004, Chapter 11. Course website:

Slides:



Advertisements
Similar presentations
Chapter 3, Numerical Descriptive Measures
Advertisements

Descriptive Measures MARE 250 Dr. Jason Turner.
Descriptive Statistics
Measures of Dispersion or Measures of Variability
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Another Information-Gathering Technique & Introduction to Quantitative Data Analysis Neuman and Robson Chapter 11. Research Data library at SFU
BHS Methods in Behavioral Sciences I April 18, 2003 Chapter 4 (Ray) – Descriptive Statistics.
QUANTITATIVE DATA ANALYSIS
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Analysis of Research Data
Introduction to Educational Statistics
Data observation and Descriptive Statistics
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Measures of Central Tendency
Some Introductory Statistics Terminology. Descriptive Statistics Procedures used to summarize, organize, and simplify data (data being a collection of.
Describing distributions with numbers
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
BIOSTATISTICS II. RECAP ROLE OF BIOSATTISTICS IN PUBLIC HEALTH SOURCES AND FUNCTIONS OF VITAL STATISTICS RATES/ RATIOS/PROPORTIONS TYPES OF DATA CATEGORICAL.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Chapter 3 – Descriptive Statistics
1.3 Psychology Statistics AP Psychology Mr. Loomis.
Methods for Describing Sets of Data
Thinking About Psychology: The Science of Mind and Behavior 2e Charles T. Blair-Broeker Randal M. Ernst.
Descriptive Statistics
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
Skewness & Kurtosis: Reference
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Introduction to Quantitative Data Analysis. Quantitative Data Analysis n Types of Statistics u Descriptive u Inferential—probabilistic sampling techniques,
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
Chapter Eight: Using Statistics to Answer Questions.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Summary Statistics: Measures of Location and Dispersion.
LIS 570 Summarising and presenting data - Univariate analysis.
Introduction to statistics I Sophia King Rm. P24 HWB
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Statistics and Data Analysis
Descriptive Statistics(Summary and Variability measures)
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Psychology’s Statistics Appendix. Statistics Are a means to make data more meaningful Provide a method of organizing information so that it can be understood.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Chapter 4: Measures of Central Tendency. Measures of central tendency are important descriptive measures that summarize a distribution of different categories.
Criminal Justice and Criminology Research Methods, Second Edition Kraska / Neuman © 2012 by Pearson Higher Education, Inc Upper Saddle River, New Jersey.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
Descriptive Statistics ( )
Exploratory Data Analysis
Methods for Describing Sets of Data
Business and Economics 6th Edition
Statistics.
APPROACHES TO QUANTITATIVE DATA ANALYSIS
NUMERICAL DESCRIPTIVE MEASURES
Description of Data (Summary and Variability measures)
STATS DAY First a few review questions.
Numerical Descriptive Measures
Descriptive Statistics
Research Statistics Objective: Students will acquire knowledge related to research Statistics in order to identify how they are used to develop research.
Descriptive and inferential statistics. Confidence interval
Data analysis and basic statistics
Univariate Statistics
Descriptive Statistics
Business and Economics 7th Edition
Presentation transcript:

Introduction to Quantitative Data Analysis (continued) Reading on Quantitative Data Analysis: Baxter and Babbie, 2004, Chapter 11. Course website: Audio recordings of Thursday lectures available on-line (for students registered in the course) at

Last Day: Beginning of Quantitative Data Analysis n Introduction to Common Ways of Presenting Statistics & Importance for Analysis (descriptive statistics) u Tables u Charts u Graphs n Univariate Statistics u Measures of Central Tendancy u Measures of Dispersion

Discrete & Continuous Variables n Continuous u Variable can take infinite (or large) number of values within range F Ex. Age measured by exact date of birth n Discrete u Attributes of variable that are distinct but not necessarily continuous F Ex. Age measured by age groups (Note: techniques exist for making assumptions about discrete variables in order to use techniques developed for continuous variables)

The Lexis Diagram

Core Notions in Basic Univariate Statistics n Ways of describing data about one variable (“uni”=one) u Measures of central tendency F Summarize information about one variable (“averages”) u Measures of dispersion F Variations or “spread”

Mode Babbie (1995: 378) n most common or frequently occurring category or value (for all types of data)

Bimodal n When there are two “most common” values that are almost the same (or the same)

Median Babbie (1995: 378) n middle point of rank-ordered list of all values (only for ordinal, interval or ratio data)

Mean (arithmetic mean) Babbie (1995: 378) u Arithmetic “average” = sum of values divided by number of cases (only for ratio and interval data)

Two Data Sets with the Same Mean

Another Diagram of Normal Curve (Showing Ideal Random Sampling Distribution, Standard Deviation & Z- scores)

Normal Distribution & Measures of Central Tendency Neuman (2000: 319) n Symmetric n Also called the “Bell Curve”

Skewed Distributions & Measures of Central Tendency Neuman (2000: 319) Skewed to the left Skewed to the right

Why Measures of Central Tendency are not enough to describe distributions n 7 people at bus stop in front of bar aged 25,26,27,30,33,34,35 u median= 30, mean= 30 n 7 people in front of ice-cream parlour aged 5,10,20,30,40,50,55 u median= 30, mean= 30 n BUT issue of “spread” socially significant

Another Illustration Normal & Skewed Distributions

Measures of Variation or Dispersion n range: distance between largest and smallest scores n standard deviation: for comparing distributions n percentiles: % up to and including the number (from below) n z-scores: for comparing individual scores taking into account the context of different distributions

Range & Interquartile range n distance between largest and smallest scores u what does a short distance between the scores tell us about the sample? u But problems of “outliers” or extreme values may occur

Interquartile range (IQR) n distance between the 75th percentile and the 25th percentile n range of the middle 50% (approximately) of the data n Eliminates problem of outliers or extreme values n Example from StatCan website (11 in sample) StatCan u Data set: 6, 47, 49, 15, 43, 41, 7, 39, 43, 41, 36 u Ordered data set:6, 7, 15, 36, 39, 41, 41, 43, 43, 47, 49 u Median:41 u Upper quartile: 41 u Lower quartile: 15 u IQR= 41-15

Standard Deviation and Variance n n Inter quartile range eliminates problem of outliers BUT eliminates half the data n n Solution? measure variability from the center of the distribution. n n standard deviation & variance measure how far on average scores deviate or differ from the mean.

Calculation of Standard Deviation Neuman (2000: 321)

Calculation of Standard Deviation Neuman (2000: 321)

Standard Deviation Formula Neuman (2000: 321)

Details on the Calculation of Standard Deviation Neuman (2000: 321)

Discussion The Bell Curve & standard deviation

Discussion of Preceding Diagram n “Many biological, psychological and social phenomena occur in the population in the distribution we call the bell curve (Portney & Watkins, 2000).” link to source link to sourcelink to source n Preceding picture u a symmetrical bell curve, u average score [i.e., the mean] in the middle, where the ‘bell’ shape tallest. u Most of the people [i.e., 68% of them, or 34% + 34%] have performance within 1 segment [i.e., a standard deviation] of the average score.”

Interpreting Standard Deviation n amount of variation from mean n Illustration: high & low standard deviation n meaning depends on exact case

Recall: Central Tendency & Dispersion (description of distributions) n 7 people at bus stop in front of bar aged 25,26,27,30,33,34,35 u median= 30, mean= 30 u Range= 10, standard deviation=10.5 n 7 people in front of ice-cream parlour aged 5,10,20,30,40,50,55 u median= 30, mean= 30 u Range= 50, standard deviation=17.9

Other ways of characterizing dispersion or spread n Techniques for understanding position of a case (or group of cases) in the context all of cases n Percentiles n Standard Scores u z-scores

Percentile n 1 st Calculate rank then choose a rank (score) and figure out percentage equal to or less than the rank (score) u Link to more complex definition of percentile Link n % up to and including the number (from below) u “A percentile rank is typically defined as the proportion of scores in a distribution that a specific score is greater than or equal to. For instance, if you received a score of 95 on a math test and this score was greater than or equal to the scores of 88% of the students taking the test, then your percentile rank would be 88. You would be in the 88th percentile” n Also used in other ways (for example to eliminate cases)

z-scores n For understanding how a score is positioned in the data set n to enable comparisons with other scores from other data sets u (comparing individual scores in different distributions) F example of two students from different schools with different GPAs u comparing sample distributions to population. How representative is sample to population under study? (Link to more complete discussion of use of z-scores to understand sampling distribution) Link to more complete discussion of use of z-scores to understand sampling distribution)Link to more complete discussion of use of z-scores to understand sampling distribution)

Calculating Z-Scores n z-score=(score – sample mean)/standard deviation of set u Link to formula Link u Link to z-score calculator Link

Calculating Z-Scores (p. 265 textbook)

Using Z-scores to compare two students’ from different schools: A n Susan with GPA of 3.62 and Jorge with GPA of 3.64 n Susan from College A u Susan’s Grade Point Average =3.62 u Mean GPA= 2.62 u SD=.50 u Susan’s z-score= =1.00/.50=2 u Susan’s grade is two Standard deviations above mean at her school

Using Z-scores to compare two students’ from different schools: B n Jorge from College B u Jorge’s GPA =3.64 u Mean GPA= 3.24 u SD=.40 u Jorge’s z-score= =.40/.40=1 u Jorge’s grade is one standard deviation above the mean at his school n Susan’s absolute grade is lower but her position relative to other students at her school is much higher than Jorge’s position at his school

Another Diagram of Normal Curve with Standard Deviation & Z-scores

Discussion of Previous Case n Relationship of sampling distribution to population (use mean of sample to estimate mean of population)

Recall: Results with two Variables-- Bivariate Statistics n Statistical relationships between two variables u Covariation (vary together) F a type of association F Not necessarily causal u Independence (Null hypothesis): no relationship between the two variables F Cases with values in one variable do not have any particular value on the other variable

Sample Mean Notation

Population Mean Notation

Standard Error (recall tutorial task about average ages in family)tutorial task Standard Error (recall tutorial task about average ages in family)tutorial task n Calculate mean for all possible samples n Divide by number of samples n Measures variability

Recall: Results with two Variables-- Bivariate Tables (Cross Tabulations) Singleton, R., Straits, B. & Straits, M. (1993) Approaches to social research. Toronto: Oxford

Interpretation issues (Bivariate Tables) n Calculate percentages within categories of attributes of independent variable n In example: u Independent variable: gender u Dependent variable: fear of walking alone at night u Women more afraid than men

Other Ways of Presenting Same Data n Link to other tables Link Calculating Expected Outcomes n If variables (gender & fear) not related then distribution of subgroups of independent variable (male & female) should be the same in each subgroup as in the group overall (therefore men and women should express fear in the same proportions) n Used in techniques for studying relationships (Chi-square) u Descriptive dimension (strength of relationship) u Inferential (probability that the association is due to chance)

Expected outcomes (Null Hypothesis) Singleton, R., Straits, B. & Straits, M. (1993) Approaches to social research. Toronto: Oxford

Next Day

Control variables: Trivariate Tables Men/Women Drivers Automobile Accidents by Sex Per Cent Accident Free Women68% (6,950) Men56% (7,080) Automobile Accidents by Sex and Distance Driven Distance Under 10,000 kmOver 10,000 km Per Cent Accident FreeAccident Free Women75% 48% (5,035) (1,915) Men75% 48% (2,070) (5,010) Women have fewer accidents than men because women tend to drive less frequently than do men, and people who drive less frequently tend to have fewer accidents n In, Say it with Figures, Hans Zeisel presents the following data: