1. Homework #2 2. Inferential Statistics 3. Review for Exam.

Slides:



Advertisements
Similar presentations
SPSS Review CENTRAL TENDENCY & DISPERSION
Advertisements

1. Exams 2. Sampling Distributions 3. Estimation + Confidence Intervals.
Sampling Distributions
Statistics for the Social Sciences
QUANTITATIVE DATA ANALYSIS
Chapter 13 Conducting & Reading Research Baumgartner et al Data Analysis.
Lecture 2 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
PSY 307 – Statistics for the Behavioral Sciences
Sampling Distributions
PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.
Today’s Agenda Review Homework #1 [not posted]
Chapter Sampling Distributions and Hypothesis Testing.
Measures of Dispersion
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Central Limit Theorem The Normal Distribution The Standardised Normal.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
SHOWTIME! STATISTICAL TOOLS IN EVALUATION DESCRIPTIVE VALUES MEASURES OF VARIABILITY.
Measures of Central Tendency
Agenda Review Exam I Sampling Start Probabilities.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Measures of Central Tendency
Measures of Central Tendency
1. Homework #2 2. Inferential Statistics 3. Review for Exam.
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Essentials of Marketing Research
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 7. Using Probability Theory to Produce Sampling Distributions.
Estimation of Statistical Parameters
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Why Draw A Sample? Why not just the get the whole enchilada? – Pragmatic reasons – The true population is typically “unknowable” When done right, a small.
Descriptive Statistics
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Descriptive Statistics, The Normal Distribution, and Standardization.
KNR 445 Statistics t-tests Slide 1 Variability Measures of dispersion or spread 1.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Chapter 7 Estimation Procedures. Basic Logic  In estimation procedures, statistics calculated from random samples are used to estimate the value of population.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Copyright © 2014 by Nelson Education Limited. 3-1 Chapter 3 Measures of Central Tendency and Dispersion.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
The Normal Curve Theoretical Symmetrical Known Areas For Each Standard Deviation or Z-score FOR EACH SIDE:  34.13% of scores in distribution are b/t the.
Determination of Sample Size: A Review of Statistical Theory
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
SOC 3155 SPSS CODING/GRAPHS & CHARTS CENTRAL TENDENCY & DISPERSION h458 student
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
SOC 3155 SPSS Review CENTRAL TENDENCY & DISPERSION.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Confidence Interval Estimation For statistical inference in decision making:
Chapter Eight: Using Statistics to Answer Questions.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
1 Chapter 2: The Normal Distribution 2.1Density Curves and the Normal Distributions 2.2Standard Normal Calculations.
LIS 570 Summarising and presenting data - Univariate analysis.
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 16, 2009.
Chapter 2 Describing and Presenting a Distribution of Scores.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
SOC 3155 SPSS CODING/GRAPHS & CHARTS CENTRAL TENDENCY & DISPERSION h458 student
Outline Sampling Measurement Descriptive Statistics:
LEARNING OUTCOMES After studying this chapter, you should be able to
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
1. Homework #2 (not on posted slides) 2. Inferential Statistics 3
15.1 The Role of Statistics in the Research Process
Chapter Nine: Using Statistics to Answer Questions
BUSINESS MARKET RESEARCH
Chapter 4 (cont.) The Sampling Distribution
Presentation transcript:

1. Homework #2 2. Inferential Statistics 3. Review for Exam

HOMEWORK #2: Part A Sanitation Eng. Z=.53 = =.7019 F.C. Z=.67 = = GPA’s, which are in the top 10%? GPA of 3.0 and 3.20 are not:  Z = ( )/.33 =.67  Area beyond =.2514 (25.14%)  Z=( )/.33=1.27  Corresponds to.8980 ( )  Area beyond =.1020 (10.2%) By contrast, for 3.21…  Z=( )/.33=1.30  Corresponds to.9032 ( )

HOMEWORK #2: Part B Question 1 a. Mean=18.87; median=15; mode=4 b. The mean is higher because the distribution is positively skewed (several large cities with high percents) c. When you remove NYC, the mean=16.43 & the median goes from 15 to Removing NYC’s high value from the distribution reduces the skew. The mean decreases more than the median because value of the mean is influenced by outlying values; the median is not—it only moves one case over.

HOMEWORK #2: Part B Question 2 For this problem, there are two measures of central tendency (indicating the “typical” score). The mean per student expenditure was almost $2,000 higher in 2003 ($9,009) than in 1993 ($7,050). The median also increased, but not nearly as much (from $7,215 to $7,516). The spread of the scores, as indicated by the standard deviation, was more than double 2003 (1,960) than it was in 1993 (804). Shape For 1993, the distribution of scores has a slight negative skew; this distribution is essentially normal (bell-shaped) as the mean ($7,050) and median ($7,215) are similar. By contrast, for 2003, the mean is much greater than the median; this distribution has a strong positive skew.

HOMEWORK #2: Part B Q3 a % Opposite sides of mean, add 2 areas together b. 6.38% Both scores on right side of mean, subtract areas c % “Column C” area for Z=1.25 is.1056 d % “Column B” area for Z= -0.5 is (for other half of normal curve) e % Z=2.5; Column B (for area between 2.5 & 0) = (for other half of normal curve) f. 6.68% Z = -1.5; Column C for area beyond -1.5 =.0668

HOMEWORK #2: Part B Q4 a Column B area (.4953) (for other half of normal curve) b % of area on either side of mean (47) c “Column B” for both – d Column B area (.4997) (for other half of normal curve) e “Column C” area for Z=1.6 f Scores on opposite sides of mean  add “Col. B” areas

HOMEWORK #2: Part C SPSS: All the info needed to answer these questions is contained in this output  Statistics HOURS PER DAY WATCHING TV NValid1426 Missing618 Mean3.03 Median2.00 Mode2 Std. Deviation2.766 Percentiles

Distribution (Histogram) for TV Hours

Sibs Distribution

College Science Credits

Sampling Terminology Element: the unit of which a population is comprised and which is selected in the sample Population: the theoretically specified aggregation of the elements in the study (e.g., all elements) Parameter: Description of a variable in the population  σ = standard deviation, µ = mean Sample: The aggregate of all elements taken from the pop. Statistic: Description of a variable in the sample (estimate of parameter)  X = mean, s = standard deviation

Non-probability Sampling Elements have unknown odds of selection Examples Snowballing, available subjects… Limits/problems Cannot generalize to population of interest (doesn’t adequately represent the population (bias) Have no idea how biased your sample is, or how close you are to the population of interest

Probability Sampling Definition: Elements in the population have a known (usually equal) probability of selection Benefits of Probability Sampling Avoid bias Both conscious and unconscious More representative of population Use probability theory to: Estimate sampling error Calculate confidence intervals

Sampling Distributions Link between sample and population DEFINITION 1 IF a large (infinite) number of independent, random samples are drawn from a population, and a statistic is plotted from each sample…. DEFINITION 2 The theoretical, probabilistic distribution of a statistic for all possible samples of a certain outcome

The Central Limit Theorem I IF REPEATED random samples are drawn from the population, the sampling distribution will always be normally distributed As long as N is sufficiently (>100) large The mean of the sampling distribution will equal the mean of the population WHY? Because the most common sample mean will be the population mean Other common sample means will cluster around the population mean (near misses) and so forth Some “weird” sample findings, though rare

The Central Limit Theorem II Again, WITH REPEATED RANDOM SAMPLES, The Standard Deviation of the Sampling distribution = σ √N This Critter (the population standard deviation divided by the square root of N) is “The Standard Error” How far the “typical” sample statistic falls from the true population parameter

The KICKER Because the sampling distribution is normally distributed….Probability theory dictates the percentage of sample statistics that will fall within one standard error 1 standard error = 34%, or +/- 1 standard error = 68% 1.96 standard errors = 95% 2.58 standard errors = 99%

The REAL KICKER From what happens (probability theory) with an infinite # of samples… To making a judgment about the accuracy of statistics generated from a single sample Any statistic generated from a single random sample has a 68% chance of falling within one standard error of the population parameter  OR roughly a 95% CHANCE OF FALLING WITHIN 2 STANDARD ERRORS

EXAM Closed book BRING CALCULATOR You will have full class to complete Format: Output interpretation Z-score calculation problems Memorize Z formula  Z-score area table provided Short Answer/Scenarios Multiple choice

Review for Exam Variables vs. values/attributes/scores variable – trait that can change values from case to case  example: GPA score (attribute)– an individual case’s value for a given variable Concepts  Operationalize  Variables

Review for Exam Short-answer questions, examples: What is a strength of the standard deviation over other measures of dispersion? Multiple choice question examples: Professor Pinhead has an ordinal measure of a variable called “religiousness.” He wants to describe how the typical survey respondent scored on this variable. He should report the ____. a. median b. mean c. mode e. standard deviation On all normal curves the area between the mean and +/- 2 standard deviations will be a. about 50% of the total area b. about 68% of the total area c. about 95% of the total area d. more than 99% of the total area

EXAM Covers chapters 1- (part of)6: Chapter 1 Levels of measurement (nominal, ordinal, I-R)  Any I-R variable could be transformed into an ordinal or nominal-level variable  Don’t worry about discrete-continuous distinction Chapter 2 Percentages, proportions, rates & ratios  Review HW’s to make sure you’re comfortable interpreting tables

EXAM Chapter 3: Central tendency ID-ing the “typical” case in a distribution Mean, median, mode  Appropriate for which levels of measurement?  Identifying skew/direction of skew  Skew vs. outliers Chapter 4: Spread of a distribution R & Q s 2 – variance (mean of squared deviations) s  Uses every score in the distribution  Gives the typical deviation of the scores DON’T need to know IQV (section 4.2)

Keep in mind… All measures of central tendency try to describe the “typical case” Preference is given to statistics that use the most information  For interval-ratio variables, unless you have a highly skewed distribution, mean is the most appropriate  For ordinal, the median is preferred If mean is not appropriate, neither is “s” S = how far cases typically fall from mean

EXAM Chapter 5 Characteristics of the normal curve  Know areas under the curve (Figure 5.3) KNOW Z score formula  Be able to apply Z scores  Finding areas under curve Z scores & probability Frequency tables & probability

EXAM Chapter 6 Reasons for sampling Advantages of probability sampling What does it mean for a sample to be representative? Definition of probability (random) sampling Sampling error Plus… Types of nonprobability sampling

Interpret Total IQ Score NValid1826 Missing9092 Mean88.98 Median91.00 Mode94 Std. Deviation Minimum0 Maximum160 Percentiles Number of cases used to calculate mean? 2.Most common IQ score? 3.Distribution skewed? Direction? 4.Q? 5.Range? 6.Is standard deviation appropriate to use here?

Scenario Professor Scully believes income is a good predictor of the size of a persons’ house IV? DV? Operationalize DV so that it is measured at all three levels (nominal, ordinal, IR) Repeat for IV

Express the answer in the proper format Percent Proportion Ratio Probability