Some Introductory Statistics Terminology. Descriptive Statistics Procedures used to summarize, organize, and simplify data (data being a collection of.

Slides:



Advertisements
Similar presentations
STATISTICAL ANALYSIS. Your introduction to statistics should not be like drinking water from a fire hose!!
Advertisements

ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 1-3 Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor © 2013.
Basic Data Analysis for Quantitative Research
Today’s Agenda Review Homework #1 [not posted]
Lect 10b1 Histogram – (Frequency distribution) Used for continuous measures Statistical Analysis of Data ______________ statistics – summarize data.
Intro to Descriptive Statistics
Introduction to Educational Statistics
Brown, Suter, and Churchill Basic Marketing Research (8 th Edition) © 2014 CENGAGE Learning Basic Marketing Research Customer Insights and Managerial Action.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Chapter 7 Probability and Samples: The Distribution of Sample Means
Chapter 3: Central Tendency
Measures of Central Tendency
Today: Central Tendency & Dispersion
Measures of Central Tendency
Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately describes the center of the.
The Data Analysis Plan. The Overall Data Analysis Plan Purpose: To tell a story. To construct a coherent narrative that explains findings, argues against.
Hypothesis Testing:.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
APPENDIX B Data Preparation and Univariate Statistics How are computer used in data collection and analysis? How are collected data prepared for statistical.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
1.3 Psychology Statistics AP Psychology Mr. Loomis.
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
Smith/Davis (c) 2005 Prentice Hall Chapter Four Basic Statistical Concepts, Frequency Tables, Graphs, Frequency Distributions, and Measures of Central.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Chapter 15 Data Analysis: Testing for Significant Differences.
Reasoning in Psychology Using Statistics Psychology
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
PPA 501 – Analytical Methods in Administration Lecture 5a - Counting and Charting Responses.
Introduction to Descriptive Statistics Objectives: 1.Explain the general role of statistics in assessment & evaluation 2.Explain three methods for describing.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Copyright © 2014 by Nelson Education Limited. 3-1 Chapter 3 Measures of Central Tendency and Dispersion.
The Normal Curve Theoretical Symmetrical Known Areas For Each Standard Deviation or Z-score FOR EACH SIDE:  34.13% of scores in distribution are b/t the.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Research Ethics:. Ethics in psychological research: History of Ethics and Research – WWII, Nuremberg, UN, Human and Animal rights Today - Tri-Council.
Chapter 11 Univariate Data Analysis; Descriptive Statistics These are summary measurements of a single variable. I.Averages or measures of central tendency.
Measures of Central Tendency: The Mean, Median, and Mode
Review. Statistics Types Descriptive – describe the data, create a picture of the data Mean – average of all scores Mode – score that appears the most.
Descriptive & Inferential Statistics Adopted from ;Merryellen Towey Schulz, Ph.D. College of Saint Mary EDU 496.
Chapter Eight: Using Statistics to Answer Questions.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Data Analysis.
Chapter 6: Analyzing and Interpreting Quantitative Data
Central Tendency A statistical measure that serves as a descriptive statistic Determines a single value –summarize or condense a large set of data –accurately.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Introduction to statistics I Sophia King Rm. P24 HWB
Descriptive and Inferential Statistics Or How I Learned to Stop Worrying and Love My IA.
Anthony J Greene1 Central Tendency 1.Mean Population Vs. Sample Mean 2.Median 3.Mode 1.Describing a Distribution in Terms of Central Tendency 2.Differences.
Chapter 3: Central Tendency 1. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Review: Stages in Research Process Formulate Problem Determine Research Design Determine Data Collection Method Design Data Collection Forms Design Sample.
Descriptive Statistics(Summary and Variability measures)
Chapter 4: Measures of Central Tendency. Measures of central tendency are important descriptive measures that summarize a distribution of different categories.
Making Sense of Statistics: A Conceptual Overview Sixth Edition PowerPoints by Pamela Pitman Brown, PhD, CPG Fred Pyrczak Pyrczak Publishing.
AP PSYCHOLOGY: UNIT I Introductory Psychology: Statistical Analysis The use of mathematics to organize, summarize and interpret numerical data.
Statistics.
APPROACHES TO QUANTITATIVE DATA ANALYSIS
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
STATS DAY First a few review questions.
Statistics: The Interpretation of Data
Analysis and Interpretation of Experimental Findings
15.1 The Role of Statistics in the Research Process
Chapter Nine: Using Statistics to Answer Questions
BUSINESS MARKET RESEARCH
Presentation transcript:

Some Introductory Statistics Terminology

Descriptive Statistics Procedures used to summarize, organize, and simplify data (data being a collection of measurements or observations) taken from a sample Examples: –Expressed on a 1 to 5 scale, the average satisfaction score was 3.7 –43% of students in an online course cited that family obligations were the main motivation behind choosing distance education

Inferential Statistics Techniques that allow us to make inferences about a population based on data that we gather from a sample Study results will vary from sample to sample strictly due to random chance (i.e., sampling error) Inferential statistics allow us to determine how likely it is to obtain a set of results from a single sample This is also known as testing for “statistical significance”

Population A population is the entire set of individuals that we are interested in studying This is the group that we want to generalize, or apply, our results to Although populations can vary in size, they are usually quite large Thus, it is usually not feasible to collect data from the entire population

Sample A sample is simply a subset of individuals selected from the population In the best case, the sample will be representative of the population That is, the characteristics of the individuals in the sample will mirror those in the population

Variables A characteristic that takes on different values for different individuals in a sample Examples: –Gender –Age –Course satisfaction –The amount of instructor contact during the semester

Independent Variables (IV) The “explanatory” variable The variable that attempts to explain or is purported to cause differences in a second variable Example: –Does the use of a computer-delivered curriculum enhance student achievement? –Whether or not (yes or no) students received the computer instruction is the IV

Dependent Variables (DV) The “outcome” variable The variable that is thought to be influenced by the independent variable Example: –Does the use of a computer-delivered curriculum enhance student achievement? –Student achievement is the DV

Confounding Variables Researchers are usually only interested in the relationship between the IV and DV Confounding variables represent unwanted sources of influence on the DV, and are sometimes referred to as “nuisance” variables Example: –Does the use of a computer-delivered curriculum enhance student achievement? –One’s previous experience with computers, age, gender, SES, etc. may all be confounding variables

Controlling Confounding Variables Typically, researchers are interested in excluding, or controlling for, the effects of confounding variables This is not a statistical issue, but is accomplished by the research design Certain types of designs (e.g., true experiments) better control the effects of confounding variables

Central Tendency

Measures of Central Tendency Three measures of central tendency are available –The Mean –The Median –The Mode Unfortunately, no single measure of central tendency works best in all circumstances –Nor will they necessarily give you the same answer

Example SAT scores from a sample of 10 college applicants yielded the following: –Mode:480 –Median: 505 –Mean: 526 Which measure of central tendency is most appropriate?

The Mean The mean is simply the arithmetic average The mean would be the amount that each individual would get if we took the total and divided it up equally among everyone in the sample Alternatively, the mean can be viewed as the balancing point in the distribution of scores (i.e., the distances for the scores above and below the mean cancel out)

The Median The median is the score that splits the distribution exactly in half 50% of the scores fall above the median and 50% fall below The median is also known as the 50th percentile, because it is the score at which 50% of the people fall below

Special Notes A desirable characteristic of the median is that it is not affected by extreme scores Example: –Sample 1: 18, 19, 20, 22, 24 –Sample 2: 18, 19, 20, 22, 47 –The median is 20 in both samples Thus, the median is not distorted by skewed distributions

The Mode The mode is simply the most common score There is no formula for the mode When using a frequency distribution, the mode is simply the score (or interval) that has the highest frequency value When using a histogram, the mode is the score (or interval) that corresponds to the tallest bar

Choosing the Proper Statistic Continuous data –Always report the mean –If data are substantially skewed, it is appropriate to use the median as well Categorical data –For nominal data you can only use the mode –For ordinal data the median is appropriate (although people often use the mean)

Distribution Shape and Central Tendency In a normal distribution, the mean, median, and mode will be approximately equal

Distribution Shape (2) In a skewed distribution, the mode will be the peak, the mean will be pulled toward the tail, and the median will fall in the middle

Frequency Distribution Tables

Overview After collecting data, researchers are faced with pages of unorganized numbers, stacks of survey responses, etc. The goal of descriptive statistics is to aggregate the individual scores (datum) in a way that can be readily summarized A frequency distribution table can be used to get “picture” of how scores were distributed

Frequency Distributions A frequency distribution displays the number (or percent) of individuals that obtained a particular score or fell in a particular category As such, these tables provide a picture of where people respond across the range of the measurement scale One goal is to determine where the majority of respondents were located

When To Use Frequency Tables Frequency distributions and tables can be used to answer all descriptive research questions It is important to always examine frequency distributions on the IV and DV when answering comparative and relationship questions

Three Components of a Frequency Distribution Table Frequency –the number of individuals that obtained a particular score (or response) Percent –The corresponding percentage of individuals that obtained a particular score Cumulative Percent –The percentage of individuals that fell at or below a particular score (not relevant for nominal variables)

Example (1) Frequency distribution showing the ages of students who took the online course

Example (2) Student responses when asked whether or not they would recommend the online course to others Most would recommend the course

Independent t-Test

The independent samples t-test is used to test comparative research questions That is, it tests for differences in two group means –Two groups are compared on a continuous DV

Scenario Suppose we wish to compare how males and females differed with respect to their satisfaction with an online course The null hypothesis states that men and women have identical levels of satisfaction

Research Question If we were conducting this study, the research question could be written as follows: –Are there differences between males and females with respect to satisfaction? The word “differences” was used to denote a comparative question

The Data (1) Satisfaction is measured on a 25-point scale that ranges between 5 (low) and 30 (high) The descriptive statistics were as follows:

The Data (2) On a 25-point satisfaction scale, men and women differed by about 5 points (means were and 23.5, respectively) They were not identical, but how likely is a 5 point difference to occur from the hypothetical population where men and women are identical?

Conceptual Formula The conceptual formula for the t statistic is The formula tells how big the 5 point difference we observed is relative to the difference expected simply due to sampling error

Results The t-statistic value was 1.695, suggesting that the 5-point difference is not quite twice as large as the difference we would expect due to chance (which is quantified by the standard error statistic) The p-value for the analysis was.116 (almost.12, or 12%)

Interpreting the Probability Thus, there was about a 12% chance that this sample (the 5 point difference) originated from the hypothetical null hypothesis population The p-value is greater than.05, so we would retain the null (results are not significant) Thus, there is no evidence that males and females differ in their satisfaction

Cohen’s d Effect Size Recall that p-values don’t tell how important the results are A measure of effect size can be computed that helps us quantify the magnitude of the results we obtained The mean difference (5 points) is expressed in standard deviation units

Example Using the statistics from the SPSS printout, the d effect size can be computed as

Interpreting Cohen’s d Cohen (1988) suggested the following guidelines for interpreting the d effect size –d >.20 is a small effect size (1/5 of a standard deviation difference) –d >.50 is a medium effect size (1/2 of a standard deviation difference) –d >.80 is a large effect size (4/5 of a standard deviation difference)

Writing Up the Results If you were writing the results for publication, it could go something like this: –“As seen in Table 1, satisfaction scores for female students were approximately five points higher, on average, than those of males. Using an independent t test, no statistically significant differences were observed between the group means, (t (12) = 1.70, p =.12). However, despite no statistical significance, Cohen’s d effect size indicated a large difference between the groups (d =.92)”