Aim: 1. To debrief correlation studies- What is a correlation study

Slides:



Advertisements
Similar presentations
Unit 1: Science of Psychology
Advertisements

Statistical Tests Karen H. Hagglund, M.S.
QUANTITATIVE DATA ANALYSIS
Descriptive Statistics
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Introduction to Educational Statistics
Essential Questions EQ 1-5: How do psychologists draw appropriate conclusions about behavior from research? **Make sure in their do now they grab a graphic.
Today: Central Tendency & Dispersion
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Statistics Recording the results from our studies. Must use a common language so we all know what we are talking about.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
QUANTITATIVE RESEARCH AND BASIC STATISTICS. TODAYS AGENDA Progress, challenges and support needed Response to TAP Check-in, Warm-up responses and TAP.
Chapter 1: Science of Psychology Daily Objective (concept map): Apply basic statistical concepts to explain research findings: - Descriptive Statistics:
Statistics Recording the results from our studies. Must use a common language so we all know what we are talking about.
Case Studies A detailed picture of one or a few subjects. Tells us a great story…but is just descriptive research. Does not even give us correlation data.
Chapter 11 Univariate Data Analysis; Descriptive Statistics These are summary measurements of a single variable. I.Averages or measures of central tendency.
Descriptive & Inferential Statistics Adopted from ;Merryellen Towey Schulz, Ph.D. College of Saint Mary EDU 496.
Statistics AP Psychology 2010 J. Mulder. Why are statistics important? “Proof is virtually impossible for psychology researchers to attain because controlling.
Chapter Eight: Using Statistics to Answer Questions.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
AP Psychology Warm Up List 5 guidelines that psychologists should follow when conducting experiments with animals. Then list 5 guidelines that should apply.
Statistics. Descriptive Statistics Organize & summarize data (ex: central tendency & variability.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Descriptive and Inferential Statistics Or How I Learned to Stop Worrying and Love My IA.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
WHS AP Psychology Descriptive Statistics: The Normal Curve and Measures of Variance.
AP PSYCHOLOGY: UNIT I Introductory Psychology: Statistical Analysis The use of mathematics to organize, summarize and interpret numerical data.
STATS DAY First a few review questions. Which of the following correlation coefficients would a statistician know, at first glance, is a mistake? A. 0.0.
Outline Sampling Measurement Descriptive Statistics:
MSS 905 Methods of Missiological Research
Introductory Psychology: Statistical Analysis
Data measurement, probability and Spearman’s Rho
Measurements Statistics
Different Types of Data
Unit 3: Science of Psychology
STATISTICS FOR SCIENCE RESEARCH
INF397C Introduction to Research in Information Studies Spring, Day 12
PCB 3043L - General Ecology Data Analysis.
Unit 1: Science of Psychology
Tips for exam 1- Complete all the exercises from the back of each chapter. 2- Make sure you re-do the ones you got wrong! 3- Just before the exam, re-read.
Statistics.
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Basic Statistics Overview
Vocab Quiz—History, Research Methods
Statistics in AP Psychology
Descriptive Statistics
Survey Method Most common type of study in psychology
Correlational Method Correlation expresses a relationship between two variable. Does not show causation. As more ice cream is eaten, more people are murdered.
Science of Psychology AP Psychology
Statistical Evaluation
Descriptive and Inferential Statistics
Experimental Method Looking to prove causal relationships.
STATS DAY First a few review questions.
Analysis and Interpretation: Exposition of Data
Module 8 Statistical Reasoning in Everyday Life
An Introduction to Statistics
Introduction to Statistics
Basic Statistical Terms
Operational Definitions
Practice: Imagine that you are a golfer of above-average ability and that you have the opportunity to play the greatest golfer in the world (say Tiger.
Research Day 3.
1. Homework #2 (not on posted slides) 2. Inferential Statistics 3
Ms. Saint-Paul A.P. Psychology
15.1 The Role of Statistics in the Research Process
Chapter Nine: Using Statistics to Answer Questions
Descriptive Statistics
InferentIal StatIstIcs
Statistics Review (It’s not so scary).
Presentation transcript:

Aim: 1. To debrief correlation studies- What is a correlation study Aim: 1. To debrief correlation studies- What is a correlation study? How is it used in research? 2. Introduction to how stats are use in reporting data in research. DO NOW: Take out your paper and staple rubric to the FRONT of your paper. Homework: Complete Stats Vocabulary Sheet. Define ALL terms AND look up the terms you are unsure about. Bring in Calculator!!

The Dangers of Bread Evaluate the reasoning. Is it scientifically sound or flawed? Why? What lessons can we take from this?

About 1 in 8 U.S. women (about 12%) will develop invasive breast cancer over the course of her lifetime. In 2017, an estimated 252,710 new cases of invasive breast cancer are expected to be diagnosed in women in the U.S., along with 63,410 new cases of non-invasive (in situ) breast cancer. About 2,470 new cases of invasive breast cancer are expected to be diagnosed in men in 2017. A man’s lifetime risk of breast cancer is about 1 in 1,000. Breast cancer incidence rates in the U.S. began decreasing in the year 2000, after increasing for the previous two decades. They dropped by 7% from 2002 to 2003 alone. Facts from Breastcancer.org

Susan G. Komen Foundation Rates of breast cancer are low in women under 40. Fewer than 5 percent of women diagnosed with breast cancer in the U.S. are younger than 40 [4]. Rates begin to increase after age 40 and are highest in women over age 70 (see Figure 2.1 below). The median age of diagnosis of breast cancer for women in the U.S. is 62 [6]. However, the median age of diagnosis varies by race and ethnicity. For example, African-American women tend to be diagnosed at a younger age than white women [6]. The median age at diagnosis for African-American women is 59, compared to 63 for white women [6]. Susan G. Komen Foundation

Age is also a risk factor for breast cancer in men. The older a man is, the more likely he is to get breast cancer. However, breast cancer is much less common in men than in women (see Figure 2.1 below). The median age of diagnosis of breast cancer for men in the U.S. is 68 [6]. However, the median age of diagnosis varies by race and ethnicity. For example, African-American men tend to be diagnosed at a younger age than white men [6]. The median age at diagnosis for African-American men is 65, compared to 68 for white men [6]. Learn more about breast cancer in men.

Statistics Is concerned with the collection, analysis, interpretation, & presentation of data Must use a common language so we all know what we are talking about.

What do we already know about statistics? Define ALL of the terms you know. Put a question mark next to the terms you are unsure about. What questions do you have? What do you wonder about?

Aim: How stats are use in reporting data in research. DO NOW: Go over TEST and put in test folder. Take out HW – vocabulary sheet. What is the difference between the 4 scales of measurement? Which one is truly quantifiable? Qualitative? Homework: Quiz on Stats (60 Pt Quiz) Wednesday

Measurement Scales DEFINE/ EXAMPLE Nominal Ordinal Interval Ratio

Nominal Ordinal Interval Ratio Categorical Non-numerical; cannot compute mean. Show with bar graph. These numbers are purely for categorizing data into groups. They have no quantitative properties (can’t compute mean). Channels on TV OR A survey where 1 = Democrat, 2 = Republican, or 3 = Independent Favorite ice cream Ordinal Ordered These numbers contain some quantitative information, namely that of determining ranking (Better, Faster, Smarter). But are not numerical – can’t compute mean. However, The distance between scale points is not equal. NCAA Seed Rankings, Positions in a race. Interval There are equal interval between points; These numbers contain quantities information. The spaces in between the numbers mean something, and they can be added and subtracted, and average mean. Temperature (Fahrenheit or Celsius) Intelligence Test Score Ratio These numbers contain the most quantitative information. They can also be added, subtracted, multiplied, and divided, and they have a true zero point. Money, speed, height, gpa.

Continuous/ Discontinuous Continuous = there are no limits on the value Discontinuous = there are a finite number of values

Check for Understanding Complete 1 – 6 p. 2 Complete 1 – 15 p.

What is the difference between bar graphs and histograms? used to display "categorical data", that is data that fits into categories Nominal, ordinal Histogram present "continuous data", that is data that represents measured quantity where, at least in theory, the numbers can take on any value in a certain range Interval,ratio

Aim: How stats are use in reporting data in research. DO NOW: Homework: Complete p. 10 – 11 Stats Practice in Stats Packet. Quiz on Stats (60 Pt Quiz) Thursday – 15 Multiple Choice and computations.

Descriptive Statistics Inferential Statistics Descriptive Statistics PAGE 4 to reach conclusions beyond describing the data Estimating the likelihood you would get the same results (probability) Estimate whether you can generalize the results to larger population Just describes sets of data. Summarize in a meaningful way Notice patterns Can’t really draw conclusions

Check for Understanding Identify as descriptive or inferential TOP OF PAGE 4

Measure of Central Tendency Mean “average” of a distribution Calculate - sum of scores/ number of entries Mode most common score in a distribution Median “middle” score in a distribution Calculate – list all scores lowest to highest, middle number Which of the above would be most useful in describing a skewed curve?

Central Tendency Mean, Median and Mode. Watch out for extreme scores or outliers. Let’s look at the salaries of the employees at Dunder Mifflen Paper in Scranton: $25,000-Pam $25,000- Kevin $25,000- Angela $100,000- Andy $100,000- Dwight $200,000- Jim $300,000- Michael The mean salary good at about $110,000 but it is affected by outliers. The median salary looks good at $100,000. But the mode salary is only $25,000.

The Median is a much better measure of the center But the mean doesn’t work in a skewed distribution (what type is this?) The Median is a much better measure of the center 22

Skewed Curves PAGE 7– Positive v. Negative Skewed Curves PAGE 7– Positive v. Negative? Where are the mean, median and mode? Negatively Skewed Outliers skew distributions (not representative of majority). If group has one high score, the curve has a positive skew (contains more low scores) If a group has a low outlier, the curve has a negative skew (contains more high scores) Positively Skewed 23

Measures of Variation Page 5 Dispersion = how distributed the data points are (high v. low variability) Two key ways of measuring variability: Range Standard Deviation 24

Range The range simply gives the lowest and highest values of a data set but not any other info/variability of the other scores. 25

Standard Deviation measure of dispersion/ how much each score varies/deviates from the mean. They measures the average difference between the values. The higher the variance or SD, the more spread out the distribution is. Do scientists want a big or small SD? Shaq and Kobe may both score 30 ppg (same mean). But their SDs are very different.

In research what would we prefer in our sample – a high or low SD? Why?

Three populations with the same mean, median, and mode Three populations with the same mean, median, and mode. Which has the smallest standard deviation?

Formulas for Standard Deviation

Standard Deviation 30

Check for Understanding Page 5 Calculate the Range and SD for the data set Page 5 Estimating SD 1 – 2 at bottom

Standard Deviation in Action A couple needs to be within one standard deviation of each other in intelligence (10 points in either direction). —Neil Clark Warren, founder of eHarmony.com 32

Interpret this graph Figure 6. The distribution of IQ scores in male and female populations. Adjusted parameter values yielded a male-female gap of 0.162 SD in g equivalent to 2.43 IQ points in favor of men 33

Normal Distributions Page 5 The distribution of data also gives us key info. We know that many human attributes… e.g height, weight, task skill, reaction time, anxiousness, personality characteristics, attitudes etc. all …follow a normal distribution. In a normal distribution, the mean, median and mode are all the same. Symmetrical bell shaped curve in which % of scores between mean and any pt. on horizon is always the same 68% fall w/in 1 SD, 95% w/in 2 SD 34

Normal Distribution

IQ follows a Normal Distribution Mean = 100 SD = 15 36

What percentage score below 100? Mean = 100 SD = 15 37

What percentage score below 100? Mean = 100 SD = 15 38

What percentage score above 100? Mean = 100 SD = 15 34.1% + 13.6% + 2.1% 39

What percentage score between 85 and 100? Mean = 100 SD = 15 34.1% 40

What percentage score between 85 and 115? Mean = 100 SD = 15 34.1% + 34.1% = 68.2% 41

What percentage score between 70 and 130? 13.6% + 34.1% + 34.1% + 13.6% = 95.4% Mean = 100 SD = 15 42

What percentage score below 70 and above 130? Mean = 100 SD = 15 43

The shelf life of a particular dairy product is normally distributed with a mean of 12 days and a standard deviation of 3 days. About what percent of the products last between 9 and 15 days? About what percent of the products last between 12 and 15 days? About what percent of the products last 6 days or less? About what percent of the products last 15 or more days?

Aim: How stats are use in reporting data in research. DO NOW: Take out your HW Complete NORMAL CURVE practice. Homework: Quiz on Stats (60 Pt Quiz) Tomorrow – 15 Multiple Choice and computations.

Inferential Statistics Page 8 Calculations that estimate the likelihood that you’d get similar results if you repeated the study; ie, that your results are NOT a fluke or random, atypical event. Calculate the probability. T Test, chi-squared, ANOVA p value = likelihood that results are a fluke or coincidental Which should you trust more, results with a low or high p value? How low? If p < 0.05, then the results are “statistically significant”. Statistically significant – not likely due to random chance

Testing for Differences If we have results (means) from two groups, before we infer causation we must ask the question: Is there a real difference between the means of the two groups or did it just happen by chance? To answer the question, we must run a t-Test 48

Example of when to do a t-test Does caffeine improve our reaction time? We recruit 40 people and give (random assignment) 20 a caffeine pill (experimental group) 20 a sugar pill (control group) We give them a brief reaction time test and record the results. 49

Example of when to do a t-test Experimental Group results (caffeine) Mean = 500.32ms SD = 172.60ms Control Group results (placebo) Mean = 608.64ms SD = 146.93 50

Example of when to do a t-test Caffeine No Caffeine 51

Why can’t I be done! Yes, they are different. . . But you don’t know if that difference was due to your IV (caffeine) or just dumb luck. You have to be sure that the results are statistically significant

T-Test formula

T-test excel formula =TTEST(array1,array2,tails,type) Array1 is the first data set. Array2 is the second data set. Tails specifies the number of distribution tails. If tails = 1, TTEST uses the one-tailed distribution. If tails = 2, TTEST uses the two-tailed distribution. Type is the kind of t-Test to perform. IF TYPE EQUALS THIS TEST IS PERFORMED 1 Paired 2 Two-sample equal variance (homoscedastic) 3 Two-sample unequal variance (heteroscedastic)

T-test yields a p-value Generally, the t test gives a P value that allows us a measure of confidence in the observed difference. It allows us to say that the difference is real and not just by chance. A p value of less than 0.05 is a common criteria for significance. We call this statistically significant Note: We also need to be careful about finding false negatives (Type II Errors). Look up ‘statistical power’ if you want to know more about this. 55

T-test results Does caffeine improve our reaction time? Caffeine condition has a lower mean RT. We run a t-test on our samples and get: p = 0.039 Can we be confident that the difference in the data is not due to chance? 56

Kahoot https://play.kahoot.it/#/k/176cb923-99ef-418d-aa0e-974cf214ef12

Scores A unit that measures the distance of one score from the mean. A positive z score means a number above the mean. A negative z score means a number below the mean.

PERCENTILES If scores from an English exam range from 0 to 100 and your raw score is 75, does that mean you are in the 75th percentile? % of people who “fall” at each RAW SCORE” need mean and SD to figure out!

Meta Analysis Combines and analyzes data from many different studies Determines how much variance in scores across all studies could be explained by a particular variable.

Z - Scores Raw scores can be converted to Z scores Tells you how far a given score is above or below the mean, using the SD as a unit of measurement Raw Score – Mean divided by SD

Z score - the number of standard deviations from the mean