Descriptive Statistics and Inferential Statistics A Picture Can Be Worth a Thousand Words
Frequency Distributions Large amounts of information can be neatly organized and summarized Graphs can show entire patterns of scores very clearly The horizontal line is called the abscissa The vertical line is called the ordinate
Frequency Distribution Histogram: Graph of a frequency distribution in which the number of scores falling in each class represented by vertical bars Frequency polygon: Graph of a frequency distribution in which scores falling in each class represented by points on a line
Scores Graphed on a Histogram for Obedience to order to give shocks
Frequency Histogram for accuracy of taste: Which is abscissa?
Opiate Deaths by Age: Which is the Ordinate?
How Number in a Group Influences our Level of Conformity
Studies Want to Compare Groups: Average Score on Memory Test Use Measures of Central Tendency Mean: average score of each group Median: middle score of each group Mode: most frequent score
Measures of Variability Crucial How spread out are the scores in each group Range: difference between highest and lowest score Standard Deviation: how much does the typical score differ from the mean SD is the important statistic in most research studies
Standard Deviation When scores are widely spread, the standard deviation number gets larger When the scores are close together, the standard deviation score gets smaller Scores on different tests are converted by using standard deviations
Standardizing Scores From Different Tests Subtract mean from score and divide by the standard deviation for the group—now a Z-score Susan had a score of 110 in a class with a mean of 100 and a standard deviation of 10 Her z-score is what?
Turning Scores on Different Tests into Standard Scores Susan’s score on the original test was 110. In order to compare her performance to scores on other tests, her score was turned into a standard or z-score Her score was subtracted from the mean of 100, resulting in was divided by the standard deviation of 10, resulting in?
Z-scores allow researchers to compare scores from different tests Susan’s standard or z-score is +1 Now John took a different memory test and received a score of 118. Why can’t compare John’s score to Susan’s original score of 110?
Importance of Standard Scores John’s score of 118 on one test can be compared to Susan’s score of 110 on a different score IF both scores are turned into standard or z-scores John’s score came from a class having a mean of 100 and a standard deviation of 18. What is his z-score?
Both John and Susan have the equivalent Standard Scores John: mean of 100 subtracted from his score of 118 is 18. Divided by the standard deviation of 18 provides the z-score of +1 Now we know that Susan and John scores are equivalent. Compared to other students, each was an equal distance above average.
Normal Curve When chance events are recorded, some outcomes have a high probability and occur very often Other events have a low probability and occur rarely Distributions resembles a normal curve
Normal Curve Bell-shaped, with large number of scores in the middle, with very few extremely high and low scores
Psychological Traits Follow the Bell Curve Measurements of height, memory span and intelligence are distributed along a normal curve Most people have “average” height, memory ability and intelligence Fewer people found at the extremes
Standard Deviation and the Normal Curve Standard deviation measures the proportion of curve above and below the mean 68 percent of all cases fall between one standard deviation above and below the mean
The Real Extremes! 9 percent of all cases fall between 2 standard deviations above and below the mean 99 percent of all cases fall between 3 standard deviations of the mean
Show the Percentages relative to Standard Deviations
Relationship between standard deviations and the normal curve
Standard scores relationship to the Normal Curve Z-scores of either – or + 3 are very extreme—99.9 or 00.1 Z-scores of – or + 2 are also extreme: 97.7 or 02.3 Z-scores of plus or minus 1 are 84 or 16
Why need statistics? Results of psychological studies often expressed as numbers These numbers need to be summarized and interpreted to have any meaning Summarizing numbers with graphs makes it easier to see patterns
Graphical statistics Descriptive statistics organize and summarize numbers Histograms and Polygons represent numbers pictorially.
Two basic questions about groups of numbers What is the central tendency of the group of numbers? How much do the group of numbers vary—or what is the variability?
Measures of Central Tendencies Mean is the average score: add all scores and divide by total # of scores Median found by arranging scores from highest to lowest and selecting the middle score Mode is score that occurs most often
Measures of Variability Range is the difference between the highest score and the lowest score Standard deviation shows how, on average, all the scores differ from the mean
Standard Scores To change an original score into a standard score (or z-score), you subtract the mean from the score and divide it by the standard deviation This allows for meaningful comparisons between scores from different groups
Standard Deviations and the Normal Curve Any one standard score can be placed on a normal curve relative to how it compares to other scores. Some scores are close to the mean while other scores might be way above or below the mean—by 2 or even 3 standard deviations.
Comprehension Check Let’s say you ask 100 people how many minutes they sleep each night and record their answers. How could you show these scores graphically?
Comprehension Check To find the average amount of sleep for your subjects, would you prefer to know the most frequent score (mode), the middle score (median), or the arithmetic average (mean)?
Comprehension Check How could you determine how much sleep times vary? That is, would you prefer to know the highest and lowest scores (range) or the average amount of variation (standard deviation)?
Comprehension Check Do you think that the distribution of minutes of sleep would form a normal curve? Why or why not? If the number of minutes of sleep that Subject A reports is two standard deviations above the mean, what percentage of people would sleep as much or more than s/he does?
Comprehension Check If the numbers of minutes Subject B reports that he sleeps one standard deviation below the mean, how many people in the study sleep the same or less? If instead Subject B reports that he sleeps two standard deviations below the mean, how many people in the study sleep the same or less?
Comprehension Check __________________ statistics summarize numbers so they become more meaningful or easier to communicate. The measure of central tendency that provides the average is the ________
Comprehension Check If scores are placed in order, from the smallest to the largest, the ________ defined as the middle score. As a measure of variability, the ______ Is defined as the difference between the highest and the lowest score.
Comprehension Check As a measure of variability, the ________________ provides the average amount of variation. A z-score of -1 tells us that the score stands ______ standard deviation below the mean.
Comprehension Check A z-score of +2 tells us that the score stands two standard deviations _____ the mean. In a normal curve 99 percent of all scores can be found between ____ and ____ standard deviations from the mean
Inferential Statistics Let’s say a researcher studies the effects of a new therapy on a small group of depressed people. The researcher would like to know if the results of her study holds true for all depressed people. Inferential statistics provide techniques that allow researchers to determine if their results are generalizable or not.
Samples and Populations Scientific studies wish to make conclusions about entire populations of subjects—such as all cancer patients or all married couples. However, it is totally impractical to study the entire population of cancer patients or the entire population of married couples. How is this problem resolved?
Representative Samples Samples—that is smaller cross sections of a population—are studies to draw conclusions about the entire population. For any sample to be meaningful, it must be representative. That is, it must truly reflect the characteristics of the entire population.
How obtain representative samples? A very important aspect of choosing a sample is through random selection— chosen totally at random. That would mean that each member of the population must have an equal chance of being included in the sample.
Significant Differences Let’s say that in a memory experiment, it was found that the average memory score was higher for the group given the drug than for those who did not take the drug. How can researchers determine if this difference wasn’t just by chance?
Tests of Significant Differences Any experimental result that could have occurred by chance 5 times or less our of 100 (probability of.05) is considered to be a significant result. In the memory experiment, they find that the probability is.025 that the group means would differ as much as they do. This allows the conclusion that, with reasonable certainty, the drug did improve the scores.
Correlations—Rating Relationships Psychologists are very interested in detecting relationships between events Are children from single-parent families more likely to achieve in school? Is wealth related to happiness Is the chance of having a heart attack related to being a hostile person?
Visualizing a Correlation Construct scatter diagrams—one variable being on the X axis and the other being on the Y axis. Variables with a positive correlation means that when one goes up the other also goes up Also, when one variable goes down, the other goes down
Positive Correlations Amount a student studies for a test and their test performance The more a student studies, the higher their test score will be. The less a student studies, the lower their test score will be.
Positive Correlation How frequently parents read to their child and the child’s reading ability. The more parents read to their child, the better their child’s reading ability. The less parents read to their child, the lower their child’s reading ability.
Strength of Correlation The strength of the positive correlation between studying and test scores could be very strong for example. But the strength of this positive correlation for some types of tests might be fairly low, more like +36.
Scatter Diagrams for Variables with positive correlations Increases in the X variable are match by increases in the Y variable.
As German mark goes up, so does the French mark
Positive Correlation between Democracy and Capitalism
Negative Correlations Two variables have a negative association if the presence of one predicts the absence of the other Talking to your friends during class will predict low comprehension of class content. Watching lots of TV predicts a low GPA.
Diagrams for Variables with a Negative Correlation As values of one variable increases, the values of other variable decreases.
Some variables have an insignificant association
Correlation Coefficient The coefficient is simply a number falling somewhere between and –1.00. If the number is close to zero, it indicates a very weak or nonexistent relationship.
Correlation Coefficient If the correlation is +1.00, a perfect positive relationship exists. If the correlation is –1.00, a perfect negative relationship exists. Most correlations of significance have a strength somewhere between.90 and.30.
Pearson r: Statistic used to determine strength of correlations The most commonly used correlation coefficient is called the Pearson r. Correlations often provide useful information, like knowing there is a correlation between smoking and lung cancer
Correlations allow for Predictions NOT Explanations Colleges use students GPA’s and scores on the ACT/SAT to help predict which students will have the best chance to succeed. Such predictions are not perfect but they are useful to a certain extent for screening applicants
How to increase predictive power of correlation coefficients Square correlation coefficient—i.e., multiply r by itself Get a number telling the percent of variance accounted for by correlation Correlation between IQ and GPA is.6 Multiply.6 by.6 and learn 36 percent of variation in college grads accounted for by knowing IQ scores
Correlation and Causation Finding correlation between two variables does not automatically mean one causes the other Correlation does not demonstrate causation
What does a correlation mean? When a correlation exists, the best we can say is that two variables are related Not mean that it is impossible for two correlated variables to have a cause-effect relationship Means cannot conclude causal link solely on correlation
Reasons variables can be correlated Two correlated measures can be related as a result of a third variable The more hours students devote to studying, the better their grades Possible that grades and amount of study both related to motivation or interest a student has
Comprehension Check In inferential statistics, observations of a _________________ are used to make inferences and conclusions about an entire population. A ________________ sample must reflect characteristics of the entire population.
Comprehension Check A representative sample can be obtained by selecting members of the sample at _______________ The results of an experiment to be considered statistical significant only if the results could not have occurred by ______________
Comprehension Check If the results of an experiment could only have occurred by chance ______ times out of 100, then the results would be considered statistically significant. If the results could have occurred 20 times out of 100, the result would not be considered __________. Why?
Comprehension Check A scatter diagram can be used to plot and visualize a ________________ between two groups of scores. If two variables have a negative correlation, increases in one variable will correspond to _________ in the other variable.
Comprehension Check In a positive correlation, increases in X correspond to __________ in Y. It is important to remember that a correlation does not demonstrate ________________
Comprehension Check This scatter diagram shows variables that have a _________ correlation.
Correlation Check This scatter diagram shows two variables that have a _________ correlation.
Comprehension Check This scatter diagram shows variables that have close to zero association.