Chapter 2 Frequency Distributions, Stem-and-leaf displays, and Histograms.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

1 Practical Psychology 1 Week 5 Relative frequency, introduction to probability.
Random Sampling and Data Description
Section #1 October 5 th Research & Variables 2.Frequency Distributions 3.Graphs 4.Percentiles 5.Central Tendency 6.Variability.
Sections 4.1 and 4.2 Overview Random Variables. PROBABILITY DISTRIBUTIONS This chapter will deal with the construction of probability distributions by.
Stem and Leaf Display Stem and Leaf displays are an “in between” a table and a graph – They contain two columns: – The left column contains the first digit.
Chapter 3 The Normal Curve.
The standard error of the sample mean and confidence intervals
Chapter 3 The Normal Curve Where have we been? To calculate SS, the variance, and the standard deviation: find the deviations from , square and sum.
Chapter 1 The mean, the number of observations, the variance and the standard deviation.
PSY 307 – Statistics for the Behavioral Sciences
Chapter 3 The Normal Curve Where have we been? To calculate SS, the variance, and the standard deviation: find the deviations from , square and sum.
Chapter 2 Frequency Distributions, Stem-and- leaf displays, and Histograms.
Measures of Variability or Dispersion
Chapter 4 Translating to and from Z scores, the standard error of the mean and confidence intervals Welcome Back! NEXT.
Chapter 2 Frequency Distributions, Stem-and- leaf displays, and Histograms.
Chapter 3 The Normal Curve Where have we been? To calculate SS, the variance, and the standard deviation: find the deviations from , square and sum.
Chapter 1 The mean, the number of observations, the variance and the standard deviation.
Chapter 2 online slides Chapter 2 Frequency Distributions, Stem-and- leaf displays, and Histograms.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chapter 1 The mean, the number of observations, the variance and the standard deviation.
Chapter 1-6 Review Chapter 1 The mean, variance and minimizing error.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Frequency Distributions and Graphs
Today: Central Tendency & Dispersion
Chapter 2 Describing Data with Numerical Measurements
Describing distributions with numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
Chapter 1 – Exploring Data YMS Displaying Distributions with Graphs xii-7.
COURSE: JUST 3900 TIPS FOR APLIA Developed By: Ethan Cooper (Lead Tutor) John Lohman Michael Mattocks Aubrey Urwick Chapter 2: Frequency Distributions.
APPENDIX B Data Preparation and Univariate Statistics How are computer used in data collection and analysis? How are collected data prepared for statistical.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Graphical Summary of Data Distribution Statistical View Point Histograms Skewness Kurtosis Other Descriptive Summary Measures Source:
CHAPTER 1 Basic Statistics Statistics in Engineering
Methods for Describing Sets of Data
Probability The definition – probability of an Event Applies only to the special case when 1.The sample space has a finite no.of outcomes, and 2.Each.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 34 Chapter 11 Section 1 Random Variables.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Chapter 2 Describing Data.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
Descriptive Statistics: Presenting and Describing Data.
Thursday August 29, 2013 The Z Transformation. Today: Z-Scores First--Upper and lower real limits: Boundaries of intervals for scores that are represented.
Central Tendency & Dispersion
Chapter SixteenChapter Sixteen. Figure 16.1 Relationship of Frequency Distribution, Hypothesis Testing and Cross-Tabulation to the Previous Chapters and.
Probability Theory Modelling random phenomena. Permutations the number of ways that you can order n objects is: n! = n(n-1)(n-2)(n-3)…(3)(2)(1) Definition:
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
Compare the data sets given below. a. 20, 30, 40, 50, 60, 70 b. 20, 43, 44, 46, 47, 70 c. 40, 43, 44, 46, 47, 50 Med = 45.
1 Frequency Distributions. 2 After collecting data, the first task for a researcher is to organize and simplify the data so that it is possible to get.
Introduction to statistics I Sophia King Rm. P24 HWB
Describing Distributions Statistics for the Social Sciences Psychology 340 Spring 2010.
Chapter 4: Variability. Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability.
Statistics and Data Analysis
Preparing for Algebra Chapter Plan for Problem Solving Pg. P5-P6 Obj: Learn how to use the four-step problem- solving plan.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 4-6 Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor © 2013.
Things you will need in class. zLecture notes from the my website on the internet. yGo to and look for the latest set of.
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
1 ENGINEERING MEASUREMENTS Prof. Emin Korkut. 2 Statistical Methods in Measurements.
Chapter 4 Review December 19, 2011.
Descriptive Statistics: Presenting and Describing Data
An Introduction to Statistics
Organizing and Displaying Data
Probability Key Questions
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
AP Statistics Chapter 16 Notes.
Presentation transcript:

Chapter 2 Frequency Distributions, Stem-and-leaf displays, and Histograms

Where have we been?

To calculate SS, the variance, and the standard deviation: find the deviations from , square and sum them (SS), divide by N (  2 ) and take a square root(  ). Example: Scores on a Psychology quiz Student John Jennifer Arthur Patrick Marie X78357X78357  X = 30 N = 5  = 6.00 X -   (X-  ) = 0.00 (X -  )  (X-  ) 2 = SS =  2 = SS/N = 3.20  = = 1.79

Ways of showing how scores are distributed around the mean zFrequency Distributions, zStem-and-leaf displays z Histograms

Some definitions zFrequency Distribution - a tabular display of the way scores are distributed across all the possible values of a variable zAbsolute Frequency Distribution - displays the count of each score. zCumulative Frequency Distribution - displays the total number of scores at and below each score. zRelative Frequency Distribution - displays the proportion of each score. zRelative Cumulative Frequency Distribution - displays the proportion of scores at and below each score.

Example Data Traffic accidents by bus drivers Studied 708 bus drivers. Recorded all accidents for a period of 4 years. Data looks like: 3, 0, 6, 0, 0, 2, 1, 4, 1, … 6, 0, 2

Frequency Distributions # of accidents Absolute Freq Relative Frequency Calculate relative frequency. Divide each absolute frequency by the N. For example, 117/708 =.165 Notice rounding error

What can you answer? # of accidents Relative Freq Proportion with at most 1 accident? Proportion with 8 or more accidents? = = * 100 = 38.7% = =.014 = 1.4% Proportion with between 4 and 7 accidents? = =.212 = 21.2%

Cumulative Frequencies # of acdnts Absolute Frequency Cumulative Frequency Cumulative Relative Frequency Cumulative frequencies show number of scores at or below each point. Calculate by adding all scores below each point. Cumulative relative frequencies show the proportion of scores at or below each point. Calculate by dividing cumulative frequencies by N at each point.

Grouped Frequency Example 100 High school students’ average time in seconds to read ambiguous sentences. Values range between 2.50 seconds and 2.99 seconds.

Grouped Frequencies Needed when ynumber of values is large OR yvalues are continuous. To calculate group intervals yFirst find the range. yDetermine a “good” interval based on xon number of resulting intervals, xmeaning of data, and xcommon, regular numbers. yList intervals from largest to smallest.

Grouped Frequencies Reading Time Reading Time Frequency Frequency Range = =.49 ~.50 i =.1 #i = 5 i =.05 #i = 10

Either is acceptable. zUse whichever display seems most informative. zIn this case, the smaller intervals and 10 category table seems more informative. zSometimes it goes the other way and less detailed presentation is necessary tp prevent the reader from missing the forest for the trees.

Stem and Leaf Displays zUsed when seeing all of the values is important. zShows ydata grouped yall values yvisual summary

Stem and Leaf Display zReading time data Reading Time Leaves 5,5,6,6,6,6,8,8,9 0,0,1,2,3,3,3 5,5,5,5,5,6,6,6,7,7,7,7,7,7,7,8,9,9,9,9 0,0,1,2,3,3,3,3,4,4,4 5,5,5,5,6,6,6,8,9,9 0,0,0,1,2,3,3,3,4,4 5,6,6,6 0,1,1,1,2,3,3,4 6,6,8,8,8,8,8,9,9,9 0,1,1,1,2,2,2,4,4,4,4 i =.05 #i = 10

Stem and Leaf Display zReading time data Reading Time Leaves 0,0,1,2,3,3,3,5,5,6,6,6,6,8,8,9 0,0,1,2,3,3,3,3,4,4,4,5,5,5,5,5,6,6,6,7,7,7,7,7,7,7,8,9,9,9,9 0,0,0,1,2,3,3,3,4,4,5,5,5,5,6,6,6,8,9,9 0,1,1,1,2,3,3,4,5,6,6,6 0,1,1,1,2,2,2,4,4,4,4,6,6,8,8,8,8,8,9,9,9 i =.1 #i = 5

Transition to Histograms

Histogram of reading times Reading Time (seconds) FrequencyFrequency

Histogram concepts - 1 zUsed to display continuous data. zDiscrete data are shown on a box graph. zBut most psychology data are continuous, even if they are measured with integers.

Histogram concepts - 2 zUse bar graphs, not histograms, for discrete data. zYou rarely see data that is really discrete. zDiscrete data are categories or rankings. zIf you have continuous data, you can use histograms, but remember real class limits. zHistograms can be used for relative frequencies as well.

What are the real limits of each class? Real limits of the fifth class are ???? - ???? Real limits of the highest class are ???? - ????. FrequencyFrequency

What are the real limits of each class? Real limits of the fifth class are Real limits of the highest class are FrequencyFrequency

Predicting from Theoretical Distributions zTheoretical distributions show how scores can be expected to be distributed around the mean. (Mean = for reading data). zDistributions are named after the shapes of their histograms: yRectangular yJ-shaped yBell (Normal) ymany others

Rectangular Distribution of scores

Flipping a coin 100 flips - how many heads and tails do you expect? Heads Tails

Rolling a die 120 rolls - how many of each number do you expect?

Rolling 2 dice How many combinations are possible? Dice Total Absolute Freq Relative Frequency

Rolling 2 dice 360 rolls - how many of each number do you expect?

Normal Curve

J Curve Occurs when socially normative behaviors are measured. Most people follow the norm, but there are always a few outliers.

Principles of Theoretical Curves zExpected frequency = Theoretical relative frequency * N zExpected frequencies are your best estimates because they are closer, on the average, than any other estimate when we square the error. zLaw of Large Numbers - The more observations that we have, the closer the relative frequencies should come to the theoretical distribution.

Q & A: Continuous data zHOW IS THE FACT THAT WE ARE DISPLAYING CONTINUOUS DATA SHOWN ON A HISTOGRAM AS OPPOSED TO A BAR GRAPH? zThe bars of the graph on a histogram meet at the real limits of each interval. zIF DATA CAN ONLY BE INTEGERS (SUCH AS NUMBER OF TRUE/FALSE QUESTIONS ANSWERED CORRECTLY ON A PSYCH QUIZ), HOW COME IT IS CALLED CONTINUOUS DATA. zWhether data is continuous or discrete depends on what your measuring, not the accuracy of your measuring instrument. For example, distance is continuous whether you measure it with a yardstick or a micrometer. Knowledge, like self-confidence and other psychological variables, is probably best thought of as a continuous variable.

Determining “i” (the size of the interval) zWHAT IS THE RULE FOR DETERMINING THE SIZE OF INTERVALS TO USE IN WHICH TO GROUP DATA? zWhatever intervals seems appropriate to most informatively present the data. It is a matter of judgement. Usually we use 6 – 12 same size intervals each of which use intuitively obvious endpoints (e.g., 5s and 0s).