Download presentation
Presentation is loading. Please wait.
1
Module 8: Two-Way Frequency Tables
Essential Questions: How can categorical data for two categories be summarized? How can you recognize possible association and trends between two categories of categorical data?
2
Data Collected Categorical Variable - Usually an adjective
Rarely a number Examples: Gender Race Grade in School (Freshmen, Soph, Jr., Sr.) Zip Code Quantitative Variable Always a number Must be able to find the mean of the numbers Examples: Weight Height Amount of money in wallet Age
3
Categorical or Quantitative?
1. Survey about whether student buy lunch from the cafeteria or bring lunch from home, doesn’t eat lunch, etc. 2. Experiment where we measure how tall a plant grows. 3. Observation where we count how many people are in each car leaving school. 4. Survey about each student’s shoe size.
4
Two-Way Frequency Table
5
Complete the Tables…
6
Complete the Tables…
7
Vocab: Relative Frequency
Joint Relative Frequency: tells what proportion/percentage of the total share TWO characteristics. Example: percentage of freshmen boys. Marginal Relative Frequency: tells what proportion/percentage of the total share ONE characteristic Example: percentage of boys.
8
1. What percent of students are boys that prefer soccer. 2
1. What percent of students are boys that prefer soccer? 2. What percent of students prefer football? 3. What is the joint relative frequency of girl and basketball? 4. What is the marginal relative frequency of soccer?
9
What percentage of students are girls that like oranges?
What percentage of students prefer bananas? What is the joint relative frequency of boy and apple? What is the marginal relative frequency of orange?
10
Vocab: Conditional Frequency
Conditional Frequency: describes what proportion of the population share a 2nd characteristic given that they already have one in common. Example: What proportion of students with As are girls? What proportion of girls have blue eyes? The shared/given characteristic is always the denominator.
11
1. What percent of boys prefer soccer. 2
1. What percent of boys prefer soccer? 2. What percent of students that prefer football are girls? 3. What proportion of girls prefer basketball? 4. Given that the student is a boy what is the probability he prefers football?
12
What percentage of girls like oranges?
What percentage of students that prefer bananas are boys? What proportion of boys prefer oranges? Given that the student likes apples, what is the probability that he is a boy?
15
Vocab: Associations in Data
Association: pattern/trend apparent in two-way tables. Similar in concept to correlation. Is there a association between liking aerobic exercise and liking weight lifting?
16
Measures of Center & Spread Graphical Representations
Module 9 Measures of Center & Spread Graphical Representations Outliers & Shape Normal Distributions
17
Measures of Center and Spread
Essential Question: How can you describe & compare data sets?
18
Mean Definition: Average value in data set. How to: Add up all of the values and divide by the number of numbers in the data set.
19
Median Definition: Middle value when the data set is arranged in order from smallest to largest How to: Put values in order. Determine middle value. If two middle values, find average of those two numbers.
21
Range Definition: the difference between the largest and smallest numbers in the data set.
22
Quartiles Definition: Quartiles are divisions representing 25% of the data. Q1 is the 1st quartile or the median of the lower half. Q3 is the 3rd quartile or the median of the upper half. How tow: Put number sin order Find median (middle number). Find median of the lower half. Find median of upper half.
23
Let’s Practice… Calculate the 1st, 2nd (median) and 3rd quartiles for the following data sets: 1. 15, 17, 16, 15, 18, 19, 15, 20, , 8, 9, 7, 6, 9, 8 ,7, 10, 11, 4
24
Interquartile Range (IQR)
Definition: Interquartile Range is the distance between Q1 and Q3. Subtract Q3 – Q1.
25
1. Find the median, IQR and range. 21, 31, 26, 24, 28, 26
26
2. Find the median, IQR and range.
27
Mrs. Daniel AP Stats- Standard Deviation Review
Definition: Standard deviation is used to tell how measurements for a group are spread out from the mean. Low standard deviation = data points tend to be very close to the mean. High standard deviation value = data points are spread out
28
Standard Deviation
29
Rank the Standard Deviations:
30
Rank the Standard Deviations:
31
Rank the Standard Deviations:
32
Center vs. Spread Center Spread
33
Graphical Displays of Data
Essential Question: How can we display data graphically? How can we determine the mean, median and IQR from a graphical display?
34
Displaying Quantitative Data
Histogram Boxplot Dotplot
35
Number of Goals Scored Per Game by the 2004 US Women’s Soccer Team
Dotplots Each data value is shown as a dot above its location on a number line. Number of Goals Scored Per Game by the 2004 US Women’s Soccer Team 3 2 7 8 4 5 1 6
36
How to Make a Dotplot Draw a horizontal axis (a number line) and label it with the variable name. Scale the axis from the minimum to the maximum value. Mark a dot above the location on the horizontal axis corresponding to each data value.
37
Let’s Practice… Create a dot plot. 12 employees at a small company make the following annual salaries (in thousands of dollars): 25, 30, 35, 35, 35, 40, 40, 40, 45, 45, 50 and 60.
38
Histograms Looks like a bar graph, but the bars must touch!
X-axis labeled with number ranges
39
How to Make a Histogram Divide the range of data into classes of equal sizes. Find the count (frequency) of individuals in each class. Label and scale your axes and draw the histogram. The height of the bar equals its frequency. Adjacent bars should touch, unless a class contains no individuals.
41
The histogram shows the length, in days, of Maria’s last vacations
The histogram shows the length, in days, of Maria’s last vacations. Estimate the mean of the data set displayed in the histogram.
42
Box and Whisker Plots A box plot is a graphical display of the minimum, first quartile, median, third quartile, and maximum. The term "box plot" comes from the fact that the graph looks like a rectangle with lines extending from the top and bottom.
43
How to Make a Box & Whisker Plot
Find the median. Find Q1 and Q3. Plot the following points: minimum, Q1, median, Q3 and maximum. Make a whisker from minimum to Q1. Make a box from Q1 to Q3. Draw a line for the median. Make a whisker from Q3 to maximum.
44
Create a Box and Whisker Plot.
Let’s Practice: Create a Box and Whisker Plot. Quiz Scores: 25 30 26 29 22 23 24 28
45
Create a box plot:
46
Let’s Practice… Identify the median, Q1 and Q3, and the IQR
47
Let’s Practice… Identify the median, Q1 and Q3, and the IQR. If you had to pick one career, which one would you pick and why? (Must be a statistical reason!) In tens of thousands of dollars.
51
Outliers & Shape Essential Question: What statistics are most affected by outliers? What shapes can data distributions have?
52
Outlier Outlier: a value that substantially varies from the other values. It is either much larger or much smaller.
53
Outliers Affect… Measure Yes/No Why? Mean Median IQR
Standard Deviation Range
55
Let’s Practice.. Mr. Morris gave his algebra class a test, the results of which are listed below. 68, 92, 74, 75, 86, 90, 92, 81, 60, 82, 77, 80 Shania was absent on the day of the test and had to take the test late. She earned a score of 99. Which measure of the class's test results did Shania's score most change? A. IQR B. Mean C. Median D. Range
56
Calculate the mean, median & IQR…
57
Calculate the mean, median & IQR…
Median: IQR: Compare:
60
Different Shapes of Distributions
Distributions can be described as: Roughly symmetric Skewed right Skewed left
61
Mrs. Daniel AP Stats Shape Definitions: Symmetric: if the right and left sides of the graph are approximately mirror images of each other. Skewed to the right (right-skewed) if the right side of the graph is much longer than the left side. Skewed to the left (left-skewed) if the left side of the graph is much longer than the right side. Symmetric Skewed-left Skewed-right
63
Skew in Box Plots
65
Describe the Shape…
66
Describe the Shape…
67
Other Ways to Describe Shape:
Unimodal Bimodal Multimodal
68
Measures of Center Type of Distribution Best Measure of Center
Measures of Center = Mean and Median Type of Distribution Best Measure of Center Symmetric Skewed Right Skewed Left
69
Why?!?!
70
Which Measure of Center?
72
Distribution Picture Describe Best Measure of Spread? Symmetric Right Skew Left Skew
75
Normal Distributions Essential Question: How can you use characteristics of a normal population to make estimates and probability predictions about the population that the data represent?
76
Vocab: Normal Distribution
Normal distribution: special type of data distribution that allows us to make predictions and estimate probabilities. Normal distributions are symmetric about the mean. Common: Heights, weights, tests scores, etc.
77
Normal Curve Properties
78
Normal Curve Properties
_______ % of the data falls within 1 standard deviation of the mean. _______ % of the data falls within 2 standard deviations of the mean. _______ % of the data falls within 3 standard deviations of the mean.
80
How to Solve Normal Distribution Questions:
Draw a curve. Label mean and standard deviation. Add and subtract standard deviation three times to label each division. Compute.
81
Normal Distribution Curves
Pennies manufactured in the United States after have a Normally distributed mass of 2.50 g with a standard derivation of 0.02g.
82
Normal Distribution Curves
Pennies manufactured in the United States after have a Normally distributed mass of 2.50 g with a standard derivation of 0.02g. What proportion of pennies have a mass greater than 2.50g? What proportion of pennies have a mass between 2.48g and 2.52g? What proportion of pennies have a mass less than 2.48? What proportion of pennies have a mass between and 2.50g?
83
Normal Distribution Curves
15 year-old girls have heights that are Normally distributed with a mean of 64 inches and a standard deviation of 1.5 inches.
84
Normal Distribution Curves
15 year-old girls have heights that are Normally distributed with a mean of 64 inches and a standard deviation of 1.5 inches. What proportion of girls are taller than inches? What proportion of girls are shorter than 64 inches? What proportion of girls have a height between 61 and 65.5 inches?
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.