Presentation is loading. Please wait.

Presentation is loading. Please wait.

AP Statistics Student Curriculum Review- Fall 2016

Similar presentations


Presentation on theme: "AP Statistics Student Curriculum Review- Fall 2016"— Presentation transcript:

1 AP Statistics Student Curriculum Review- Fall 2016
Mrs. Daniel Alonzo & Tracy Mourning Sr. High

2 This PowerPoint is posted at: teachers.dadeschools.net/sdaniel

3 Agenda Z-scores & Normal Distributions FRQ: 2011 #1
Calculator Skills Review: Calculating Probabilities/Percentages using z-scores Multiple Choice Practice Calculator Skills Review: Scatterplots, Correlation & Linear Regression FRQ: 2016 #6 FRQ: 2007B #4

4 Two Normal curves, showing the mean µ and standard deviation σ.
Normal Distributions All Normal curves are symmetric, single-peaked, and bell-shaped A Specific Normal curve is described by giving its mean µ and standard deviation σ. Two Normal curves, showing the mean µ and standard deviation σ.

5 Normal Distributions We abbreviate the Normal distribution with mean µ and standard deviation σ as N(µ,σ). Any particular Normal distribution is completely specified by two numbers: its mean µ and standard deviation σ. The mean of a Normal distribution is the center of the symmetric Normal curve. The standard deviation is the distance from the center to the change-of-curvature points on either side.

6 Normal Distributions are Useful…
Normal distributions are good descriptions for some distributions of real data. Normal distributions are good approximations of the results of many kinds of chance outcomes. Many statistical inference procedures are based on Normal distributions. Normal Distributions will appear AGAIN and AGAIN. Chapter 6, Chapter 8, Chapter 9 and Chapter 10!

7 Importance of Standardizing
There are infinitely many different Normal distributions; all with unique standard deviations and means. In order to more effectively compare different Normal distributions we “standardize”. Standardizing allows us to compare apples to apples. We can compare SAT and ACT scores by standardizing.

8 The Standardized Normal Distribution
All Normal distributions are the same if we measure in units of size σ from the mean µ as center. The standardized Normal distribution is the Normal distribution with mean 0 and standard deviation 1.

9 x= variable µ= mean σ= standard deviation Z-score Formula

10 Let’s Practice… Venus Williams has a very fast first serve. Historically, Ms. Williams’ first serve averages 88 mph with a standard deviation of 12 mph. What is the standard normal score (z-score) for a first serve at 75 mph? What is the standard normal score (z-score) for a first serve at 105 mph?

11 Let’s Practice… Venus Williams has a very fast first serve. Historically, Ms. Williams’ first serve averages 88 mph with a standard deviation of 12 mph. A. What is the standard normal score for a first serve at 75 mph? z = 𝟕𝟓 −𝟖𝟖 𝟏𝟐 = B. What is the standard normal score for a first serve at 105 mph? z = 𝟏𝟎𝟓 −𝟖𝟖 𝟏𝟐 = 1.42

12 FRQ: 2011 #1

13

14 FRQ 2011 #1a No, it is not reasonable to believe that the distribution of 40-yard running times is approximately normal, because the minimum time is only 1.33 standard deviations below the mean. In a normal distribution we expect about 3 deviations below the mean. z = 4.4 − = -1.33

15

16 FRQ 2011 #1b The z-score for a player who can lift a weight of 370 pounds is z = 370− = 2.4. The z-score indicates that the amount of weight the player can lift is 2.4 standard deviations above the mean for all previous players in this position. He is very strong!

17

18 FRQ 2011 #1c Player A. Although the z-score are similair for weight, Player A has a significantly lower 40 yard dash time as evidenced by (-1.2 for A vs for B).

19 Calculating Probabilities/ Percentages using z-scores

20 By Hand/Table Using z-score, look up p-value in Standard Normal Table.
Draw and label Normal curve. Use z-score formula, plug in values and solve. Using z-score, look up p-value in Standard Normal Table. Conclude in context.

21 Calculator Draw and label Normal curve.
Plug in lower bound, upper bound, mean and standard deviation. Conclude in context.

22 By Hand vs. Calculator AP awards FULL credit for answers done “by hand/table” or using the calculator. Must show work Hand/Table: z-score formula plugged in Calculator: syntax with labels Lower/upper bounds, mean and standard deviation Calculator leads to less errors and is faster

23 TI-84 Calculator: NormalCDF
2nd, VARS (Distr) 2:normalcdf( Enter the following information: Lower: (the lower bound of the region OR 1^-99) Upper: (the upper band of the region OR 1,000,000) µ: (mean) 𝜃: (standard deviation) Press enter, number that appears is the p-value

24 TI-Nspire: NormalCDF Select Calculator (on home screen), press center button. Press menu, press enter. Select 6: Statistics, press enter. Select 5: Distributions, press enter. Select 2: Normal Cdf, press enter. Enter the following information: Lower: (the lower bound of the region OR 1^-99/-∞) Upper: (the upper band of the region OR 1,000,000/+∞) µ: (mean) 𝜃: (standard deviation) Press enter, number that appears is the p-value

25 Let’s Practice… According to Edmunds, 2015 Honda Civics have an average fuel efficiency of 25 mpg with a standard deviation of 4.5mpg. What is the probability that a randomly selected car with have a gas mileage of 30 or lower?

26 Solution

27 Let’s Practice… According to ACT, the average ACT score for college bound seniors was 20.8 with a standard deviation of 4.8. Jose knows he was in the 82nd percentile. What was his ACT score?

28 Solution

29 Normal Calculations on Calculator
Calculates Example NormalCDF Probability of obtaining a value BETWEEN two values What percent of students scored between 70 and 95 on the test? InvNorm X-value given probability or percentile Tommy scored in the 92nd percentile on the test; what was his raw score? NormalPDF (RARE) Probability of obtaining PRECISELY or EXACTLY a specific x-value What is the probability that Suzy scored exactly a 75 on the test?

30 Let’s Practice… According to ACT, the average ACT score for college bound seniors was 20.8 with a standard deviation of 4.8. A. What percentage of college bound seniors scored lower than 19 on the ACT? B. What percentage of college bound seniors scored between 27 and 32 on the ACT?

31 Let’s Practice… According to ACT, the average ACT score for college bound seniors was 20.8 with a standard deviation of 4.8. A. What percentage of college bound seniors scored lower than 19 on the ACT? Normalcdf(0, 19, 20.8, 4.8)= … B. What percentage of college bound seniors scored between 27 and 32 on the ACT? Normalcdf(27, 32, 20.8, 4.8)= …

32 Let’s Practice… According to ACT, the average ACT score for college bound seniors was 20.8 with a standard deviation of 4.8. C. What percentage of college bound seniors scored a 33 or greater on the ACT? D. If Juan scored in the 90th percentile, what was his ACT score?

33 Let’s Practice… According to ACT, the average ACT score for college bound seniors was 20.8 with a standard deviation of 4.8. C. What percentage of college bound seniors scored a 33 or greater on the ACT? Normalcdf(33, 36, 20.8, 4.8)= … D. If Juan scored in the 90th percentile, what was his ACT score? Invnorm(.90, 20.8, 4.8)= (or 27 on ACT)

34 MC #1 Scores on the ACT college entrance exam follow a bell-shaped distribution with mean 18 and standard deviation 6. Wayne’s standardized score on the ACT was −0.7. What was Wayne’s actual ACT score? (a) 4.2 (b) −4.2 (c) 13.8 (d) 17.3 (e) 22.2

35 MC #1 Scores on the ACT college entrance exam follow a bell-shaped distribution with mean 18 and standard deviation 6. Wayne’s standardized score on the ACT was −0.7. What was Wayne’s actual ACT score? (a) 4.2 (b) −4.2 (c) 13.8 (d) 17.3 (e) 22.2

36 MC #2 Which of the following is least likely to have a nearly Normal distribution? (a) Heights of all female students taking STAT 001 at State Tech. (b) IQ scores of all students taking STAT 001 at State Tech. (c) SAT Math scores of all students taking STAT 001 at State Tech. (d) Family incomes of all students taking STAT 001 at State Tech. (e) All of (a)–(d) will be approximately Normal.

37 MC #2 Which of the following is least likely to have a nearly Normal distribution? (a) Heights of all female students taking STAT 001 at State Tech. (b) IQ scores of all students taking STAT 001 at State Tech. (c) SAT Math scores of all students taking STAT 001 at State Tech. (d) Family incomes of all students taking STAT 001 at State Tech. (e) All of (a)–(d) will be approximately Normal.

38 MC #3 The scores on the real estate licensing exam given in Florida are Normally distribution with a standard deviation of 70. What is the mean test score if 25% of the applicants score above 475? 416 b. 428 c. 468 d. 522 e. Not enough information to answer question.

39 MC #3 The scores on the real estate licensing exam given in Florida are Normally distribution with a standard deviation of 70. What is the mean test score if 25% of the applicants score above 475? 416 b. 428 c. 468 d. 522 e. Not enough information to answer question.

40 MC #4 Polly takes three standardized tests. She scores 600 on all three tests. The scores are Normal distributed. Rank her performance on the three tests. I, II and III b. III, II, and I c. I, III and II d. III, I, and II e. II, I and III

41 MC #4 Polly takes three standardized tests. She scores 600 on all three tests. The scores are Normal distributed. Rank her performance on the three tests. I, II and III b. III, II, and I c. I, III and II d. III, I, and II e. II, I and III

42 MC #5 The heights of American men aged 15 to 24 are approximately normally distributed with a mean of 68 inches and a standard deviation of 2.5 inches. About 20% of these men are taller than… a. 66 inches b. 68 inches c. 70 inches d. 72 inches e. 74 inches

43 MC #5 The heights of American men aged 15 to 24 are approximately normally distributed with a mean of 68 inches and a standard deviation of 2.5 inches. About 20% of these men are taller than… a. 66 inches b. 68 inches c. 70 inches d. 72 inches e. 74 inches

44 Scatterplots & Correlation

45 Scatterplots Make a scatterplot of the relationship between body weight and pack weight. Body weight is our eXplanatory variable. Body weight (lb) 120 187 109 103 131 165 158 116 Backpack weight (lb) 26 30 24 29 35 31 28

46 Making a Scatterplot: TI-84
Using List editor enter data into list1 and list2. Press 2nd, Y = (Stat Plot), 1, enter, Select: On Select: Scatter X: list 1 Y: list2 Select: Box Press “Zoom”, 9

47 Constructing a Scatterplot: TI-Nspire
Enter x values into list 1 and enter y values into list 2. Label each column. Label column x : weight and column y: bpack. Press HOME/On, click Add Data & Statistics

48 Constructing a Scatterplot: TI-Nspire
4. Move the cursor to the bottom of the screen and “click to add variable”. Select weight. 5. Move the cursor to the left of the screen and “click to add variable”. Select bpack.

49 What is Correlation? A mathematical value that describes the strength of a linear relationship between two quantitative variables. Correlation values are between -1 and 1. Correlation is abbreviated: r The strength of the linear relationship increases as r moves away from 0 towards -1 or 1.

50 What does “r” tell us?! Correlation describes what percent of variation in y is ‘explained’ by x. Notice that the formula is the sum of the z-scores of x multiplied by the z-scores of y.

51 Scatterplots and Correlation

52 What does “r” mean? R Value Strength -1 Perfectly linear; negative
-0.75 Strong negative relationship -0.50 Moderately strong negative relationship -0.25 Weak negative relationship nonexistent 0.25 Weak positive relationship 0.50 Moderately strong positive relationship 0.75 Strong positive relationship 1 Perfectly linear; positive

53 Calculate Correlation: TI-84
Enter x values in list 1 and y values in list 2. Press Stats, arrow right to Calc Option 4: LinReg(ax + b) Enter Information: Xlist: L1, Ylist: L2 Calculate Correlation should be 0.79

54 Calculate Correlation: TI-Nspire
Enter x values in list 1 and y values in list 2. Press MENU, then 4: Statistics Option 1: Stat Calculations Option 3: Linear Regression mx + b X: a[] , Y: b[] , ENTER Correlation = r Correlation should be 0.79

55 Facts about Correlation
Correlation requires that both variables be quantitative. Correlation does not describe curved relationships between variables, no matter how strong the relationship is. Correlation is not resistant. r is strongly affected by a few outlying observations. Correlation makes no distinction between explanatory and response variables. r does not change when we change the units of measurement of x, y, or both. r does not change when we add or subtract a constant to either x, y or both. The correlation r itself has no unit of measurement.

56 R: Ignores distinctions between X & Y

57 R: Highly Effected By Outliers

58 Why?! Since r is calculated using standardized values (z-scores), the correlation value will not change if the units of measure are changed (feet to inches, etc.) Adding a constant to either x or y or both will not change the correlation because neither the standard deviation nor distance from the mean will be impacted.

59 Correlation Formula: Suppose that we have data on variables x and y for n individuals. The values for the first individual are x1 and y1, the values for the second individual are x2 and y2, and so on. The means and standard deviations of the two variables are x-bar and sx for the x-values and y-bar and sy for the y- values. The correlation r between x and y is:

60 Least Squares Regressions

61 Regression Lines A regression line summarizes the relationship between two variables, but only in settings where one of the variables helps explain or predict the other. A regression line is a line that describes how a response variable y changes as an explanatory variable x changes. We often use a regression line to predict the value of y for a given value of x.

62 Least-Squares Regression Line
Different regression lines produce different residuals. The regression line we use in AP Stats is Least-Squares Regression. The least-squares regression line of y on x is the line that makes the sum of the squared residuals as small as possible.

63 Regression Line Equation
Suppose that y is a response variable (plotted on the vertical axis) and x is an explanatory variable (plotted on the horizontal axis). A regression line relating y to x has an equation of the form: ŷ = ax + b In this equation, ŷ (read “y hat”) is the predicted value of the response variable y for a given value of the explanatory variable x. a is the slope, the amount by which y is predicted to change when x increases by one unit. b is the y intercept, the predicted value of y when x = 0.

64 Regression Line Equation

65 Format of Regression Lines
𝑦 = x 𝑦 = predicted back pack weight x= student’s weight Format 2: Predicted back pack weight= (student’s weight)

66 TI-NSpire: LSRL Enter x data into list 1 and y data into list 2.
Press MENU, 4: Statistics, 1: Stat Calculations Select Option4: Linear Regression. Insert either name of list or a[] for x and name of list or b[] of y. Press ENTER.

67

68 TI-84: LSRL Enter x values in list 1 and y values in list 2.
Press Stats, arrow right to Calc Option 4: LinReg(ax + b) Enter Information: Xlist: L1, Ylist: L2, StoreRegEQ:Y1 (VARS, arrow right to Y-VARS, enter, enter, 1. Y1) Calculate To view: “Zoom”, 10

69 TI-NSPIRE: LSRL to View Graph
Enter x data into list 1 and y data into list 2. Be sure to name lists Press HOME/ON, Add Data & Statistics Enter variables to x and y axis. Click MENU, 4: Analyze Option 6: Regression Option 2: Show Linear (a + bx), ENTER

70

71 Interpreting Computer Regression Output
Be sure you can locate: the slope, the y intercept and determine the equation of the LSRL. 𝒚 = x 𝒚 = predicted.... x = explanatory variable

72 r2: Coefficient of Determination
r 2 tells us how much better the LSRL does at predicting values of y than simply guessing the mean y for each value in the dataset. In this example, r2 equals 60.6%. 60.6% of the variation in pack weight is explained by the linear relationship with bodyweight. (Insert r2)% of the variation in y is explained by the linear relationship with x.

73 Self Check Quiz! The data is a random sample of 10 trains comparing number of cars on the train and fuel consumption in pounds of coal. What is the regression equation? Be sure to define all variables. What is r2 telling you? Define and interpret the slope in context. Does it have a practical interpretation? Define and interpret the y-intercept in context. What is s telling you?

74 1. ŷ = x ŷ = predicted fuel consumption in pounds of coal x = number of rail cars % of the varation is fuel consumption is explained by the linear realtionship with the number of rail cars. 3. Slope = With each additional car, the fuel consuption increased by pounds of coal, on average. This makes practical sense. 4. Y-interpect = When there are no cars attached to the train the fuel consuption is pounds of coal. This has no practical intrepretation beacuse there is always at least one car, the engine. 5. S= On average, the model under predicts fuel consumption by pounds of coal using the least-squares regression line.

75 FRQ: 2016 #6

76

77 There is a positive linear association between number of semesters studied and starting salary in euros, which supports the newspaper’s claim.

78

79 The slope is , which means for each additional semester studied the starting salary increased by 1,159.4 euros, on average.

80

81

82 There is a strong, negative linear relationship between the number of semesters studied and starting salary when looking at each major individually.

83

84 Business majors have the lowest median starting salary, physics majors are in the middle and chemistry majors have the highest median starting salary.

85

86

87 Within each major there is a negative association between the number of semesters studied and median salary. The students that take longer within each major earn less, on average. However, the programs of study, like chemistry, that require more semesters of study result in higher median earnings.

88

89

90

91 MC #1 If women always married men who were 2 years older than themselves, what would the correlation between the ages of husband and wife be? (a) 2 (b) 1 (c) 0.5 (d) 0 (e) Can’t tell without seeing the data

92 MC #1 If women always married men who were 2 years older than themselves, what would the correlation between the ages of husband and wife be? (a) 2 (b) 1 (c) 0.5 (d) 0 (e) Can’t tell without seeing the data

93 MC #2 Smokers don’t live as long (on average) as nonsmokers, and heavy smokers don’t live as long as light smokers. You perform least-squares regression on the age at death of a group of male smokers y and the number of packs per day they smoked x. The slope of your regression line (a) will be greater than 0. (b) will be less than 0. (c) will be equal to 0. (d) You can’t perform regression on these data. (e) You can’t tell without seeing the data.

94 MC #2 Smokers don’t live as long (on average) as nonsmokers, and heavy smokers don’t live as long as light smokers. You perform least-squares regression on the age at death of a group of male smokers y and the number of packs per day they smoked x. The slope of your regression line (a) will be greater than 0. (b) will be less than 0. (c) will be equal to 0. (d) You can’t perform regression on these data. (e) You can’t tell without seeing the data.

95 MC #3 Measurements on young children in Mumbai, India, found this least-squares line for predicting height (y) from arm span (x): 𝒚 = x. Measurements are in centimeters (cm). How much does height increase on average for each additional centimeter of arm span? (a) 0.93 cm (b) 1.08 cm (c) 5.81 cm (d) 6.4 cm (e) 7.33 cm

96 MC #3 Measurements on young children in Mumbai, India, found this least-squares line for predicting height (y) from arm span (x): 𝒚 = x. Measurements are in centimeters (cm). How much does height increase on average for each additional centimeter of arm span? (a) 0.93 cm (b) 1.08 cm (c) 5.81 cm (d) 6.4 cm (e) 7.33 cm

97 MC #4 Measurements on young children in Mumbai, India, found this least-squares line for predicting height (y) from arm span (x): 𝒚 = x. Measurements are in centimeters (cm). According to the regression line, the predicted height of a child with an arm span of 100 cm is about (a) cm. (b) 99.4 cm. (c) 93 cm. (d) 15.7 cm. (e) 7.33 cm.

98 MC #4 Measurements on young children in Mumbai, India, found this least-squares line for predicting height (y) from arm span (x): 𝒚 = x. Measurements are in centimeters (cm). According to the regression line, the predicted height of a child with an arm span of 100 cm is about (a) cm. (b) 99.4 cm. (c) 93 cm. (d) 15.7 cm. (e) 7.33 cm.

99 MC #5 Measurements on young children in Mumbai, India, found this least-squares line for predicting height (y) from arm span (x): 𝒚 = x. Measurements are in centimeters (cm). One child in the Mumbai study had height 59 cm and arm span 60 cm. This child’s residual is (a) −3.2 cm. (b) −2.2 cm. (c) −1.3 cm. (d) 3.2 cm. (e) 62.2 cm.

100 MC #5 Measurements on young children in Mumbai, India, found this least-squares line for predicting height (y) from arm span (x): 𝒚 = x. Measurements are in centimeters (cm). One child in the Mumbai study had height 59 cm and arm span 60 cm. This child’s residual is (a) −3.2 cm. (b) −2.2 cm. (c) −1.3 cm. (d) 3.2 cm. (e) 62.2 cm.

101 FRQ: 2007B #4

102 2007B #4a & b

103 2007B #4c The slope would stay the same, since the new point fits the existing pattern. The correlation coefficient would increase because the additional points fits the existing pattern and thus makes the relationship even stronger. A strong relationship results in a great correlation coefficient.


Download ppt "AP Statistics Student Curriculum Review- Fall 2016"

Similar presentations


Ads by Google