Download presentation
Presentation is loading. Please wait.
Published byCornelius Atkins Modified over 9 years ago
1
Chapter 3 concepts/objectives Define and describe density curves Measure position using percentiles Measure position using z-scores Describe Normal distributions Describe and apply the 68-95-99.7 Rule Describe the standard Normal distribution Perform Normal calculations 1
2
2 The 68-95-99.7 Rule In the Normal distribution with mean µ and standard deviation σ: Approximately 68% of the observations fall within σ of µ. Approximately 95% of the observations fall within 2σ of µ. Approximately 99.7% of the observations fall within 3σ of µ. The 68-95-99.7 Rule In the Normal distribution with mean µ and standard deviation σ: Approximately 68% of the observations fall within σ of µ. Approximately 95% of the observations fall within 2σ of µ. Approximately 99.7% of the observations fall within 3σ of µ. σ
3
3 The Standard Normal Distribution All Normal distributions are the same if we measure in units of size σ from the mean µ as center. The standard Normal distribution is the Normal distribution with mean 0 and standard deviation 1. If a variable x has any Normal distribution N(µ,σ) with mean µ and standard deviation σ, then the standardized variable has the standard Normal distribution, N(0,1). The standard Normal distribution is the Normal distribution with mean 0 and standard deviation 1. If a variable x has any Normal distribution N(µ,σ) with mean µ and standard deviation σ, then the standardized variable has the standard Normal distribution, N(0,1).
4
4 Normal Calculations Find the proportion of observations from the standard Normal distribution that are between -1.25 and 0.81. Can you find the same proportion using a different approach? 1 – (0.1056+0.2090) = 1 – 0.3146 = 0.6854
5
5 State: Express the problem in terms of the observed variable x. Plan: Draw a picture of the distribution and shade the area of interest under the curve. Solve: Perform calculations. Standardize x to restate the problem in terms of a standard Normal variable z. Use Table A and the fact that the total area under the curve is 1 to find the required area under the standard Normal curve. Conclude: State the conclusion in the context. State: Express the problem in terms of the observed variable x. Plan: Draw a picture of the distribution and shade the area of interest under the curve. Solve: Perform calculations. Standardize x to restate the problem in terms of a standard Normal variable z. Use Table A and the fact that the total area under the curve is 1 to find the required area under the standard Normal curve. Conclude: State the conclusion in the context. How to Solve Problems Involving Normal Distributions Normal Calculations
6
Chapter 4 concepts/objectives Explanatory and Response Variables Displaying Relationships: Scatterplots Interpreting Scatterplots Measuring Linear Association: Correlation Facts About Correlation 6
7
The Standard Normal Distribution All Normal distributions are the same if we measure in units of size σ from the mean µ as center. The standard Normal distribution is the Normal distribution with mean 0 and standard deviation 1. If a variable x has any Normal distribution N(µ,σ) with mean µ and standard deviation σ, then the standardized variable. has the standard Normal distribution, N(0,1).
8
Measuring Linear Association A scatterplot displays the strength, direction, and form of the relationship between two quantitative variables The correlation r measures the strength of the linear relationship between two quantitative variables. r is always a number between -1 and 1. r > 0 indicates a positive association. r < 0 indicates a negative association. Values of r near 0 indicate a very weak linear relationship. The strength of the linear relationship increases as r moves away from 0 toward -1 or 1. The extreme values r = -1 and r = 1 occur only in the case of a perfect linear relationship.
9
Facts About Correlation 1.Correlation makes no distinction between explanatory and response variables. 2. r has no units and does not change when we change the units of measurement of x, y, or both. 3. Positive r indicates positive association between the variables, and negative r indicates negative association. 4. The correlation r is always a number between -1 and 1. Cautions: Correlation requires that both variables be quantitative. Correlation does not describe curved relationships between variables, no matter how strong the relationship is. Correlation is not resistant. r is strongly affected by a few outlying observations. Correlation is not a complete summary of two-variable data.
10
10 Regression equation: y = a + bx ^ x is the value of the explanatory variable. “y-hat” is the predicted value of the response variable for a given value of x ( based on the line of best fit ). b is the slope, the amount by which y changes for each one-unit increase in x. a is the intercept, the value of y when x = 0. Chapter 5 -- Regression Line
11
11 Least Squares Regression Line To predict y, we want the regression line to be as close as possible to the data points in the y (vertical) direction. Least Squares Regression Line (LSRL): The line that minimizes the sum of the squares of the vertical distances of the data points from the line. For LSRL, the constants a (intercept) and b (slope) are calculated and inserted in the regression line. Regression equation: y = a + bx Calculate b from: Calculate a from: where s x and s y are the standard deviations of the two variables x and y, and r is their correlation. Least Squares Regression Line (LSRL): The line that minimizes the sum of the squares of the vertical distances of the data points from the line. For LSRL, the constants a (intercept) and b (slope) are calculated and inserted in the regression line. Regression equation: y = a + bx Calculate b from: Calculate a from: where s x and s y are the standard deviations of the two variables x and y, and r is their correlation. ^
12
12 An outlier is an observation that lies far away from the other observations. –Outliers in the y direction have large residuals. –Outliers in the x direction are often influential for the least-squares regression line, meaning that the removal of such points would markedly change the equation of the line. Outliers and Influential Points
13
13 Chapter 6 --Two-Way Table, Example What are the variables described by this two-way table? (Hint: Number of columns?) How many young adults were surveyed? (Hint: It is one of the totals in bottom row.)
14
14 Chap 6, Marginal Distribution The Marginal Distribution of one of the categorical variables in a two-way table of counts is the distribution of values of that variable among all individuals described by the table. Note: Percents are often more informative than counts, especially when comparing groups of different sizes. To examine a marginal distribution: 1.Use the data in the table to calculate the marginal distribution (in percents) of the row or column totals. 2. Make a graph to display the marginal distribution.
15
15 Chap. 6 -- Conditional Distribution Marginal distributions tell us nothing about the relationship between two variables. A Conditional Distribution of a variable describes the values of that variable among individuals who have a specific value of another variable. To examine or compare conditional distributions: 1.Select the row(s) or column(s) of interest. 2.Use the data in the table to calculate the conditional distribution (in percents) of the row(s) or column(s). 3.Make a graph to display the conditional distribution. Use a side-by-side bar graph or segmented bar graph to compare distributions. 15
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.