Statistical Analysis – Chapter 4 Normal Distribution

Slides:



Advertisements
Similar presentations
Chapter 5 Some Key Ingredients for Inferential Statistics: The Normal Curve, Probability, and Population Versus Sample.
Advertisements

Estimation of Means and Proportions
Chapter 6 – Normal Probability Distributions
The Normal Distribution
The Normal Distribution
Sampling Distributions
CHAPTER 13: Binomial Distributions
1 Normal Probability Distributions. 2 Review relative frequency histogram 1/10 2/10 4/10 2/10 1/10 Values of a variable, say test scores In.
Chapter 6: Standard Scores and the Normal Curve
Topic 3 The Normal Distribution. From Histogram to Density Curve 2 We used histogram in Topic 2 to describe the overall pattern (shape, center, and spread)
Chapter 6 Introduction to Continuous Probability Distributions
Sampling Distributions for Proportions Allow us to work with the proportion of successes rather than the actual number of successes in binomial experiments.
The Normal Distribution. Distribution – any collection of scores, from either a sample or population Can be displayed in any form, but is usually represented.
Normal Distributions What is a Normal Distribution? Why are Many Variables Normally Distributed? Why are Many Variables Normally Distributed? How Are Normal.
Normal Distribution Z-scores put to use!
Chapter 11: Random Sampling and Sampling Distributions
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Chapter 6: Normal Probability Distributions
Copyright © Cengage Learning. All rights reserved. 6 Normal Probability Distributions.
Basic Statistics Standard Scores and the Normal Distribution.
Chapter 5 Sampling Distributions
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
Chapter 6 The Normal Probability Distribution
Chapter 6: Probability Distributions
The Normal Distribution The “Bell Curve” The “Normal Curve”
Section 7.1 The STANDARD NORMAL CURVE
Probability & the Normal Distribution
Chapter 7: Sampling Distributions
AP Statistics Chapter 9 Notes.
Sampling Distributions of Proportions. Parameter A number that describes the population Symbols we will use for parameters include  - mean  – standard.
Statistics Sampling Distributions
Chapter 6 Probability. Introduction We usually start a study asking questions about the population. But we conduct the research using a sample. The role.
Problem: Assume that among diabetics the fasting blood level of glucose is approximately normally distributed with a mean of 105mg per 100ml and an SD.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
Essential Statistics Chapter 31 The Normal Distributions.
Modular 11 Ch 7.1 to 7.2 Part I. Ch 7.1 Uniform and Normal Distribution Recall: Discrete random variable probability distribution For a continued random.
+ Recitation 3. + The Normal Distribution + Probability Distributions A probability distribution is a table or an equation that links each outcome of.
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 6 Probability Distributions Section 6.2 Probabilities for Bell-Shaped Distributions.
Chapter 3b (Normal Curves) When is a data point ( raw score) considered unusual?
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Thursday August 29, 2013 The Z Transformation. Today: Z-Scores First--Upper and lower real limits: Boundaries of intervals for scores that are represented.
Introduction to Statistics Chapter 6 Feb 11-16, 2010 Classes #8-9
The Normal Distribution
Estimation of a Population Mean
Lecture 6 Normal Distribution By Aziza Munir. Summary of last lecture Uniform discrete distribution Binomial Distribution Mean and Variance of binomial.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
Problem: Assume that among diabetics the fasting blood level of glucose is approximately normally distributed with a mean of 105mg per 100ml and an SD.
MM207 Statistics Welcome to the Unit 6 Seminar Wednesday, March 7, to 9 PM ET.
Chapter 5 Normal Probability Distributions. Chapter 5 Normal Probability Distributions Section 5-1 – Introduction to Normal Distributions and the Standard.
THE NORMAL DISTRIBUTION AND Z- SCORES Areas Under the Curve.
Continuous Distributions. Continuous random variables Are numerical variables whose values fall within a range or interval Are measurements Can be described.
The Normal distribution and z-scores
Binomial Distributions Chapter 5.3 – Probability Distributions and Predictions Mathematics of Data Management (Nelson) MDM 4U.
Chapter 7: The Distribution of Sample Means. Frequency of Scores Scores Frequency.
Probability, Sampling, and Inference Q560: Experimental Methods in Cognitive Science Lecture 5.
Statistical Analysis – Chapter 5 “Central Limit Theorem” Dr. Roderick Graham Fashion Institute of Technology.
The Normal Distribution. Normal and Skewed Distributions.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
THE NORMAL DISTRIBUTION
Copyright ©2011 Brooks/Cole, Cengage Learning Continuous Random Variables Class 36 1.
Chapter 7 Continuous Probability Distributions and the Normal Distribution.
Sampling Distributions
Chapter 12 Statistics 2012 Pearson Education, Inc.
Chapter 5 Sampling Distributions
Sampling Distributions
Normal Distributions and the Empirical Rule
Chapter 12 Statistics.
Presentation transcript:

Statistical Analysis – Chapter 4 Normal Distribution

What is the normal curve? In chapter 2 we talked about histograms and modes A normal distribution is when a set of values for one variable, when displayed in a histogram (or line graph) has one peak (mode) and looks like a bell. Here is an example using height:

Characteristics of the Normal Curve Bell shaped, fading at the tails. In other words, more values are in the middle, and odd or unusual values fall at the tails All (100%) of the data fits on the curve, with 50% before the mean and 50% after 68% of the data falls within -1 and +1 standard deviations of the mean 95% of the data falls between -2 and +2 standard deviations The percentage of data between any two points is equal to the probability of randomly selecting a value between the two points (remember classical probability from Ch. 3)

Standard Deviations and Z-Score Z – scores = the number of standard deviations away from the mean. z-score = x - µ σ (x = data for which we want to know the z-score) We use the characteristics of the normal curve, and the z- score, to find out the probability of a particular event or value occurring (remember classical probability from Chapter 3)

Solving Normal Curve Problems Using Z-Scores (steps listed at bottom of p. 111) Draw a normal curve, showing values for (-2 through +2) Shade the area in question Calculate the z scores and cutoffs (percentages asked for) Use the z-scores and cutoffs to solve the normal curve problem

Find Percentages on the Normal Curve Table Let’s do these questions as a class… What is the percentage of data from z = 0 to z = 0.1? What is the percentage of data from z = 0 to z = 2.16? What is the percentage of data from z = -1.11 to z = 1.11? What is the percentage of data above z = 1.24? What is the percentage of data below z = -0.6? Answers .0398…39.8% .4846…48.46% .3665 + .3665 = .733…73.3% .50 - .3925 = .1075…10.75% .50 - .2257 = .2743…27.43%

Working backwards from percentages… When working backwards from percentages, we still use the normal table…but look for the percentage to give us the z-score… What is the z-score associated 10.2% of the data? What is the z-score(s) for the middle 30% of the normal curve? What is the z-score of data in the upper 25% of the normal curve? Answers z = 0.26 z = -.39 to z = .39 z = 0.67

Let’s do Question 4.2 Use the normal curve table to determine the percentage of data in the normal curve Between z = 0 and z = .82 Above z = 1.15 Between z = -1.09 and z = .47 Between z = 1.53 and z = 2.78 Work backward in the normal curve table to solve the following: 32% of the data in the normal curve data can be found between z = 0 and z = ? Find the z score associated with the lower 5% of the data. Find the z scores associated with the middle 98% of the data.

Question 4.2 Answers Answers to Question 4.2 29.39% 12.51% 54.29% 6.03% Between z = 0 and z = .92, or between z = 0 and z = -.92

Question 4.7 Use the normal curve table to determine the percentage of data in the normal curve Between z = 0 and z = .38 Above z = -1.45 Above z = 1.45 Between z = .77 and z = 1.92 Between z = -.25 and z = 2.27 Between z = -1.63 and z = -2.89 Work backward in the normal curve table to solve the following. 15% of the data in the normal curve can be found between z = 0 and z = ? Find the z score associated with the upper 73.57% of the data. Find the z scores associated with the middle 95%

Question 4.7 Answers 14.80% 92.65% 7.35% 19.32% 58.71% 4.97% z = .39 or -.39 z = -.63 Between z = -1.96 and z = +1.96

Binomial Distributions and Sampling Binomial means two categories in a population… Males and females Sports game players vs. Non sports game players Incomes over 40,000 vs. incomes under 40,000 Quick note: Remember…for binomial distributions, we would visualize this data through a pie chart…because we do not have enough categories for a histogram…

Sampling from a Two-Category Population With two-category populations, we can describe the population by p – the percentage of values in one category This is the same p from the last chapter on probability (classical probability)… P(event) ≈ s (number of chances for success) n (total equally likely possibilities) We know (actually….statisticians know) that if we randomly sampled from a population, then ps ≈ p

Sampling Distribution In order to know the odds of getting certain values from this particular binomial sample, we have to know the sampling distribution from this population. Under certain conditions, the sampling distribution for a binomial value is normal (i.e. the distribution follows the normal curve). When the sampling distribution is normal, then we can make predictions using our table and our z-scores

Sampling from a Binomial Distribution Suppose, we defined a population (full time FIT students who either shop at Hot Topic), and we have made our measure of interest into a binomial distribution – those who shop at Hot Topic and those who do not. Suppose over the last 10 years, marketers have surveyed the FIT population hundreds of times and found that Hot Topic shoppers are p = .13. (those who are non-Hot Topic shoppers is p = .87)

Sampling from a Binomial Distribution But suppose sometime later, your manager asks you to lead another study. But this time, you don’t have enough money to survey the whole population, and you have to get a sample. We can assume, because so many studies have been done in the past that the true value of Hot Topic shoppers is p = .13. Thus, because we know that ps ≈ p, your sample should have approximately the same value.

Sampling from a Binomial Distribution For each sample, we can use the number sampled, and the p value from the population to predict the total number of Hot Topic shoppers. This is called the expected value. Expected value = np Thus, if we collected a sample of 200 FIT students, how many students would we expect to be Hot Topic shoppers? np = (200)(.13) = 26 This expected value is the mean of your sample

Binomial Distribution and the Normal Curve Now, we need to decide if we can use the normal curve to solve problems… If (np) > 5 and n(1 – p)>5…then the sampling distribution will be normally distributed. So, our sample was 200 students. Is (np) > 5? Is n(1 – p)>5? Yes…and yes. np = (200)(.13) = 26 n(1 – p) = (200)(1 - .13) = (200)(.87) = 174

Binomial Distribution and the Normal Curve What do we mean that a sampling distribution is normal? Just like someone’s age is one value among many ages that we tally to make a histogram, we can tally many samples, get the p values of those sample, and construct histograms from these means. If we took say, 1000 samples, and tallied the p values for Hot Topic shoppers, then those values, when turned into a histogram, should form a normal curve. Just like if we took the heights of a 1000 women, and tallied those values to get a normal curve.

How to use the Binomial Distribution and the Normal Curve Get the mean (µ)…the mean is the expected value (np) Get the standard deviation (σ) = √np(1 – p) Draw a normal curve using mean and standard dev Use the “continuity correction factor,” and add +/- half a unit to the value we want to solve for Get the z-scores = x - µ σ Use the normal curve table to solve the problem

Why the “continuity correction factor”? This is only for discrete values (where values occupy only distinct points.) For example, in our study, there is no such thing as a “half” or “3/4” Hot Topic shopper. Either you are a shopper or not. Looking at how histograms are presented, you can see why we have to use the correction factor. Probability of getting a value equal to or greater than (=>), then you must subtract a half-unit Probability of getting a value equal to or lesser than (=<), you must add a half unit. Probability of getting the exact value, you must get the Z-scores for a half-unit above and a half-unit below

Now let’s answer a Hot Topic Question… If you collected a sample of 200 FIT students… What is the probability that 13 will be Hot Topic shoppers? What is the probability that you will have 30 or more Hot Topic shoppers? What is the probability that you will have 25 or less Hot Topic shoppers?

Question What is the probability that 13 will be Hot Topic shoppers? What is the probability that you will have 30 or more Hot Topic shoppers? What is the probability that you will have 25 or less Hot Topic shoppers? Answer Get the mean (µ) = expected value = np = (200)(.13) = 26 Get the standard deviation (σ) = √np(1 – p) = √26(1 - .13) = √26(.87) = √22.62 ≈ 4.76 Draw a normal curve using mean and standard dev. Use the continuity correction factor to correct x. (a) 12.5 and 13.5, (b) 29.5, (c) 25.5 Get the z-scores. (a) -2.83 and -2.62, (b) .735, (c)-.105 Solve the problem… (a) 4977 - .4956 = .002, or 2% (b) .50 - .2704 ≈ .23, or 23%, (c) .50 - .0596 = .4404

Now let’s do question 4.16 as a class… In a marketing population of phone calls, 3% produced a sale. If this population proportion (p = 3%) can be applied to future phone calls, then out of 500 randomly monitored phone calls, How many would you expect to produce a sale? What is the probability of getting 11 to 14 sales? What is the probability of getting 12 or less sales? 15 32.93% 25.46%

Question 4.16 answers Expected value = np = 500(.03) = 15 32.93% 25.46%