NORMAL DISTRIBUTION AND ITS APPLICATION

INTRODUCTION
Statistically, a population is the set of all possible values of a variable. Random selection of objects from the population makes the variable a random variable (it involves a chance mechanism).
Example: Let x be the weight of a newborn baby. x is a random variable representing the weight of the baby. The weight of a particular baby is not known until he or she is born.

Discrete random variable: If a random variable can only take values that are whole numbers, it is called a discrete random variable.
Examples:
• No. of daily admissions
• No. of boys in a family of 5
• No. of smokers in a group of 100 persons
Continuous random variable: If a random variable can take any value within an interval, it is called a continuous random variable.
Examples: weight, height, age and blood pressure (BP).

Continuous Probability Distributions
• A continuous distribution has an infinite number of values between any two values assumed by the continuous variable
• As with other probability distributions, the total area under the curve equals 1
• The relative frequency (probability) of occurrence of values between any two points on the x-axis is equal to the area bounded by the curve, the x-axis, and perpendicular lines erected at the two points on the x-axis

The Normal or Gaussian distribution is the most important continuous probability distribution in statistics. The term "Gaussian" refers to Carl Friedrich Gauss, who developed this distribution. The word "normal" here does not mean "ordinary" or "common", nor does it mean "disease-free". It simply means that the distribution conforms to a certain formula and shape.

Histograms
• A kind of bar or line chart
• Values on the x-axis (horizontal)
• Numbers (frequencies) on the y-axis (vertical)
• Normal distribution is defined by a particular shape
  - Symmetrical
  - Bell-shaped

Histogram
Figure 1: Histogram of ages of 60 subjects

A Perfect Normal Distribution

Gaussian Distribution
• Many biologic variables follow this pattern
  - Hemoglobin, cholesterol, serum electrolytes, blood pressure, age, weight, height
• One can use this information to define what is normal and what is extreme
• In clinical medicine, the central 95% of values, or about 2 standard deviations around the mean, is taken as normal
• Clinically, 5% of "normal" individuals are therefore labeled as extreme/abnormal
• We just accept this and move on.

Normal distribution
• Most important distribution in statistics
• Also called the Gaussian distribution
• Density given by
  $f(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{(x-\mu)^2}{2\sigma^2}}$ for $-\infty < x < \infty$
• where $\mu$ is the mean and $\sigma$ the standard deviation
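
The density formula on this slide can be evaluated directly. The following is a minimal sketch in Python; the function name `normal_pdf` and the example values (mean 90, standard deviation 4) are illustrative assumptions, not part of the original slides.

```python
import math

def normal_pdf(x, mu=0.0, sigma=1.0):
    """Normal density with mean mu and standard deviation sigma, evaluated at x."""
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    coeff = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    exponent = -((x - mu) ** 2) / (2.0 * sigma ** 2)
    return coeff * math.exp(exponent)

# Illustrative values only: height of the curve at the mean and one SD either side.
at_mean = normal_pdf(90.0, mu=90.0, sigma=4.0)
one_sd_below = normal_pdf(86.0, mu=90.0, sigma=4.0)
one_sd_above = normal_pdf(94.0, mu=90.0, sigma=4.0)
print(at_mean, one_sd_below, one_sd_above)
```

The two values one standard deviation away from the mean come out equal, which is the symmetry of the curve, and the largest value occurs at the mean.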

Area under a smooth curve
• Integration of the density function over the range a to b
• A density function is a formula used to represent the distribution of a continuous random variable
• A nonnegative function f(x) is called a probability distribution or probability density function of the continuous random variable X if the total area bounded by its curve and the x-axis is equal to 1, and if the sub-area bounded by the curve, the x-axis, and perpendiculars erected at any two points a and b gives the probability that X is between a and b
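
To make "probability as area" concrete, the sketch below approximates the area under a standard normal density with the trapezoid rule. The helper names `normal_pdf` and `area_under_curve`, the choice of the trapezoid rule, and the integration limits are assumptions made for illustration; in practice tables or a library function would be used instead.

```python
import math

def normal_pdf(x, mu=0.0, sigma=1.0):
    """Normal density with mean mu and standard deviation sigma."""
    coeff = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    return coeff * math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))

def area_under_curve(f, a, b, n=10_000):
    """Trapezoid-rule approximation of the integral of f from a to b."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for i in range(1, n):
        total += f(a + i * h)
    return total * h

# Total area: the tails beyond 10 standard deviations are negligible.
total_area = area_under_curve(normal_pdf, -10.0, 10.0)
# Probability that X falls between a = -1 and b = 1.
p_between = area_under_curve(normal_pdf, -1.0, 1.0)
print(f"total area ~ {total_area:.6f}")
print(f"P(-1 < X < 1) ~ {p_between:.4f}")
```

The total comes out very close to 1, and the area between the two perpendiculars at -1 and 1 is the probability of a value in that range.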

Gaussian or Normal Distribution Curve

Characteristics of Normal Distribution
• Symmetrical about the mean, $\mu$
• Mean, median, and mode are equal
• Total area under the curve above the x-axis is one square unit
• 1 standard deviation on both sides of the mean includes approximately 68% of the total area
• 2 standard deviations includes approximately 95%
• 3 standard deviations includes approximately 99.7%
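
The percentages above can be checked numerically. For a normal variable, the probability of falling within k standard deviations of the mean is erf(k / sqrt(2)); the sketch below uses that identity with Python's standard `math.erf`. The function name `within_k_sd` is a hypothetical helper.

```python
import math

def within_k_sd(k):
    """Probability that a normal variable lies within k standard deviations of its mean."""
    return math.erf(k / math.sqrt(2.0))

for k in (1, 2, 3):
    print(f"within {k} SD: {within_k_sd(k):.4f}")
```

The three values are roughly 0.6827, 0.9545 and 0.9973, which is why the rule is usually quoted as 68%, 95% and 99.7%.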

Characteristics of the Normal Curve
• Values on the horizontal axis are Z values, measured in standard deviations from the mean; areas under the curve are probabilities, ranging from 0 to 1
• The mean is the center, and distances in standard deviations account for proportions of the population
  - Within 1 SD of the mean: about 68% of values
  - Within 2 SD of the mean: about 95% of values
  - Within 3 SD of the mean: about 99.7% of values

Characteristics of the Normal Distribution
• Normal distribution is completely determined by the parameters $\mu$ and $\sigma$
• Different values of $\mu$ shift the distribution along the x-axis
• Different values of $\sigma$ determine the degree of flatness or peakedness of the graph
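
A small sketch of how the two parameters act on the curve, using `statistics.NormalDist` from the Python standard library (Python 3.8 or later). The three parameter pairs are arbitrary illustrative choices.

```python
from statistics import NormalDist

# Three illustrative curves: a baseline, a shifted mean, and a larger SD.
curves = {
    "mu=0, sigma=1": NormalDist(mu=0.0, sigma=1.0),
    "mu=2, sigma=1": NormalDist(mu=2.0, sigma=1.0),
    "mu=0, sigma=2": NormalDist(mu=0.0, sigma=2.0),
}

for label, dist in curves.items():
    # The peak of a normal curve sits at the mean.
    peak_height = dist.pdf(dist.mean)
    print(f"{label}: peak at x = {dist.mean}, height = {peak_height:.4f}")
```

Changing the mean moves the peak without changing its height, while doubling the standard deviation halves the peak height and spreads the curve out.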

Applications of Normal Distribution
• Frequently, data are normally distributed
  - Essential for some statistical procedures
  - If not, it may be possible to transform the data to a more normal form
• Approximations for other distributions
• Because of the frequent occurrence of the normal distribution in nature, much statistical theory has been developed for it.

What's so Great about the Normal Distribution?
• If you know two things, you know everything about the distribution
  - Mean
  - Standard deviation
• You know the probability of any range of values arising

Standardised Scores
• My diastolic blood pressure is 100
  - So what?
• Normal is 90 (for my age and sex)
  - Mine is high
  - But how high?
• Express it in standardised scores
  - How many SDs above the mean is that?

• Mean = 90, SD = 4 (my age and sex)
• $z = (100 - 90) / 4 = 2.5$
• This is a standardised score, or z-score
• Can consult tables (or computer)
• See how often a score this high (or higher) occurs
• 99.38% of people have lower scores
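
The blood pressure example can be reproduced with a computer instead of a table. This is a minimal sketch using `statistics.NormalDist` from the Python standard library; the variable names are illustrative, and the numbers (100, 90, 4) are the ones given on the slides.

```python
from statistics import NormalDist

mean_bp = 90.0   # mean diastolic pressure for this age and sex (from the slide)
sd_bp = 4.0      # standard deviation (from the slide)
my_bp = 100.0    # observed value (from the slide)

# Standardised score: number of SDs above the mean.
z = (my_bp - mean_bp) / sd_bp

# Area under the standard normal curve to the left of z.
share_lower = NormalDist().cdf(z)
share_as_high_or_higher = 1.0 - share_lower

print(f"z = {z:.1f}")                                        # z = 2.5
print(f"lower scores: {share_lower:.2%}")                    # 99.38%
print(f"this high or higher: {share_as_high_or_higher:.2%}") # 0.62%
```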

A Z-score Table

Standard Normal Distribution
• The normal distribution is really a family of curves determined by $\mu$ and $\sigma$
• The standard normal distribution is the one with $\mu = 0$ and $\sigma = 1$
• Standard normal density given by
  $f(z) = \frac{1}{\sqrt{2\pi}}\, e^{-\frac{z^2}{2}}$ for $-\infty < z < \infty$
• where $z = (x - \mu) / \sigma$
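
The reason a single standard normal table serves every normal distribution can be shown with one change of variable. In the sketch below, $\Phi$ denotes the area under the standard normal curve to the left of a point; that symbol is introduced here for convenience and does not appear on the slides.

```latex
\begin{align*}
P(a < X < b)
  &= \int_{a}^{b} \frac{1}{\sigma\sqrt{2\pi}}\,
     e^{-\frac{(x-\mu)^2}{2\sigma^2}}\, dx \\
  &= \int_{\frac{a-\mu}{\sigma}}^{\frac{b-\mu}{\sigma}} \frac{1}{\sqrt{2\pi}}\,
     e^{-\frac{z^2}{2}}\, dz
     \qquad \text{substituting } z = \frac{x-\mu}{\sigma},\ dx = \sigma\, dz \\
  &= \Phi\!\left(\frac{b-\mu}{\sigma}\right) - \Phi\!\left(\frac{a-\mu}{\sigma}\right)
\end{align*}
```

So a probability for any normal variable X becomes a probability for the standard normal variable z once the endpoints are converted to z-scores.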

Standard Normal Distribution
• To find the probability that z takes on a value between any two points on the z-axis, we need to find the area bounded by perpendiculars erected at these points, the curve, and the z-axis
• Values are tabled.
• The standard normal distribution is symmetric

Examples of Standard Normal Distribution
• Height and weight
• Calculate z-statistics
  - Pr(X < x)
  - Pr(X > x)
  - Pr(x1 < X < x2)
• Why?
  - Determine percentiles
  - Comparisons between different distributions
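
The three kinds of probability listed above, plus a percentile and a comparison across distributions, can be sketched with `statistics.NormalDist`. The height and weight parameters (170 cm with SD 10 cm, 70 kg with SD 12 kg) and the observed values are hypothetical numbers chosen for illustration; they are not from the slides.

```python
from statistics import NormalDist

# Hypothetical population parameters, for illustration only.
heights = NormalDist(mu=170.0, sigma=10.0)   # cm
weights = NormalDist(mu=70.0, sigma=12.0)    # kg

p_below = heights.cdf(180.0)                          # Pr(X < 180)
p_above = 1.0 - heights.cdf(180.0)                    # Pr(X > 180)
p_between = heights.cdf(180.0) - heights.cdf(160.0)   # Pr(160 < X < 180)

# Percentile: the height below which 90% of the population falls.
p90_height = heights.inv_cdf(0.90)

# Comparison between distributions: put both measurements on the z scale.
z_height = (185.0 - heights.mean) / heights.stdev
z_weight = (82.0 - weights.mean) / weights.stdev

print(f"Pr(X < 180) = {p_below:.4f}")
print(f"Pr(X > 180) = {p_above:.4f}")
print(f"Pr(160 < X < 180) = {p_between:.4f}")
print(f"90th percentile of height = {p90_height:.1f} cm")
print(f"z for height 185 cm = {z_height:.2f}, z for weight 82 kg = {z_weight:.2f}")
```

Under these assumed parameters, a height of 185 cm is more unusual than a weight of 82 kg, because it lies further from its mean in SD units.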

Normal Distributions Go Wrong
• Wrong shape
  - Non-symmetrical: skew
  - Too fat or too narrow: kurtosis
• Aberrant values
  - Outliers

Effects of Non-Normality
• Skew
  - Biases parameter estimates
  - E.g. mean
• Kurtosis
  - Doesn't affect parameter estimates
  - Does affect standard errors
• Outliers
  - Depends

Distributions
• Bell-shaped (also known as "symmetric" or "normal")
• Skewed:
  - positively (skewed to the right): it tails off toward larger values
  - negatively (skewed to the left): it tails off toward smaller values
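
The direction of skew can be measured, and its pull on the mean seen, with a short calculation. The sketch below uses the moment coefficient of skewness (third central moment divided by the 1.5 power of the second central moment), which is one common definition among several. The function name `skewness` and the small data sets are made up for illustration.

```python
from statistics import mean, median

def skewness(values):
    """Moment coefficient of skewness: m3 / m2**1.5 (population form)."""
    n = len(values)
    m = mean(values)
    m2 = sum((v - m) ** 2 for v in values) / n
    m3 = sum((v - m) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

# Made-up data sets, for illustration only.
symmetric = [1, 2, 3, 4, 5]
right_skewed = [1, 1, 2, 2, 3, 4, 6, 9, 15, 30]   # tails off toward larger values
left_skewed = [-v for v in right_skewed]          # mirror image

for name, data in [("symmetric", symmetric),
                   ("right-skewed", right_skewed),
                   ("left-skewed", left_skewed)]:
    print(f"{name}: skewness = {skewness(data):.2f}, "
          f"mean = {mean(data):.1f}, median = {median(data):.1f}")
```

In the right-skewed data the mean is pulled above the median by the long upper tail, and in the mirrored data it is pulled below, which illustrates how skew can bias the mean as a summary of the centre.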

Kurtosis

Outliers

Dealing with Outliers
• Error
  - Data entry error
  - Correct it
• Real value
  - Difficult
  - Delete it

ANY QUESTIONS?