Biostatistics 410.645.01 Class 3 Probability Distributions 2/15/2000.

Slides:



Advertisements
Similar presentations
Chapter 3 Properties of Random Variables
Advertisements

Statistics review of basic probability and statistics.
Biostatistics Unit 4 Probability.
Biostatistics Unit 4 - Probability.
FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.
QUANTITATIVE DATA ANALYSIS
Probability Densities
Chapter 6 Continuous Random Variables and Probability Distributions
Final Review Session.
Ch. 6 The Normal Distribution
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Continuous Random Variables and Probability Distributions
Chapter 5 Continuous Random Variables and Probability Distributions
Chap 6-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 6 Continuous Random Variables and Probability Distributions Statistics.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics: A First Course 5 th.
Continuous Probability Distributions A continuous random variable can assume any value in an interval on the real line or in a collection of intervals.
Modern Navigation Thomas Herring
Chapter 21 Random Variables Discrete: Bernoulli, Binomial, Geometric, Poisson Continuous: Uniform, Exponential, Gamma, Normal Expectation & Variance, Joint.
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
Chapter 4 Continuous Random Variables and Probability Distributions
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Standard Statistical Distributions Most elementary statistical books provide a survey of commonly used statistical distributions. The reason we study these.
PBG 650 Advanced Plant Breeding
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
QBM117 Business Statistics Probability and Probability Distributions Continuous Probability Distributions 1.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Some Useful Continuous Probability Distributions.
Random Variables Numerical Quantities whose values are determine by the outcome of a random experiment.
Applied Quantitative Analysis and Practices LECTURE#11 By Dr. Osman Sadiq Paracha.
Descriptive Statistics
1 G Lect 4a G Lecture 4a f(X) of special interest: Normal Distribution Are These Random Variables Normally Distributed? Probability Statements.
ENGR 610 Applied Statistics Fall Week 3 Marshall University CITE Jack Smith.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Continuous Random Variables.
Determination of Sample Size: A Review of Statistical Theory
NORMAL DISTRIBUTION AND ITS APPL ICATION. INTRODUCTION Statistically, a population is the set of all possible values of a variable. Random selection of.
FREQUANCY DISTRIBUTION 8, 24, 18, 5, 6, 12, 4, 3, 3, 2, 3, 23, 9, 18, 16, 1, 2, 3, 5, 11, 13, 15, 9, 11, 11, 7, 10, 6, 5, 16, 20, 4, 3, 3, 3, 10, 3, 2,
1 1 Slide © 2004 Thomson/South-Western Chapter 3, Part A Discrete Probability Distributions n Random Variables n Discrete Probability Distributions n Expected.
CY1B2 Statistics1 (ii) Poisson distribution The Poisson distribution resembles the binomial distribution if the probability of an accident is very small.
Introduction to Probability
Advanced Statistical Methods: Continuous Variables REVIEW Dr. Irina Tomescu-Dubrow.
Probability Theory Modelling random phenomena. Permutations the number of ways that you can order n objects is: n! = n(n-1)(n-2)(n-3)…(3)(2)(1) Definition:
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 6-1 The Normal Distribution.
5 - 1 © 1998 Prentice-Hall, Inc. Chapter 5 Continuous Random Variables.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 6-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions Basic Business.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
THE NORMAL DISTRIBUTION
PROBABILITY DISTRIBUTION. Probability Distribution of a Continuous Variable.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics, A First Course 4 th.
Theoretical distributions: the Normal distribution.
Chapter 6 The Normal Distribution and Other Continuous Distributions
Tests of hypothesis Contents: Tests of significance for small samples
MATB344 Applied Statistics
Introductory Statistics and Data Analysis
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Chapter 4. Inference about Process Quality
Part Three. Data Analysis
Review of Probability Concepts
Statistical Evaluation
Introduction to Statistics
The normal distribution
Chapter 6 Confidence Intervals.
Chapter 5 Continuous Random Variables and Probability Distributions
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 5 Continuous Random Variables and Probability Distributions
The Normal Distribution
Presentation transcript:

Biostatistics Class 3 Probability Distributions 2/15/2000

Continuous Probability Distributions Continuous distribution has an infinite number of values between any two values assumed by the continuous variable As the number of observations, n, approaches infinity, the the width of the class intervals approaches zero, the graph of the frequencies (frequency polygon) approaches a smooth curve

Continuous Probability Distributions As with other probability distributions, the total area under the curve equals 1 Relative frequency (probability) of occurrence of values between any two points on the x-axis is equal to the total area bounded by the curve, the x-axis, and perpendicular lines erected at the two points on the x-axis Probability of any specific value of the random variable is 0

Area under a smooth curve Integration of the density function over the range a to b –Density function is a formula used to represent the distribution of a continuous random variable A nonnegative function f(x) is called a probability distribution or probability density function of the continuous random variable X if the total area bounded by its curve and the x-axis is equal 1 and if the subarea under the curve bounded by the curve, the x-axis, and perpendiculars erected at any two points a and b gives the probability that X is between the points a and b

Normal distribution Most important distribution in statistics Also called the Gaussian distribution Density given by –for -  < x <  –where  is the mean and  the standard deviation

Characteristics of Normal Distribution Symmetrical about mean,  Mean, median, and mode are equal Total area under the curve above the x-axis is one square unit 1 standard deviation on both sides of the mean includes approximately 68% of the total area –2 standard deviations includes approximately 95% –3 standard deviations includes approximately 99%

Characteristics of the Normal Distribution Normal distribution is completely determined by the parameters  and  –Different values of  shift the distribution along the x-axis –Different values of  determine degree of flatness or peakedness of the graph

Standard Normal Distribution Normal distribution is really family of curves determined by   and  Standard normal distribution is one with a  = 0 and  = 1 –Standard normal density given by: – for -  < x <  –where z = (x -  ) / 

Standard Normal Distribution To find probability that z takes on a value between any two points on the z-axis, need to find area bounded by perpendiculars erected at these points, the curve, and the z-axis –Values are tabled (Appendix Table 3) –Standard normal distribution is symmetric

Applications of Normal Distribution Frequently, data are normally distributed –Essential for some statistical procedures Due to the combination of many random factors/influences having different effects on the outcome –If not, possible to transform to a more normal form Approximations for other distributions Because of the frequent occurrence of the normal distribution in nature, much statistical theory has been developed for it

Examples of Standard Normal Distribution –Height and weight Calculate z-statistics Pr(X < x) Pr(X > x) Pr(x 1 < X < x 2 ) –Why? Determine percentiles Comparisons between different distributions

Linear Combination of Random Variables –Frequently, we work with combinations of variables; ie., variables ‘added’ of combined together in some way –Definition: L = c 1 X 1 + c 2 X 2 + … + c n X n –also known as linear contrast –Sum = 1X 1 + 1X 2 –Difference = 1X 1 + (-1)X 2 –Mean = 0.5X X 2

Linear Combination of Random Variables –Expected value (typically, the mean) of a linear combination is the sum of the expected values of the variables E(L) = c 1 E( X 1 ) + c 2 E( X 2 ) + … E(L) =  i c i X i –Variance of a linear combination of independent random variables is Var(L) =  i c i 2 var(X i ) –For normally distributed random variables, E(L) =  i c i  i Var(L) =  i c i 2  i 2

Linear Combination of Random Variables Variables that are not independent have a covariance between each pair –Cov(X,Y) = E[(X -  x ) (Y -  Y )] Covariance hard to interpret directly, so we calculate the correlation coefficient for descriptive purposes –  = Corr(X,Y) = Cov(X,Y) / (  x  Y ) Correlation coefficient ranges from -1 to 1 and is without units –Frequently used to describe strength of relationship between variables

Linear Combination of Random Variables Variance of a linear combination takes into account the covariance –Var(c 1 X 1 + c 2 X 2 ) = c 1 2 var(X 1 ) + c 2 2 var(X 2 ) + 2c 1 c 2 Cov(X 1,X 2 ) –Typically, we calculate the variance- covariance matrix for these calculations Note that the mean of a linear combination is not affected by the covariance ‘structure’ of the combination, only the variance –Important for regression analysis and analysis of correlated analysis

Normal Approximation to the Binomial Distribution When n is moderate in size (n  25) and p is not too extreme, then the normal distribution is a good approximation to the binomial distribution with  = np and  2 = npq Pr(a  X  b) is approximately the area under the N(np, npq) curve between a- 1/2 and b+ 1/2

Normal Approximation to the Poisson Distribution When  is moderate in size (   10), the Poisson distribution is cumbersome to use and the normal distribution is a good approximation with  =  and  2 =  Pr(X = x) is approximately the area under the N( ,  ) curve between x- 1/2 and x+ 1/2

Summary Laid the ground work for ‘normal theory’ estimation and hypothesis testing Showed how the binomial and Poisson distribution generalize to the normal with large samples Investigated the properties of linear contrasts, important for ANOVA and regression