also Gaussian distribution

Slides:



Advertisements
Similar presentations
Jamil A. Malik (PhD) National Institute of Psychology Quaid-e-Azam Univeristy.
Advertisements

Sampling: Final and Initial Sample Size Determination
Sampling Distributions (§ )
Random variable Distribution. 200 trials where I flipped the coin 50 times and counted heads no_of_heads in a trial.
Analysis of Research Data
Those who don’t know statistics are condemned to reinvent it… David Freedman.
HIM 3200 Normal Distribution Biostatistics Dr. Burton.
Chapter 11: Inference for Distributions
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
1 Introduction to biostatistics Lecture plan 1. Basics 2. Variable types 3. Descriptive statistics: Categorical data Categorical data Numerical data Numerical.
EPE/EDP 557 Key Concepts / Terms –Empirical vs. Normative Questions Empirical Questions Normative Questions –Statistics Descriptive Statistics Inferential.
Graphical Summary of Data Distribution Statistical View Point Histograms Skewness Kurtosis Other Descriptive Summary Measures Source:
Ch2: Probability Theory Some basics Definition of Probability Characteristics of Probability Distributions Descriptive statistics.
Measures of Variability Objective: Students should know what a variance and standard deviation are and for what type of data they typically used.
The normal distribution Binomial distribution is discrete events, (infected, not infected) The normal distribution is a probability density function for.
Continuous Distributions The Uniform distribution from a to b.
Measures of Dispersion & The Standard Normal Distribution 9/12/06.
Skewness & Kurtosis: Reference
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Statistical Methods II&III: Confidence Intervals ChE 477 (UO Lab) Lecture 5 Larry Baxter, William Hecker, & Ron Terry Brigham Young University.
Determination of Sample Size: A Review of Statistical Theory
Statistical Methods II: Confidence Intervals ChE 477 (UO Lab) Lecture 4 Larry Baxter, William Hecker, & Ron Terry Brigham Young University.
Expectation. Let X denote a discrete random variable with probability function p(x) (probability density function f(x) if X is continuous) then the expected.
IE 300, Fall 2012 Richard Sowers IESE. 8/30/2012 Goals: Rules of Probability Counting Equally likely Some examples.
Ch8.2 Ch8.2 Population Mean Test Case I: A Normal Population With Known Null hypothesis: Test statistic value: Alternative Hypothesis Rejection Region.
Raw ScoreFrequency Figures and Graphs Frequency Distributions Example: Scores on a quiz 6,2,5,6,5,4,6,7,6,6,8, 5,6,3,7,6,8,3,6,5,4,5,
1 Day 1 Quantitative Methods for Investment Management by Binam Ghimire.
Paul Cornwell March 31,  Let X 1,…,X n be independent, identically distributed random variables with positive variance. Averages of these variables.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Sampling Distributions
Session V: The Normal Distribution Continuous Distributions (Zar, Chapter 6)
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Continuous Random Variables. Probability Density Function When plotted, continuous treated as discrete random variables can be “binned” form “bars” A.
Continuous Distributions
Distributions.
Statistics in SPSS Lecture 3
Review 1. Describing variables.
Measures of Dispersion
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
Research Methods in Psychology PSY 311
CHAPTER 6 Random Variables
Practice Page Practice Page Positive Skew.
STAT 311 REVIEW (Quick & Dirty)
Summary of Prev. Lecture
CENTRAL MOMENTS, SKEWNESS AND KURTOSIS
Normal distribution, Central limit theory, and z scores
Skewness Skewness literaly means lack of symmetary. Thus it helps us to understand the shape of distributions.
Sample Mean Distributions
Descriptive Statistics
Statistics in Applied Science and Technology
Test for Normal Distribution
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
MEGN 537 – Probabilistic Biomechanics Ch.3 – Quantifying Uncertainty
Bar charts and Histograms
Why does sampling work?.
Warmup To check the accuracy of a scale, a weight is weighed repeatedly. The scale readings are normally distributed with a standard deviation of
Shape of Distributions
Hypothesis Tests for a Standard Deviation
PROBABILITY DISTRIBUTION
CENTRAL MOMENTS, SKEWNESS AND KURTOSIS
Sampling Distribution Models
Sampling Distributions (§ )
Measures of Dispersion
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Statistical Inference for the Mean: t-test
Continuous Distributions
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Fundamental Sampling Distributions and Data Descriptions
Presentation transcript:

also Gaussian distribution Normal distribution also Gaussian distribution

Probability density Mean value of distribution is  holds - variance of distribution is 2

There is certain inconsistency -  is a symbol for mean in general as well as the specific parameter of normal distribution (which is a mean too) and similar for 2 Key role of normal distribution in statistics rises from central limit theorem. It says, that mean of “very big” random sample is random value with almost normal distribution, even if distribution of population differs from normal one.

From “definition” – variable with normal distribution can take with nonzero probability values from - to + Biological variables haven’t usually normal distribution, but can be ofted “reasonably” approximated by normal distribution.

Skewness and kurtosis i-th moment - mean value Xi i-th central moment, κi – mean value (X- )i So, mean value is the first general moment The first central moment is from definition 0 Variance is the second central moment Skewness is characterized as the third central moment Kurtosis is characterized as the fourth central moment

Skewness Positive skewness – a lot of negative deviances from mean is compensated by lesser number of big positive deviances: 3, 3, 3, 4, 7 μ=4 3=(3-4)3+ (3-4)3+ (3-4)3+ (4-4)3+ (7-4)3 =(-1)+(-1)+(-1)+0+27=24 3 – is in the third powers of measurement units mean median - is dimensionless and set just shape

Skewness Negative skew distribution - a lot of small positive deviations from mean is compensated by lesser number of big negative deviances 5, 5, 5, 1, 4 μ=4 3=(5-4)3+ (5-4)3+ (5-4)3+ (4-4)3+ (1-4)3 =1+1+1+0+(-27)=-24 mean median

Kurtosis – 4th central moment Normal distribution is mesokurtic normal - mesokurtic leptokurtic 2 > 0 platykurtic 2 < 0

standartized normal distribution

“checking“ of normality - graphic Mixture of two types of normal distribution Normal Plot a cumulative histogram of counts on a scale of probabilities Left skewed Right skewed Platykurtic leptokurtic Examples of denzities of probabilities (frequency distributions) together with their cumulative distributions plotted on y-axis in a scale of normal probability

Normality checking– I compute skewness and kurtosis and compare them with expected values for normal distribution. The most of biological data has positive skewed distribution – that’s why computing of skewness give us quite strong test and tell us how much data differ from normality at the same time.

Normality testing - χ2 test I compute mean and variance from data and compare sample data with date with normal distribution with the same mean and variance as data of my own. Then, with help of χ2 test, I compare number of cases in size classes set up from observed data and expected frequencies in normal distribution – classic problem, I must decide the breadth of categories (columns’ width in histogram) – number of degrees of freedom = k-1-2 2 parameters from data

Editors of journals demand such a test, but (almost) no biological data have normal distribution, so if I have a large data, the test is strong and I reject null hypothesis about normality (even if the deviation from normality is small) if I have a few data, the test is desperately weak and even for data with big deviation from normality I cannot reject null hypothesis