Biostatistics, statistical software III. Population, statistical sample. Probability, probability variables. Important distributions. Properties of the.

Slides:



Advertisements
Similar presentations
Sampling Distributions (§ )
Advertisements

Stat350, Lecture#4 :Density curves and normal distribution Try to draw a smooth curve overlaying the histogram. The curve is a mathematical model for the.
Topic 3 The Normal Distribution. From Histogram to Density Curve 2 We used histogram in Topic 2 to describe the overall pattern (shape, center, and spread)
DENSITY CURVES and NORMAL DISTRIBUTIONS. The histogram displays the Grade equivalent vocabulary scores for 7 th graders on the Iowa Test of Basic Skills.
Biostatistics Unit 4 - Probability.
Sampling Distributions
CHAPTER 6 Statistical Analysis of Experimental Data
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Introduction to Probability and Statistics Chapter 7 Sampling Distributions.
CHAPTER 3: The Normal Distributions Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
BPS - 5th Ed. Chapter 31 The Normal Distributions.
Fall 2013Biostat 5110 (Biostatistics 511) Week 7 Discussion Section Lisa Brown Medical Biometry I.
Chapter 7: The Normal Probability Distribution
Chapter 6 The Normal Probability Distribution
8.5 Normal Distributions We have seen that the histogram for a binomial distribution with n = 20 trials and p = 0.50 was shaped like a bell if we join.
Chapter 3 (continued) Nutan S. Mishra. Exercises Size of the data set = 12 for all the five problems In 3.11 variable x 1 = monthly rent of.
Probability & the Normal Distribution
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
1 If we can reduce our desire, then all worries that bother us will disappear.
Continuous Probability Distributions  Continuous Random Variable  A random variable whose space (set of possible values) is an entire interval of numbers.
Overview 6.1 Discrete Random Variables
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Ch5 Continuous Random Variables
Copyright ©2011 Nelson Education Limited The Normal Probability Distribution CHAPTER 6.
Topics Covered Discrete probability distributions –The Uniform Distribution –The Binomial Distribution –The Poisson Distribution Each is appropriately.
Theory of Probability Statistics for Business and Economics.
Problem: Assume that among diabetics the fasting blood level of glucose is approximately normally distributed with a mean of 105mg per 100ml and an SD.
Barnett/Ziegler/Byleen Finite Mathematics 11e1 Learning Objectives for Section 11.5 Normal Distributions The student will be able to identify what is meant.
Stat 1510: Statistical Thinking and Concepts 1 Density Curves and Normal Distribution.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
Random Variables Numerical Quantities whose values are determine by the outcome of a random experiment.
Chapter 2: The Normal Distribution Section 1: Density Curves and the Normal Distribution.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Chapter 6. Continuous Random Variables Reminder: Continuous random variable.
CHAPTER 3: The Normal Distributions ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Chapter 6 The Normal Curve. A Density Curve is a curve that: *is always on or above the horizontal axis *has an area of exactly 1 underneath it *describes.
Essential Statistics Chapter 31 The Normal Distributions.
Density Curves and the Normal Distribution.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 Normal Probability Distributions 6-1 Review and Preview 6-2 The Standard Normal.
CHAPTER 3: The Normal Distributions
The Normal Distribution Chapter 6. Outline 6-1Introduction 6-2Properties of a Normal Distribution 6-3The Standard Normal Distribution 6-4Applications.
Statistical estimation, confidence intervals
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Determination of Sample Size: A Review of Statistical Theory
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 2 Modeling Distributions of Data 2.2 Density.
NORMAL DISTRIBUTION AND ITS APPL ICATION. INTRODUCTION Statistically, a population is the set of all possible values of a variable. Random selection of.
Biostatistics, statistical software I. Basic statistical concepts Krisztina Boda PhD Department of Medical Informatics, University of Szeged.
Thursday August 29, 2013 The Z Transformation. Today: Z-Scores First--Upper and lower real limits: Boundaries of intervals for scores that are represented.
The two way frequency table The  2 statistic Techniques for examining dependence amongst two categorical variables.
BPS - 5th Ed. Chapter 31 The Normal Distributions.
Essential Statistics Chapter 31 The Normal Distributions.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Section 2.1 Density Curves. Get out a coin and flip it 5 times. Count how many heads you get. Get out a coin and flip it 5 times. Count how many heads.
Chapter 6 The Normal Distribution.  The Normal Distribution  The Standard Normal Distribution  Applications of Normal Distributions  Sampling Distributions.
1 Chapter 2: The Normal Distribution 2.1Density Curves and the Normal Distributions 2.2Standard Normal Calculations.
Probability Theory Modelling random phenomena. Permutations the number of ways that you can order n objects is: n! = n(n-1)(n-2)(n-3)…(3)(2)(1) Definition:
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
Welcome to the Wonderful World of AP Stats.…NOT! Chapter 2 Kayla and Kelly.
 Normal Curves  The family of normal curves  The rule of  The Central Limit Theorem  Confidence Intervals  Around a Mean  Around a Proportion.
Theoretical distributions: the Normal distribution.
Chapter 2 The Normal Distributions. Section 2.1 Density curves and the normal distributions.
Chapter 6: Sampling Distributions
Normal Distribution and Parameter Estimation
Chapter 7 ENGR 201: Statistics for Engineers
Density Curve A mathematical model for data, providing a way to describe an entire distribution with a single mathematical expression. An idealized description.
Sampling Distribution Models
Section 2.1 Density Curves & the Normal Distributions
CHAPTER 2 Modeling Distributions of Data
Section 2.1 Density Curves & the Normal Distributions
Section 2.1 Density Curves & the Normal Distributions
M248: Analyzing data Block A UNIT A3 Modeling Variation.
Presentation transcript:

Biostatistics, statistical software III. Population, statistical sample. Probability, probability variables. Important distributions. Properties of the normal distribution, the central limit theorem, the standard error. Krisztina Boda PhD Department of Medical Informatics, University of Szeged

Krisztina Boda INTERREG 2 Population, sample Population: the entire group of individuals that we want information about. Sample: a part of the population that we actually examine in order to get information A simple random sample of size n consists of n individuals chosen from the population in such a way that every set of n individuals has an equal chance to be in the sample actually selected.

Krisztina Boda INTERREG 3 Examples Sample data set  Questionnaire filled in by a group of pharmacy students  Blood pressure of 20 healthy women …… Population  Pharmacy students  Students  Blood pressure of women (whoever) ……

Krisztina Boda INTERREG 4 SamplePopulation Bar chart of relative frequencies of a categorical variable Distribution of that variable in the population (approximates)

Krisztina Boda INTERREG 5 SamplePopulation Histogram of relative frequencies of a continuous variable Distribution of that variable in the population (approximates)

Krisztina Boda 6 SamplePopulation Mean (x) Standard deviation (SD) Median Mean  (unknown) Standard deviation  (unknown) Median (unknown) (approximates)

Krisztina Boda INTERREG 7 The concept of probability Lets repeat an experiment n times under the same conditions. In a large number of n experiments the event A is observed to occur k times (0  k  n). k : frequency of the occurrence of the event A. k/n : relative frequency of the occurrence of the event A. 0  k/n  1 If n is large, k/n will approximate a given number. This number is called the probability of the occurrence of the event A and it is denoted by P(A). 0  P(A)  1

Krisztina Boda INTERREG 8 Example: the tossing of a (fair) coin k: number of „head”s n= k= k/n= P(„head”)=0.5

Krisztina Boda INTERREG 9 Probability facts Any probability is a number between 0 and 1. All possible outcomes together must have probability 1. The probability of the complementary event of A is 1-P(A).

Krisztina Boda INTERREG 10 Rules of probability calculus Assumption: all elementary events are equally probable Examples: Rolling a dice. What is the probability that the dice shows 5?  If we let X represent the value of the outcome, then P(X=5)=1/6. What is the probability that the dice shows an odd number?  P(odd)=1/2. Here F=3, T=6, so F/T=3/6=1/2.

Krisztina Boda INTERREG 11 The density curve The density curve is an idealized description of the overall pattern of a distribution that smoothes out the irregularities in the actual data. The density curve  is on the above the horizontal axis, and  has area exactly 1 underneath it. The area under the curve and above any range of values is the proportion of all observations that fall in that range.

Krisztina Boda INTERREG 12 Special distribution of a continuous variable: The normal distribution The normal curve was developed mathematically in 1733 by DeMoivre as an approximation to the binomial distribution. His paper was not discovered until 1924 by Karl Pearson. Laplace used the normal curve in 1783 to describe the distribution of errors. Subsequently, Gauss used the normal curve to analyze astronomical data in The normal curve is often called the Gaussian distribution. The term bell-shaped curve is often used in everyday usage.

Krisztina Boda INTERREG 13 Special distribution of a continuous variable: The normal distribution N( ,  ) The density curves are symmetric, single-peaked and bell-shaped The rule. In the normal distribution with mean  and standard deviation  :  68% of the observations fall within  of the mean   95% of the observations fall within 2  of the mean   99.7% of the observations fall within 3  of the mean 

Krisztina Boda INTERREG 14 Normal distributions N( ,  ) N(0,1) N(1,1) N(0,2) ,  : parameters (a parameter is a number that describes the distribution)

Krisztina Boda INTERREG 15 „Imagination” of the distribution given the sample mean and sample SD – supposing normal distribution In the papers generally sample means and SD-s are published. We use these value to know the original distribution For example, age in years: 55.2  % of the data are supposed to be in this interval 95.44% of the data are supposed to be in the interval (23.8, 86.6)

Krisztina Boda INTERREG 16 Stent length per lesion (mm): 18.8  10.5 Using these values as parameters, the „normal” distribution would be like this: The original distribution of stent length was probably skewed, or if the distribution was normal, there were negative values in the data set.

Krisztina Boda INTERREG 17 Standard normal probabilities N(0,1) z(z): proportion of area to the left of z The area under the standard normal curve is tabulated

Krisztina Boda INTERREG 18 The use of the normal curve table z(z): proportion of area to the left of z Find the area under the standard normal curve to the left of z=-1.96, that is, find the probablity that z<-1.96 P(z<-0.96)=0.025

Krisztina Boda INTERREG 19 Standardization If a random variable X has normal distribution N(,) distribution, than the variable has is standard normal distribution N(0,1).

Krisztina Boda INTERREG 20 The use of the normal curve table z(z): proportion of area to the left of z Example. It was found that the weights of a certain population of laboratory rats are normally distributed with  = 14 ounces and σ=2 ounces. In such a population, what percentage of rats would we expect to weigh below 10 ounces? Solution. 1. Standardize X 2. Find the area to the left of -2

Krisztina Boda INTERREG 21 Distribution of sample means h ttp://

Krisztina Boda INTERREG 22 Distribution of sample means The original population is not normally distributed

Krisztina Boda INTERREG 23 The central limit theorem Let’s draw samples from a population with mean µ and standard deviation σ. Examine the distribuiton of the sample means. If the sample size n is large (say, at least 30), then the population of all possible sample means approximately has a normal distribution with mean µ and standard deviation σ no matter what probability describes the population sampled.

Krisztina Boda INTERREG 24 The standard error of mean (SE or SEM) is called the standard error of mean Meaning: the dispersion of the sample means around the (unknown) population mean. Estimation of SE from the sample:

Krisztina Boda INTERREG 25 Review questions and exercises Problems to be solved by hand- calculations ..\Handouts\Problems hand III.doc..\Handouts\Problems hand III.doc Solutions ..\Handouts\Problems hand III solutions.doc..\Handouts\Problems hand III solutions.doc Problems to be solved using computer - none

Krisztina Boda INTERREG 26 Useful WEB pages     2/index.html 2/index.html