Download presentation
Presentation is loading. Please wait.
1
Samples and populations Estimating with uncertainty
2
Review - order of operations
3
1.Parentheses 2.Exponents and roots 3.Multiply and divide 4.Add and subtract
4
Review - order of operations
6
Review - types of variables Categorical variables –For example, country of birth Numerical variables –For example, student height
7
Review - types of variables Categorical variables Numerical variables Discrete Continuous
8
Review - types of variables Categorical variables Numerical variables Discrete Continuous Nominal Ordinal
9
Review - types of variables Categorical variables –Nominal - no natural order –Ordinal - can be placed in an order
10
Review - types of variables Categorical variables –Nominal - no natural order Example - country of birth –Ordinal - can be placed in an order
11
Review - types of variables Categorical variables –Nominal - no natural order Example - country of birth –Ordinal - can be placed in an order Example - educational experience –Some high school, high school diploma, some college, college degree, masters degree, PhD
12
Sampling from a population We often sample from a population Consider random samples –Each individual has an equal and identical probability of being selected
13
Body mass of 400 humans
15
Random sample of 10 people
18
Population mean: = 70.8 kg
19
Population mean: = 70.8 kg Sample mean: x = 76.7 kg
20
Another sample…
22
Population mean: = 70.8 kg Sample mean: x = 69.2 kg
23
What if we do this many times? Example: gene length
24
n = 20,290
25
= 2622.0 = 2037.9
26
Sample histogram
27
n = 100 Y = 2675.4 s = 1539.2
28
Y = 2675.4 s = 1539.2 Y = 2588.8 s = 1620.5 Y = 2702.4 s = 1727.1 Y = 2767.2 s = 2044.7
29
Y = 2675.4 s = 1539.2 Y = 2588.8 s = 1620.5 Y = 2702.4 s = 1727.1 Y = 2767.2 s = 2044.7 Sampling distribution of the mean
30
1000 samples Sampling distribution of the mean
33
= 2622.0 Mean of means: 2626.4 Sampling distribution of the mean
34
Y = 2675.4 s = 1539.2 Y = 2588.8 s = 1620.5 Y = 2702.4 s = 1727.1 Y = 2767.2 s = 2044.7
35
s = 1539.2 s = 1620.5s = 1727.1 s = 2044.7 Sampling distribution of the standard deviation
37
100 samples Population = 2036.9 Mean sample s = 1962.6 Sampling distribution of the standard deviation
38
1000 samples Population = 2036.9 Mean sample s = 1929.7 Sampling distribution of the standard deviation
39
Sampling distribution of the mean, n=10 Sampling distribution of the mean, n=100 Sampling distribution of the mean, n = 1000
40
Sampling distribution of the mean, n=10 Sampling distribution of the mean, n=100 Sampling distribution of the mean, n = 1000
42
Larger sample size
44
Group activity #2 Form groups of size 2-5 Get out a blank sheet of paper Write everyone’s full name on the paper
45
How many toes do aliens have?
48
Instructions You have measurements from a population of 400 aliens Use your random number table to select a sample of ten measurements Calculate your sample mean and, if you have a calculator or a large brain, your sample standard deviation On your paper, answer the following: 1.What was your sample mean and standard deviation? 2.How did you randomly choose your sample?
49
Distribution of the sample mean No matter what the frequency distribution of the population: The sample mean has an approximately bell-shaped (normal) distribution Especially for large n (large samples)
50
How precise is any one estimated sample mean?
51
The standard error of an estimate is the standard deviation of its sampling distribution. The standard error predicts the sampling error of the estimate.
52
Standard error of the mean
53
Estimate of the standard error of the mean
54
Confidence interval –a range of values surrounding the sample estimate that is likely to contain the population parameter 95% confidence interval –plausible range for a parameter based on the data
55
The 2SE rule-of-thumb
56
Confidence interval
57
Pseudoreplication The error that occurs when samples are not independent, but they are treated as though they are.
58
Example: “The transylvania effect” A study of 130,000 calls for police assistance in 1980 found that they were more likely than chance to occur during a full moon.
59
Example: “The transylvania effect” A study of 130,000 calls for police assistance in 1980 found that they were more likely than chance to occur during a full moon. Problem: There may have been 130,000 calls in the data set, but there were only 13 full moons in 1980. These data are not independent.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.