Presentation is loading. Please wait.

Presentation is loading. Please wait.

Samples and populations Estimating with uncertainty.

Similar presentations


Presentation on theme: "Samples and populations Estimating with uncertainty."— Presentation transcript:

1 Samples and populations Estimating with uncertainty

2 Review - order of operations

3 1.Parentheses 2.Exponents and roots 3.Multiply and divide 4.Add and subtract

4 Review - order of operations

5

6 Review - types of variables Categorical variables –For example, country of birth Numerical variables –For example, student height

7 Review - types of variables Categorical variables Numerical variables Discrete Continuous

8 Review - types of variables Categorical variables Numerical variables Discrete Continuous Nominal Ordinal

9 Review - types of variables Categorical variables –Nominal - no natural order –Ordinal - can be placed in an order

10 Review - types of variables Categorical variables –Nominal - no natural order Example - country of birth –Ordinal - can be placed in an order

11 Review - types of variables Categorical variables –Nominal - no natural order Example - country of birth –Ordinal - can be placed in an order Example - educational experience –Some high school, high school diploma, some college, college degree, masters degree, PhD

12 Sampling from a population We often sample from a population Consider random samples –Each individual has an equal and identical probability of being selected

13 Body mass of 400 humans

14

15 Random sample of 10 people

16

17

18 Population mean:  = 70.8 kg

19 Population mean:  = 70.8 kg Sample mean: x = 76.7 kg

20 Another sample…

21

22 Population mean:  = 70.8 kg Sample mean: x = 69.2 kg

23 What if we do this many times? Example: gene length

24 n = 20,290

25  = 2622.0  = 2037.9

26 Sample histogram

27 n = 100 Y = 2675.4 s = 1539.2

28 Y = 2675.4 s = 1539.2 Y = 2588.8 s = 1620.5 Y = 2702.4 s = 1727.1 Y = 2767.2 s = 2044.7

29 Y = 2675.4 s = 1539.2 Y = 2588.8 s = 1620.5 Y = 2702.4 s = 1727.1 Y = 2767.2 s = 2044.7 Sampling distribution of the mean

30 1000 samples Sampling distribution of the mean

31

32

33  = 2622.0 Mean of means: 2626.4 Sampling distribution of the mean

34 Y = 2675.4 s = 1539.2 Y = 2588.8 s = 1620.5 Y = 2702.4 s = 1727.1 Y = 2767.2 s = 2044.7

35 s = 1539.2 s = 1620.5s = 1727.1 s = 2044.7 Sampling distribution of the standard deviation

36

37 100 samples Population  = 2036.9 Mean sample s = 1962.6 Sampling distribution of the standard deviation

38 1000 samples Population  = 2036.9 Mean sample s = 1929.7 Sampling distribution of the standard deviation

39 Sampling distribution of the mean, n=10 Sampling distribution of the mean, n=100 Sampling distribution of the mean, n = 1000

40 Sampling distribution of the mean, n=10 Sampling distribution of the mean, n=100 Sampling distribution of the mean, n = 1000

41

42 Larger sample size

43

44 Group activity #2 Form groups of size 2-5 Get out a blank sheet of paper Write everyone’s full name on the paper

45 How many toes do aliens have?

46

47

48 Instructions You have measurements from a population of 400 aliens Use your random number table to select a sample of ten measurements Calculate your sample mean and, if you have a calculator or a large brain, your sample standard deviation On your paper, answer the following: 1.What was your sample mean and standard deviation? 2.How did you randomly choose your sample?

49 Distribution of the sample mean No matter what the frequency distribution of the population: The sample mean has an approximately bell-shaped (normal) distribution Especially for large n (large samples)

50 How precise is any one estimated sample mean?

51 The standard error of an estimate is the standard deviation of its sampling distribution. The standard error predicts the sampling error of the estimate.

52 Standard error of the mean

53 Estimate of the standard error of the mean

54 Confidence interval –a range of values surrounding the sample estimate that is likely to contain the population parameter 95% confidence interval –plausible range for a parameter based on the data

55 The 2SE rule-of-thumb

56 Confidence interval

57 Pseudoreplication The error that occurs when samples are not independent, but they are treated as though they are.

58 Example: “The transylvania effect” A study of 130,000 calls for police assistance in 1980 found that they were more likely than chance to occur during a full moon.

59 Example: “The transylvania effect” A study of 130,000 calls for police assistance in 1980 found that they were more likely than chance to occur during a full moon. Problem: There may have been 130,000 calls in the data set, but there were only 13 full moons in 1980. These data are not independent.


Download ppt "Samples and populations Estimating with uncertainty."

Similar presentations


Ads by Google