Basic Statistical Concepts  M. Burgman & J. Carey 2002.

Slides:



Advertisements
Similar presentations
Hypothesis testing Another judgment method of sampling data.
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Hypothesis Testing Steps in Hypothesis Testing:
Inference Sampling distributions Hypothesis testing.
Parametric/Nonparametric Tests. Chi-Square Test It is a technique through the use of which it is possible for all researchers to:  test the goodness.
From the homework: Distribution of DNA fragments generated by Micrococcal nuclease digestion mean(nucs) = bp median(nucs) = 110 bp sd(nucs+ = 17.3.
Analysis and Interpretation Inferential Statistics ANOVA
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
PSY 307 – Statistics for the Behavioral Sciences
Topic 2: Statistical Concepts and Market Returns
Chapter Goals After completing this chapter, you should be able to:
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Lecture 9: One Way ANOVA Between Subjects
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Chapter Sampling Distributions and Hypothesis Testing.
Chapter 11 Multiple Regression.
Inference.ppt - © Aki Taanila1 Sampling Probability sample Non probability sample Statistical inference Sampling error.
Inference about a Mean Part II
Chapter 2 Simple Comparative Experiments
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Today Concepts underlying inferential statistics
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Richard M. Jacobs, OSA, Ph.D.
Lecture 5 Correlation and Regression
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
AM Recitation 2/10/11.
Inferential Statistics & Test of Significance
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Fundamentals of Statistical Analysis DR. SUREJ P JOHN.
T-Tests and Chi2 Does your sample data reflect the population from which it is drawn from?
P Values Robin Beaumont 10/10/2011 With much help from Professor Chris Wilds material University of Auckland.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Statistics for Water Science: Hypothesis Testing: Fundamental concepts and a survey of methods Unite 5: Module 17, Lecture 2.
Lecture 3 Forestry 3218 Avery and Burkhart, Chapter 3 Shiver and Borders, Chapter 2 Forest Mensuration II Lecture 3 Elementary Sampling Methods: Selective,
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
Instructor Resource Chapter 5 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
The Scientific Method Formulation of an H ypothesis P lanning an experiment to objectively test the hypothesis Careful observation and collection of D.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
CONFIDENCE INTERVAL It is the interval or range of values which most likely encompasses the true population value. It is the extent that a particular.
Lecture 2 Forestry 3218 Lecture 2 Statistical Methods Avery and Burkhart, Chapter 2 Forest Mensuration II Avery and Burkhart, Chapter 2.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
26134 Business Statistics Tutorial 11: Hypothesis Testing Introduction: Key concepts in this tutorial are listed below 1. Difference.
P Values Robin Beaumont 8/2/2012 With much help from Professor Chris Wilds material University of Auckland.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
P Values - part 2 Samples & Populations Robin Beaumont 2011 With much help from Professor Chris Wilds material University of Auckland.
BIOSTATISTICS Hypotheses testing and parameter estimation.
Chapter 13 Understanding research results: statistical inference.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Inference about the slope parameter and correlation
Parameter Estimation.
ESTIMATION.
Hypothesis Testing.
What are their purposes? What kinds?
Presentation transcript:

Basic Statistical Concepts  M. Burgman & J. Carey 2002

Statistical Population The entire underlying set of individuals from which samples are drawn. e.g. 0.25m 2 quadrats are used to count barnacles on a sea shore. The population is defined implicitly by the sampling frame.

Strategies Define survey objectives Define population parameters to estimate Implement sampling strategy i) measure every individual (cost, time, practicality especially if destructive) ii) measure a representative portion of the population (a sample)

Statistical Sample An aggregate of objects from which measurements are taken. A representative subset of a population.

Simple Random Sampling Every unit and combination of units in the population has an equal chance of selection. a) with replacement b) without replacement c) finite and infinite populations

Sampling Objectives To obtain an unbiased estimate of a population mean To assess the precision of the estimate (i.e. calculate the standard error of the mean) To obtain as precise an estimate of the parameters as possible for time and money spent

 (x i -  ) 2 n  (x i - x) 2 n - 1  (x i - x) 2 n - 1 Statistics of Dispersion Population variance  2 = Sample variance s 2 = Sample standard deviation s = 

s 2 n s x  (x i - x ) (y i - y ) n - 1 Statistics of Dispersion Standard error of the mean s x = Coefficient of variation CV = Covariance s xy = 

Expectations and Variances E(X+b)= E(X) + b E(aX)=aE(X) E(X+Y)=E(X) + E(Y) V(X+b)=V(X) V(aX)=a 2 V(X) V(X+Y)=V(X) + V(Y) + 2Cov(X,Y)

Confidence Limits For the mean= x  t [ , n-1] This formula sets confidence limits to means of samples from a normally distributed population. s  n

Confidence Limits Confidence limits of the mean define a region that we expect will enclose the true mean. The likelihood that this is true is determined by . If we set  at 5% (hence specifying 95% confidence intervals), then the region enclosed by the confidence intervals will capture the true mean 95 times out of 100.

Confidence Limits The same formula may be used to set confidence limits to any statistic as long as it follows the normal distribution, e.g. the median, the average (absolute) deviation, standard deviation (s), coefficient of variation, or skewness.

How many samples? where : CV is coefficient of variation (expressed as a %) of samples in a pilot survey t is Student's t value for a specified degree of certainty and the number of samples used to estimate the parameters E is specified error limits (expressed as a % of the mean) n = t 2 CV 2 E 2

Measurement Error Measured variation may be decomposed into natural variation + measurement error Measurement error may be reduced by improving sampling protocols and instrumentation Reducing measurement error increases confidence in estimates without increasing the number of samples. Precision (variation) v. accuracy (bias)

Components of Measurement Error Systematic errors Random errors Causes Measurement assumptions (shape, size, allometry) Instrument error Operator error

Kinds of Uncertainty 1. Epistemic Uncertainty inherent environmental variation variation in population responses due to demographic structure imperfect knowledge model mis-specification measurement error (assessment error) ignorance

Kinds of Uncertainty 2. Semantic Uncertainty Ambiguity - interpretation of a phrase in two or more distinct ways. “Juvenile Court to Try Shooting Defendant” “Local High School Dropouts Cut in Half” Vagueness - leads to borderline cases. e.g. tall; endangered; adult

Kinds of Uncertainty More examples of vagueness: Tree crown tree foliage bounded by the first healthy branch forming part of the main crown and extending as far or further than any branch above it. forked trees?dead branches?

Kinds of Uncertainty More examples of vagueness: Epilimnion the upper layer of water in a lake, bounded by a thermocline Soil horizon a relatively uniform soil layer, differentiated by contrasts in mineral or organic properties.

Sampling Design Criteria Operational simplicity Unambiguous interpretation

Null-Hypothesis Tests An example of hypothesis testing in which management alternatives are judged on the basis of the outcome of the test. HypothesisSymbolDescription Null H 0 The strategy has no hypothesis effect. Alternative H 1 The strategy is hypothesiseffective

Statistical Outcomes in Null Hypothesis Testing Test Result Significant Not significant (H 0 rejected) (H 0 not rejected) Difference correct Type II error (H 0 false) (  ) No difference Type I error correct (H 0 true) (  ) Reality

The Character of Error Types Type I errors Alarmism/Over-reaction Incorrectly accepting a (false) alternative hypothesis Concluding (incorrectly) that there is an impact Type II errors False confidence/Cornucopia Incorrectly "accepting" a (false) null hypothesis Concluding (incorrectly) that there is no impact

t-tests A t-test of the hypothesis that two sample means come from a population with equal  i.e. H 0 :  1 =  2 t = Y 1 - Y 2 1n1n (s s 2 2 ) 

Distributions of Test Statistics distribution of mean of actual population distribution of the null hypothesis, assumed to be true until rejected P(statistic)  critical value

Assumptions The assumption of independence: correlation and autocorrelation 1. if error in one object is related to error in others, there will be bias eg. measure one and compare others. 2. the effective sample size may be less than the number of samples if measurements are correlated in space or time.

The effects of the non-independence of data on errors of interpretation of statistical tests Non-independence Among Within treatments treatments Positive Increased Increased Type II Type I Negative Increased Increased Type I Type II Correlation

Randomization Tests Jaw lengths of Golden Jackals: Males: 120, 107, 110, 116, 114, 111, 113, 117, 114, 112 Females: 110, 111, 107, 108, 110, 105, 107, 106, 111, 111 Is there a difference in jaw length between males and females?

1.Calculate means for males and for females. 2.Calculate the difference between the means D 0 = x m - x f = Randomly allocate 10 sample lengths to each of 2 groups 4.Calculate D i, the difference between means for these 2 groups 5.Repeat Steps 3 & 4 many times Randomization Tests

If D 0 is unusually large, the observed data are unlikely to have arisen if there was no difference between males and females. Randomization Tests Difference in jaw length (mm) Frequency D 0 =

Randomization Tests From 5000 runs, only 9 D i s were greater than or equal to /5000 = (t-test: p Ho = )

Confidence Limits by Randomization For 95% confidence limits, the upper and lower limits, U and L, are such that they enclose 95% of the randomization distribution. For 99% confidence, L and U must give values at the 0.5% and 99.5% points on the distribution.

Can do randomization tests in lieu of: paired comparisons ANOVA multiple regression Randomization Tests