A Statistical Analysis of Seedlings Planted in the Encampment Forest Association By: Tony Nixon.

Slides:



Advertisements
Similar presentations
Inference in the Simple Regression Model
Advertisements

Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Inference for Regression
Confidence Intervals Chapter 7. Rate your confidence Guess my mom’s age within 10 years? –within 5 years? –within 1 year? Shooting a basketball.
Confidence Intervals Chapter 10. Rate your confidence Name my age within 10 years? 0 within 5 years? 0 within 1 year? 0 Shooting a basketball.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
Topic 2: Statistical Concepts and Market Returns
8-5 Testing a Claim About a Standard Deviation or Variance This section introduces methods for testing a claim made about a population standard deviation.
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
Confidence Intervals Chapter 10. Rate your confidence Name my age within 10 years? within 5 years? within 1 year? Shooting a basketball at a wading.
CHAPTER 19: Two-Sample Problems
AM Recitation 2/10/11.
Inference for regression - Simple linear regression
Overview Definition Hypothesis
Statistical inference: confidence intervals and hypothesis testing.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Inference for Linear Regression Conditions for Regression Inference: Suppose we have n observations on an explanatory variable x and a response variable.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
1 Objective Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means.
Student’s t-distributions. Student’s t-Model: Family of distributions similar to the Normal model but changes based on degrees-of- freedom. Degrees-of-freedom.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
Chapter 12 Tests of a Single Mean When σ is Unknown.
1 Section 9-4 Two Means: Matched Pairs In this section we deal with dependent samples. In other words, there is some relationship between the two samples.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Chapter 23 Inference for One- Sample Means. Steps for doing a confidence interval: 1)State the parameter 2)Conditions 1) The sample should be chosen randomly.
1 Required Sample Size, Type II Error Probabilities Chapter 23 Inference for Means: Part 2.
Essential Statistics Chapter 141 Thinking about Inference.
Confidence Intervals For a Sample Mean. Point Estimate singleUse a single statistic based on sample data to estimate a population parameter Simplest approach.
BPS - 3rd Ed. Chapter 161 Inference about a Population Mean.
Confidence Intervals. Rate your confidence Name my age within 10 years? within 5 years? within 1 year? Shooting a basketball at a wading pool,
1.1 Statistical Analysis. Learning Goals: Basic Statistics Data is best demonstrated visually in a graph form with clearly labeled axes and a concise.
1 Objective Compare of two population variances using two samples from each population. Hypothesis Tests and Confidence Intervals of two variances use.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Inferences from sample data Confidence Intervals Hypothesis Testing Regression Model.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Confidence Intervals with Means. Rate your confidence Name my age within 10 years? Name my age within 10 years? within 5 years? within 5 years?
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
PCB 3043L - General Ecology Data Analysis.
Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
T tests comparing two means t tests comparing two means.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Chapter 13 Understanding research results: statistical inference.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
STATISTICS People sometimes use statistics to describe the results of an experiment or an investigation. This process is referred to as data analysis or.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
Chapter Nine Hypothesis Testing.
CHAPTER 10 Comparing Two Populations or Groups
PCB 3043L - General Ecology Data Analysis.
Chapter 11 Inferences About Population Variances
CHAPTER 26: Inference for Regression
CHAPTER 10 Comparing Two Populations or Groups
Basic Practice of Statistics - 3rd Edition Inference for Regression
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Section 10.2 Comparing Two Means.
CHAPTER 10 Comparing Two Populations or Groups
Presentation transcript:

A Statistical Analysis of Seedlings Planted in the Encampment Forest Association By: Tony Nixon

Project Overview The Encampment Forest Association The Nature Conservancy White Pine and White Cedar Restoration Silvicultural Management GIS Database

General Information of Database 3,321 total records in database pertaining to white pine, white cedar, and yellow birch seedlings. 1,987 useable records for statistical analysis that contain plant year information about white pine and white cedar with no yellow birch seedlings. Important database fields (“variables”) for statistical measure include year planted, species, height, vigor (measure of health), canopy cover, canopy species, and date updated. Plant year from 2004 to present.

Various Graphs Representing Overall Data Pie Chart showing the abundance of each species planted.Time plot representing the number of seedlings planted by year.

Determining Samples for Analysis It can be stated that seedling are the same height and vigor when planted. All records were updated in 2012 – 2013 for measuring changes in height, vigor, canopy cover, and competition. Samples will be created using a method of stratified random sampling and multistage sampling. Seedlings must all be planted in the same year. Seedling must be of the same species. Seedlings must be located within the same sample site. This reduces variance due to different environmental conditions.

Correlation Test A correlation test performed on white pine seedlings planted in Response variable is height of seedling and explanatory variable is canopy cover. Results show a correlation value of r = This indicates a negative correlation between height and canopy cover. In other words, with a higher percentage of canopy cover the growth of the seedlings is delayed. The most evidence is shown between a canopy cover of 0 through 15 where there is the greatest height of seedlings.

Variable Inference about Population A growth rate (inches/year) is calculated from the height of seedling based on year planted and year the data was updated. All seedlings are white pine and were planted in They reside in the same sample site. By testing this variable at this sample site, an inference can be made on the population for these environmental conditions. The sample mean and standard deviation are calculated and shown below along with the distribution. It can be concluded that the sample is relatively Normal with a couple outliers that may skew the data a little to the right.

One Sample t Test State: According to the National Forest Service early stages of white pine growth is relatively slow and varies from year to year. On average the height of white pine seedlings are 54 inches at 8 years of age. That is about the growth rate of 6.75 inches per year. Based on the data, does the current conditions of the sample site suggest a relatively healthy habitat or does it require more management? Formulate: The null hypothesis suggests that this sample site is no different than the average growth rate determined by the Forest Service. Use the one sample t test to determine if this is true. Solve: We know the sample is relatively Normal with a sample size > 15 so the t test will yield approximately accurate results. Calculate the probability that the growth rate of the sample site is to happen based on the conditions of the site. The sample has a 17.24% chance of occurring which is not enough evidence to reject the null hypothesis. Conclude: It can be stated that the null hypothesis is true. In other words, the sample site has about average growth rate. Therefore, the growing conditions are acceptable and do not need further management. One Sample t Test Calculations Sample Size78 Degrees of Freedom77 Sample Mean7.112 Sample Standard Deviation2.321 Standard Error Mean Based on Null Hypothesis6.75 One Sample t Statistic Two sided P-value0.1724

One Sample t Confidence Interval State: What is the growth rate of white pine seedlings based on the environmental conditions of the sample site? Formulate: Use the one sample t confidence interval for determining the mean range of growth rate for the population mean based on 95% confidence. Solve: Calculate the confidence interval based on the sample mean and standard deviation. Based on the calculations below, there is a confidence interval of / Conclude: It can be stated that on a 95% confidence these environmental conditions will have a growth rate between and inchers per year for similarly aged seedlings. One Sample t Confidence Interval Calculations Sample Size78 Degrees of Freedom77 Sample Mean7.112 Sample Standard Deviation2.321 Standard Error Critical Value t* on 95%1.991 Confidence Interval (+/-)

Two Sample t Test State: Comparing what is known about one population of environmental conditions, a test can be done to make an inference about another population. Another sample site is selected of white pines planted in 2010 from another location. Is there a significant difference between population growth rates? Formulate: The null hypothesis suggests that the sample sites have the same growth rate and that the two environmental conditions have no real effect on this rate. Use the two sample t test to determine if this is true. Solve: The new sample site also has a relatively Normal distribution with > 15 sample size as shown below. Calculate the probability that the growth rate of the sample sites are similar and have no drastic differences. The sample has less than probability occurring which is evidence to reject the null hypothesis. Conclude: It can be stated that the null hypothesis is false. In other words, the sample sites have different conditions that lead to different growth rates. Sample 1 Statistics (2008 Growth Rate) Sample Size78 Degrees of Freedom77 Sample Mean7.112 Sample Standard Deviation2.321 Sample 2 Statistics (2010 Growth Rate) Sample Size49 Degrees of Freedom48 Sample Mean7.112 Sample Standard Deviation2.321 Two Sample t Test Calculations Degrees of Freedom48 Mean Difference X 1 - X t Test Denominator Two Sample t Statistic Two sided P-value< 0.001

Two Sample t Confidence Interval State: How much smaller is the growth rate of sample site two compared to sample site one based on 95% confidence? Formulate: Perform a two sample t confidence interval using a 95% critical value. Solve: Determine the critical t* value based on degrees of freedom from the smaller sample. Based on the calculations below, there is a confidence interval of / Conclude: It can be stated that on a 95% confidence sample site one has a growth rate between and inchers per year greater than sample site two. One Sample t Confidence Interval Calculations Degrees of Freedom48 Mean Difference X 1 - X Critical Value t* on 95%2.011 Confidence Interval (+/-)0.643

Limitations and Bias There will always be errors and a level of bias with statistical inferences and human related tests. Human error in data collection with measuring heights and estimating canopy cover. Data recording error with entry. Bias when selecting sample sites without a true method of random sampling. Bias on choosing only protected seedlings that doesn’t factor in other damaging agents such as animal browsing.

Overall Conclusions Time Plot and Pie Chart: Based on the plots, it can be stated that the Encampment Forest Association has recently took interest in restoring their white pine populations. Correlation Inference: There is a negative correlation between height of seedlings and canopy cover; therefore, a management practice of releasing white pine stands will help increase growth rate of seedlings. One Sample Inferences: Since it is known that the sample site tested has average growth rates overall, environmental conditions occurring at this site can be noted and very minimal management practices need to be performed to maintain optimal growing conditions. Also when determining future plantation sites, these environmental conditions should be sought. Two Sample Inferences: By comparing different populations of environmental conditions, the optimal growing locations can be found and time can be saved focusing on sites with significantly less growth rates.

Personal Reflection and Final Thoughts Focus on material related to degree. Use ArcGIS software throughout the project and tools to aid in statistical analysis. Found out the functionality of its statistical components of the software and how it enhances the overall project. Use a method of querying for different samples and look for trends to test. The power of statistics for work related projects. Using stats to improve the efficiency of work and for determining materials and management required. Will use inferential statistics in my capstone for predicting population variables.

Questions??