1 Sampling designs for spawning data on The Middle Fork Salmon River *a lot like the Middle Fork Salmon R. What sampling design should be used for estimating.

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

Trend for Precision Soil Testing % Zone or Grid Samples Tested compared to Total Samples.
AGVISE Laboratories %Zone or Grid Samples – Northwood laboratory
Trend for Precision Soil Testing % Zone or Grid Samples Tested compared to Total Samples.
Sampling: Theory and Methods
3.6 Support Vector Machines
Multistage Sampling.
Lecture 8: Hypothesis Testing
1
Author: Julia Richards and R. Scott Hawley
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
STATISTICS HYPOTHESES TEST (I)
STATISTICS INTERVAL ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
STATISTICS POINT ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Addition and Subtraction Equations
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
Variance Estimation in Complex Surveys Third International Conference on Establishment Surveys Montreal, Quebec June 18-21, 2007 Presented by: Kirk Wolter,
UNITED NATIONS Shipment Details Report – January 2006.
Add Governors Discretionary (1G) Grants Chapter 6.
CALENDAR.
Copyright © 2010 Pearson Education, Inc. Slide
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Chapter 7 Sampling and Sampling Distributions
The 5S numbers game..
A Fractional Order (Proportional and Derivative) Motion Controller Design for A Class of Second-order Systems Center for Self-Organizing Intelligent.
Biostatistics Unit 5 Samples Needs to be completed. 12/24/13.
Sampling in Marketing Research
Break Time Remaining 10:00.
The basics for simulations
PP Test Review Sections 6-1 to 6-6
5-1 Chapter 5 Theory & Problems of Probability & Statistics Murray R. Spiegel Sampling Theory.
1 Slides revised The overwhelming majority of samples of n from a population of N can stand-in for the population.
Oil & Gas Final Sample Analysis April 27, Background Information TXU ED provided a list of ESI IDs with SIC codes indicating Oil & Gas (8,583)
TCCI Barometer March “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”
TCCI Barometer March “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”
Hypothesis Tests: Two Independent Samples
Chapter 10 Estimating Means and Proportions
Chapter 4 Inference About Process Quality
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
Progressive Aerobic Cardiovascular Endurance Run
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
Chapter 1: Expressions, Equations, & Inequalities
1..
Adding Up In Chunks.
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
TCCI Barometer September “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”
When you see… Find the zeros You think….
Before Between After.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
Subtraction: Adding UP
AU 350 SAS 111 Audit Sampling C Delano Gray June 14, 2008.
Statistical Inferences Based on Two Samples
Analyzing Genes and Genomes
DTU Informatics Introduction to Medical Image Analysis Rasmus R. Paulsen DTU Informatics TexPoint fonts.
Static Equilibrium; Elasticity and Fracture
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Converting a Fraction to %
Chapter 8 Estimation Understandable Statistics Ninth Edition
Clock will move after 1 minute
PSSA Preparation.
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 11 Simple Linear Regression.
Experimental Design and Analysis of Variance
Essential Cell Biology
Select a time to count down from the clock above
Introduction into Simulation Basic Simulation Modeling.
1 Dr. Scott Schaefer Least Squares Curves, Rational Representations, Splines and Continuity.
Commonly Used Distributions
Sampling Strategies for Chinook-Salmon Spawning Populations
Presentation transcript:

1 Sampling designs for spawning data on The Middle Fork Salmon River *a lot like the Middle Fork Salmon R. What sampling design should be used for estimating the number of chinook redds on a river network*? –estimation of status – number of spring-chinook redds in Middle Fork Salmon River one year –Measurement design – we are not really thinking about the measurement design, we assume we have some way to identify and count redds once you get to a location.

2 The Middle Fork Salmon River

3 Redd data – the Truth IDFG Dataset (Russ Thurow) counted the number of redds in the Middle Fork Salmon River via helicopter All spawning reaches were censused each year sampling was done by helicopter and where necessary by foot Six years of data , 2001, and 2002 These data can be considered the truth year Total redds

4 Objectives Compare several designs to see if one estimates the number of redds (and only redds) the best –unbiased designs (estimators) –best determined by standard error of estimator coverage probability (how many times 95% confidence interval actually contains the number of redds) cost –Keep things fair by sampling the same total length of stream, the index covers 976 segments or km. of stream. Does not imply equal cost Although some standard errors can be calculated analytically the coverage needs to be addressed via simulation.

5 Methods Compare sampling strategies using IDFG data as the truth. Sampling strategies include sampling design and estimator sample design Estimator for the total And confidence interval

6 Methods Use simulation by resampling the population over and over......

7 Cost & Crew-trips Each segment gets an access pt. Travel to access sites based on whether –airplane –Auto Travel from access sites to sampling reaches is the maximum distance from access site to furthest sampling reach in each direction along a tributary Cost = Fn(km by foot) 4 round trips required

8 distances in 5km intervals. Many areas require over 20 km hike Maximum distance is 33 km.

9 The sampling designs Index – sample the index reaches Simple random sampling – using the unbiased estimator Systematic sampling – sort tributaries in random order, systematically sample along resulting line. Stratify by Index – Sample independently within and outside the index regions. Adaptive cluster sampling – Choose segments with a simple random sample. If sampled sites have redds sample adjacent segments. Spatially balanced design – Based on EMAP design, though selecting segments within primary sampling units rather than points (not yet implemented)

10 Index sampling When the sample size is smaller than the overall size of the index region a simple random sample of the segments within the index is assumed. Two possibilities to estimate the number of redds from the index sample: 1.Assume there are no redds outside of the index – estimates will be too small. 2.Assume that the average number of redds per segment outside the index is the same inside and simply inflate the index estimator – estimates will be too large.

11 Bias of Inflating Estimator from Index Sample Redds

12 Systematic sampling Order the tributaries in random order along a line Choose sampling interval, k, so that final sample size is approximately n Select a random number, r, between 1 and k Sample reaches r, r+k, r+2k, …, r+(n-1)k Systematic sampling is cluster sampling where clusters are made up of units far apart in space and one cluster is sampled k rr+kr+2kr+4kr+3k

13 Stratify by Index Stratify by index and oversample index reaches Simple random sample in each stratum Allocation: –Equal allocation: Usually does not perform well –Proportional allocation: Does not oversample index sites so will probably not have good precision –Optimal allocation: need to know the standard deviation year proportion in index

14 Adaptive cluster sampling Original sample is simple random sample If sampled site meets criteria also sample sites in neighborhood –Criteria: presence of redds –Neighborhood: segments directly upstream and downstream Continue until sites do not meet criteria –Both legs of confluences in original sample 2 Meets criteria 4 include neighbor 6 and do not meet criteria 13 Final sample includes: 21346

15 Design SRS Cluster (1km) SYS STRS (optimal) ADAPT Results: Normalized standard error of estimators Run size SRS Cluster (1km) SYS STRS (optimal) ADAPT Coverage Probability (.95)

16 Costs SRS SRS-1km SYS STRS Adaptive sampling Adaptive

17 Precision per cost (10% sampling fraction) big is good: high precision per km traveled run size Precision per cost

18 Conclusions Stratifying by index results in the most precise estimates except in the large runs where systematic sampling seems to work best. The index sites should be oversampled in the stratified design. Proportional allocation (based on the size of the strata) results in poor precision. Although the systematic sampling strategy often is the most precise, there is not a good estimator for the variance. The estimator that assumes a simple random sample is conservative. Same pattern for different sampling fractions.

19 Conclusions The cluster sampling design is not very precise but reduces costs significantly. Adaptive cluster sampling is not as precise as other designs. –It is optimal for rare clustered populations –during small years the redds are not clustered enough –during large years they are not rare enough –only during the medium years does it compete with other designs. When cost and precision are analyzed together –small runs – either stratified by index or SRS-1km work best –large runs – either systematic or stratified by index work best

20 Not yet finished EMAP type design. successive difference variance estimator for the systematic sampling Adaptive sampling with same initial sample size (cost function does not penalize this much) Cost function –including road travel –crew trips/day units

21 Points vs. Lines Pick points -- points are picked along stream continuum and the measurement unit is constructed around the point advantages: –different size measurement units are easily implemented disadvantages: –difficulty with overlapping units –inadvertent variable probability design because of confluences and headwaters –Analysis may be complicated Pick Segments – Universe is segmented before sampling and segments are picked from population of segments advantages: –simple to implement –simple estimators disadvantages: –Difficult frame construction before sampling –Cannot accommodate varying lengths of sampling unit

22 Adaptive Cluster Sampling Use the draw-by-draw probability estimator: –Let w i be the average number of redds in the network of which segment i belongs, then –with variance Thompson 1992

23 COSTS Our costs are based on the number of kilometers traveled by foot. Each segment in the MF is assigned to an access point (this is not optimized in some rare instances the assigned access point is not the closest) and the distance along the stream from that access point is calculated There are two types of access points air fields and trailheads. For this exercise they both have the same price. Because we are tallying the number of km. along the streams, this cost function also models other types of sampling including via helicopter and raft.

24 Six years

25 Six years

26 Six years

27 Access to MFSR Roadless area Airplane access possible

28

29 air vs. car access

30 Index sample Not sure how to build estimates for total number of redds in Middle fork. –expand current estimator (assume same density outside of index) –use current estimate (assume 0 redds outside of index) year Number counted in Index Total number of redds

31

32 Stratify by Index Oversample index sites where most redds are located Simple random sample in each stratum Equal allocation: Proportional allocation: year coverage year coverage

33 Stratify by index Optimal allocation Using year proportion in index n index n other year coverage

34 Stratify by index Using year n index n other