The effect of variable sampling efficiency on reliability of the observation error as a measure of uncertainty in abundance indices from scientific surveys.

Slides:



Advertisements
Similar presentations
Sampling Design, Spatial Allocation, and Proposed Analyses Don Stevens Department of Statistics Oregon State University.
Advertisements

Properties of Least Squares Regression Coefficients
Mean, Proportion, CLT Bootstrap
RELIABILITY Reliability refers to the consistency of a test or measurement. Reliability studies Test-retest reliability Equipment and/or procedures Intra-
Errors in Chemical Analyses: Assessing the Quality of Results
Sheng-Ping Wang 1,2, Mark Maunder 2, and Alexandre Aires-Da-Silva 2 1.National Taiwan Ocean University 2.Inter-American Tropical Tuna Commission.
Modeling fisheries and stocks spatially for Pacific Northwest Chinook salmon Rishi Sharma, CRITFC Henry Yuen, USFWS Mark Maunder, IATTC.
3.2 OLS Fitted Values and Residuals -after obtaining OLS estimates, we can then obtain fitted or predicted values for y: -given our actual and predicted.
Spatial distribution and ontogenetic movement of walleye pollock in the eastern Bering Sea Presenters: Troy Buckley, Angie Greig, James Ianelli, Patricia.
Dealing with interactions between area and year Mark Maunder IATTC.
The current status of fisheries stock assessment Mark Maunder Inter-American Tropical Tuna Commission (IATTC) Center for the Advancement of Population.
Reporter: Hsu Hsiang-Jung Modelling stochastic fish stock dynamics using Markov Chain Monte Carlo.
For stimulus s, have estimated s est Bias: Cramer-Rao bound: Mean square error: Variance: Fisher information How good is our estimate? (ML is unbiased:
458 More on Model Building and Selection (Observation and process error; simulation testing and diagnostics) Fish 458, Lecture 15.
458 Fitting models to data – II (The Basics of Maximum Likelihood Estimation) Fish 458, Lecture 9.
Multicollinearity Omitted Variables Bias is a problem when the omitted variable is an explanator of Y and correlated with X1 Including the omitted variable.
Climate, Ecosystems, and Fisheries A UW-JISAO/Alaska Fisheries Science Center Collaboration Jeffrey M. Napp Alaska Fisheries Science Center NOAA Fisheries.
Ratio estimation with stratified samples Consider the agriculture stratified sample. In addition to the data of 1992, we also have data of Suppose.
Hui-Hua Lee 1, Kevin R. Piner 1, Mark N. Maunder 2 Evaluation of traditional versus conditional fitting of von Bertalanffy growth functions 1 NOAA Fisheries,
Quality Assurance in the clinical laboratory
1 Psych 5500/6500 Statistics and Parameters Fall, 2008.
Objectives of Multiple Regression
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
Ch4 Describing Relationships Between Variables. Pressure.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
A retrospective investigation of selectivity for Pacific halibut CAPAM Selectivity workshop 14 March, 2013 Ian Stewart & Steve Martell.
Managerial Economics Demand Estimation & Forecasting.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Biostatistics: Sampling strategies Data collection for fisheries assessment: Monitoring and sampling strategies.
1 Motivation Motivation SST analysis products at NCDC SST analysis products at NCDC  Extended Reconstruction SST (ERSST) v.3b  Daily Optimum Interpolation.
Workshop on Stock Assessment Methods 7-11 November IATTC, La Jolla, CA, USA.
STA Lecture 171 STA 291 Lecture 17 Chap. 10 Estimation – Estimating the Population Proportion p –We are not predicting the next outcome (which is.
Understanding Your Data Set Statistics are used to describe data sets Gives us a metric in place of a graph What are some types of statistics used to describe.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
Lecture 6: Point Interpolation
U.S. Department of Commerce | National Oceanic and Atmospheric Administration | NOAA Fisheries | Page 1 Model Misspecification and Diagnostics and the.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Modeling biological-composition time series in integrated stock assessments: data weighting considerations and impact on estimates of stock status P. R.
1 Federal Research Centre for Fisheries Institute for Sea Fisheries, Hamburg Hans-Joachim Rätz Josep Lloret Institut de Ciències del Mar, Barcelona Long-term.
Using distributions of likelihoods to diagnose parameter misspecification of integrated stock assessment models Jiangfeng Zhu * Shanghai Ocean University,
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Sampling Designs Outline
CAN DIAGNOSTIC TESTS HELP IDENTIFY WHAT MODEL STRUCTURE IS MISSPECIFIED? Felipe Carvalho 1, Mark N. Maunder 2,3, Yi-Jay Chang 1, Kevin R. Piner 4, Andre.
Some Insights into Data Weighting in Integrated Stock Assessments André E. Punt 21 October 2015 Index-1 length-4.
Lecture 10 review Spatial sampling design –Systematic sampling is generally better than random sampling if the sampling universe has large-scale structure.
A bit of history Fry 1940s: ”virtual population”, “catch curve”
Data weighting and data conflicts in fishery stock assessments Chris Francis Wellington, New Zealand CAPAM workshop, “ Data conflict and weighting, likelihood.
Survey sampling Outline (1 hr) Survey sampling (sources of variation) Sampling design features Replication Randomization Control of variation Some designs.
NWFSC A short course on data weighting and process error in Stock Synthesis Allan Hicks CAPAM workshop October 19, 2015.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Day 4, Session 1 Abundance indices, CPUE, and CPUE standardisation
CPUE analysis methods, progress and plans for 2011 Simon Hoyle.
Statistical Concepts Basic Principles An Overview of Today’s Class What: Inductive inference on characterizing a population Why : How will doing this allow.
Recommended modeling approach Version 2.0. The law of conflicting data Axiom Data is true Implication Conflicting data implies model misspecification.
Population Dynamics and Stock Assessment of Red King Crab in Bristol Bay, Alaska Jie Zheng Alaska Department of Fish and Game Juneau, Alaska, USA.
Survey Data Conflicts and Bias and Temporal Variation of Model Parameters of St. Matthew Island Blue King Crab J. Zheng, D. Pengilly and V. A. Vanek ADF&G,
Is down weighting composition data adequate to deal with model misspecification or do we need to fix the model? Sheng-Ping Wang, Mark N. Maunder National.
Single Season Study Design. 2 Points for consideration Don’t forget; why, what and how. A well designed study will:  highlight gaps in current knowledge.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Bias-Variance Analysis in Regression  True function is y = f(x) +  where  is normally distributed with zero mean and standard deviation .  Given a.
The simple linear regression model and parameter estimation
SUR-2250 Error Theory.
Estimating Population Size
ANALYSIS OF SKIPJACK CATCH PER UNIT OF EFFORT (CPUE) Mark N
Report of the Scientific and Statistical Committee
SWEPT AREA METHOD.
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

The effect of variable sampling efficiency on reliability of the observation error as a measure of uncertainty in abundance indices from scientific surveys. Authors: Stan Kotwicki and Kotaro Ono 1 We cannot solve our problems with the same thinking we used when we created them. (Albert Einstein)

Survey sampling efficiency Sampling efficiency (q e ) = survey selectivity * catchability Sampling efficiency (q e ) = survey selectivity * catchability Variable in time and space. Impossible to design surveys with constant q e,, because multiple factors are affecting it (e.g. variation in the geometry of the trawl, variation in fish behavior in response to the gear, variation in fish behavior in response to the environment, patchiness) Variable in time and space. Impossible to design surveys with constant q e,, because multiple factors are affecting it (e.g. variation in the geometry of the trawl, variation in fish behavior in response to the gear, variation in fish behavior in response to the environment, patchiness) We may have to accept variable q e and learn how to deal with it We may have to accept variable q e and learn how to deal with it Variability in q e could be just random (unlikely) or have random and non-random components (i.e. depend on multiple variables; likely). Variability in q e could be just random (unlikely) or have random and non-random components (i.e. depend on multiple variables; likely). To date all studies of q e show random component To date all studies of q e show random component Increased number of studies show q e dependence on some variables Increased number of studies show q e dependence on some variables None of the studies show constant q e None of the studies show constant q e 2

Examples 3 Snow crab Pollock Kotwicki, S, Horne, J. K., Punt, A. E., and Ianelli, J.N Factors affecting the availability of walleye pollock to acoustic and bottom trawl survey gear. ICES J. Mar. Sci. Somerton, D.A., Weinberg, K.L. and Goodman, S.E Catchability of snow crab (Chionoecetes opilio) by the eastern Bering Sea bottom trawl survey estimated using a catchcomparison experiment

4 Spatial distribution of q e What exactly u and σ mean in this context? What is effect on age composition data? The formula to estimate variance for index of abundance from this survey does not exist. Kotwicki, S, Ressler, P.H., Ianelli J. N., Punt, A. E., and Horne, J. K. In review. Combining data from bottom trawl and acoustic surveys to improve reliability of the abundance estimates. CJFAS.

Relative weights can change 5 Kotwicki, S., Ianelli, J. N., and Punt, A. E Correcting density-dependent effects in abundance estimates from bottom trawl surveys. ICES J. Mar. Sci. 71:

Simulating species distribution Based on Pollock Based on Pollock Fit spatio-temporal model to survey data (Thorson et al. 2015, Ono et al. 2015) Fit spatio-temporal model to survey data (Thorson et al. 2015, Ono et al. 2015) Create map of predicted species distribution (MCMC) Create map of predicted species distribution (MCMC) 6

7

Simulating surveys Assumed SRS design Assumed SRS design 376 samples over the survey area (u i ) 376 samples over the survey area (u i ) q e gamma distributted q e gamma distributted Statistics: Statistics: Survey mean and variance True mean and variance 8

Years2005 – 2014 Sampling efficiency ( q e ) 0.01, 0.05, 0.1, 0.2, 0.4, 0.6, 0.8, 1, 1.5, 2, 2.5, 3 Variance in q e , 0.01, 0.1, 0.2, 0.4, 0.6, 0.8, 1, 2 Density dependent q e (assuming q e = 1 at low densities) 1 (strong), 100, 500, 2000, (weak) 9 Simulating surveys

Biased mean for low q e surveys, Biased mean for low q e surveys, For surveys with high q e survey mean is unbiased For surveys with high q e survey mean is unbiased Increase in variance of the mean when q e low Increase in variance of the mean when q e low 10 Deviation of survey mean from true mean

High variation in q e - higher observation error. High variation in q e - higher observation error. Not much concern for surveys with high q e. Not much concern for surveys with high q e. Concern: High variation in q e - increased variance in survey CV estimate. Concern: High variation in q e - increased variance in survey CV estimate. 11 Survey CV

Low q e - SD biased low. High q e - SD unbiased. Low q e - SD biased low. High q e - SD unbiased. Increase in V(q e ) – increase V(SD) Increase in V(q e ) – increase V(SD) 12 Deviation of survey SD from true SD

Constant efficiency does not assure good precision of the SD estimate. Constant efficiency does not assure good precision of the SD estimate. Increase in V(q e ) – increase V(SD) Increase in V(q e ) – increase V(SD) 13 CV of survey SD

Are these just spurious? 14

Strong density dependence – survey CV biased low. Strong effects! Strong density dependence – survey CV biased low. Strong effects! Density dependent q e - hyperstable index, but appears highly precise. Density dependent q e - hyperstable index, but appears highly precise. 15 Density dependent q e

More on density dependent q e Not much has been done. 4 studies 5 species (all semipelagic), all show density dependent (hyperstable) q e for bottom trawl. Not much has been done. 4 studies 5 species (all semipelagic), all show density dependent (hyperstable) q e for bottom trawl. Environmental effects on q e most likely result in some form of density dependent q e because environment affects both q e and fish distribution. So environmentally induced variation in q e will likely result in decrease in both accuracy and precision of survey variance estimates. Environmental effects on q e most likely result in some form of density dependent q e because environment affects both q e and fish distribution. So environmentally induced variation in q e will likely result in decrease in both accuracy and precision of survey variance estimates. 16

Sampling processes are not represented in the main structure of the stock assessment models (Maunder and Piner 2014) but they may be a major contributor to the total uncertainty of the mean, variance and age composition derived from scientific surveys. 17 State of the art survey design based variance estimators (SRS, Stratified, Cluster, geostatistical, etc) do not account for the variance in q e., hence they maybe biased and imprecise. State of the art survey design based variance estimators (SRS, Stratified, Cluster, geostatistical, etc) do not account for the variance in q e., hence they maybe biased and imprecise. It maybe advisable to redirect efforts from the design based variance estimates to estimates of total survey variance. It maybe advisable to redirect efforts from the design based variance estimates to estimates of total survey variance. Reliability of survey derived abundance estimates should not be assessed using CV estimated from observation error alone Reliability of survey derived abundance estimates should not be assessed using CV estimated from observation error alone We still need to look into effect of variation in availability due to limited survey coverage. We still need to look into effect of variation in availability due to limited survey coverage.

What to do? Account for the uncertainty in the observation error (weights) estimates in the stock assessments? Account for the uncertainty in the observation error (weights) estimates in the stock assessments? Estimate q e, and V(q e ) Estimate q e, and V(q e ) Incorporate sampling process into stock assessments? Incorporate sampling process into stock assessments? Don’t diss the survey, but understand the implications of variable q e on the statistics derived from survey data. Don’t diss the survey, but understand the implications of variable q e on the statistics derived from survey data. 18

Questions for discussion Knowing that SD estimates may be biased and uncertain is it still a good practice to weight indices of abundance using SD? Knowing that SD estimates may be biased and uncertain is it still a good practice to weight indices of abundance using SD? How to deal with uncertainty in the SD estimates, when this uncertainty can be estimated? How to deal with uncertainty in the SD estimates, when this uncertainty can be estimated? Is the adjusting variance for indices estimates a good practice? Does it inflate the variance of the index of abundance? Is the adjusting variance for indices estimates a good practice? Does it inflate the variance of the index of abundance? 19

Thanks 20

How to evaluate surveys? 21 Observation error may not reflect completely total uncertainty of the index of abundance. Therefore using observation error (sampling variance) to weight indices may lead to biases in stock assessments. Examples of possible sources of additional uncertainty in survey index: Catchability variable in time and space due to environmental effects Catchability variable in time and space due to environmental effects Density dependent sampling efficiency Density dependent sampling efficiency Survey does not encompass entire population Survey does not encompass entire population Correlation in age and length data Correlation in age and length data

Maunder and Piner 2014 Temporal trends in catchability (e.g. Harley et al. 2001) in addition to uncertainty in mean catchability are particularly problematic, since they will bias estimates of depletion levels. Therefore, uncertainty in both the average level of catchability and the variation over time can contribute substantially to the uncertainty in stock assessment results and estimates of management quantities. Temporal trends in catchability (e.g. Harley et al. 2001) in addition to uncertainty in mean catchability are particularly problematic, since they will bias estimates of depletion levels. Therefore, uncertainty in both the average level of catchability and the variation over time can contribute substantially to the uncertainty in stock assessment results and estimates of management quantities. Process error is additional variability in the population (e.g. recruitment), fishing (e.g. selectivity), or sampling processes (e.g. survey catchability) that are not represented by the main structure of the model. Process error is additional variability in the population (e.g. recruitment), fishing (e.g. selectivity), or sampling processes (e.g. survey catchability) that are not represented by the main structure of the model. One example is the inflation of standard deviations for survey data because of temporal variability in catchability due to factors such as the environmental conditions. Another is the reduction in the effective sample size of composition data due to unmodelled correlation in the sampling process (i.e. many species school by size and repeated samples from a purse-seine set on a single school will be correlated). One example is the inflation of standard deviations for survey data because of temporal variability in catchability due to factors such as the environmental conditions. Another is the reduction in the effective sample size of composition data due to unmodelled correlation in the sampling process (i.e. many species school by size and repeated samples from a purse-seine set on a single school will be correlated). 22

Effect on age composition 23 Kotwicki, S., Ianelli, J. N., and Punt, A. E Correcting density-dependent effects in abundance estimates from bottom trawl surveys. ICES J. Mar. Sci. 71:

Main findings Catchability of BT and acoustic surveys is variable in time and space. Catchability of BT and acoustic surveys is variable in time and space. Survey standardization not enough Survey standardization not enough Abundance estimates can be corrected for variable catchability but only if … Abundance estimates can be corrected for variable catchability but only if … Only combined estimates provided reliable abundance estimate corrected for variable catchability Only combined estimates provided reliable abundance estimate corrected for variable catchability Methodology can be used for studies of vertical distribution of semipelagic species Methodology can be used for studies of vertical distribution of semipelagic species 24

Survey variance simply explained. What is it? What is it? Why is it important? Why is it important? Where it comes from? Where it comes from? How to estimate it? How to estimate it? 25

Mark Maunder guidance. Don’t naively down-weight the data Don’t naively down-weight the data - Don’t naively weight the data using incorrect estimate of variance - Don’t naively weight the data using incorrect estimate of variance Data recommendations: design surveys to have constant asymptotic selectivity, estimate q. Data recommendations: design surveys to have constant asymptotic selectivity, estimate q. - Estimating q is hard but usually possible. - Estimating q is hard but usually possible. - Designing surveys to have constant asymptotic selectivity may be impossible - Designing surveys to have constant asymptotic selectivity may be impossible - Design surveys to minimize variation in sampling efficiency - Design surveys to minimize variation in sampling efficiency 26

27 Can you see elephant now?

28

29 Is there an elephant in the room? V(q e )