CAN DIAGNOSTIC TESTS HELP IDENTIFY WHAT MODEL STRUCTURE IS MISSPECIFIED? Felipe Carvalho 1, Mark N. Maunder 2,3, Yi-Jay Chang 1, Kevin R. Piner 4, Andre.

Slides:



Advertisements
Similar presentations
Modeling Recruitment in Stock Synthesis
Advertisements

An exploration of alternative methods to deal with time-varying selectivity in the stock assessment of YFT in the eastern Pacific Ocean CAPAM – Selectivity.
Are the apparent rapid declines in top pelagic predators real? Mark Maunder, Shelton Harley, Mike Hinton, and others IATTC.
Sheng-Ping Wang 1,2, Mark Maunder 2, and Alexandre Aires-Da-Silva 2 1.National Taiwan Ocean University 2.Inter-American Tropical Tuna Commission.
An evaluation of alternative binning approaches for composition data in integrated stock assessments Cole Monnahan, Sean Anderson, Felipe Hurtado, Kotaro.
Tradeoffs between bias, model fits, and using common sense about biology and fishing behaviors when choosing selectivity forms Dana Hanselman and Pete.
Growth in Age-Structured Stock Assessment Models R.I.C. Chris Francis CAPAM Growth Workshop, La Jolla, November 3-7, 2014.
Model Selection for Selectivity in Fisheries Stock Assessments André Punt, Felipe Hurtado-Ferro, Athol Whitten 13 March 2013; CAPAM Selectivity workshop.
C3: Estimation of size-transition matrices with and without molt probability for Alaska golden king crab using tag–recapture data M.S.M. Siddeek, J. Zheng,
Model time step and species biology considerations for growth estimation in integrated stock assessments P. R. Crone and J. L. Valero Southwest Fisheries.
Dealing with interactions between area and year Mark Maunder IATTC.
Black Sea Bass – Northern Stock Coastal-Pelagic/ASMFC Working Group Review June 15, 2010.
The current status of fisheries stock assessment Mark Maunder Inter-American Tropical Tuna Commission (IATTC) Center for the Advancement of Population.
Reporter: Hsu Hsiang-Jung Modelling stochastic fish stock dynamics using Markov Chain Monte Carlo.
458 More on Model Building and Selection (Observation and process error; simulation testing and diagnostics) Fish 458, Lecture 15.
Are pelagic fisheries managed well? A stock assessment scientists perspective Mark Maunder and Shelton Harley Inter-American Tropical Tuna Commission
Case Study - Dover Sole Range from Baja California to the Bering Sea. On mud or muddy-sand, at 35 to 1400 m depths. Feed on polychaete worms, shrimp, brittle.
Hui-Hua Lee 1, Kevin R. Piner 1, Mark N. Maunder 2 Evaluation of traditional versus conditional fitting of von Bertalanffy growth functions 1 NOAA Fisheries,
How many conditional age-at-length data are needed to estimate growth in stock assessment models? Xi He, John C. Field, Donald E. Pearson, and Lyndsey.
The (potential) value and use of empirical estimates of selectivity in integrated assessments John Walter, Brian Linton, Will Patterson and Clay Porch.
Time-Varying vs. Non-Time- Varying Growth in the Gulf of Mexico King Mackerel Stock Assessment: a Case Study Southeast Fisheries Science Center Jeff Isely,
Modeling Parameters in Stock Synthesis Modeling population processes 2009 IATTC workshop.
WP4: Models to predict & test recovery strategies Cefas: Laurence Kell & John Pinnegar Univ. Aberdeen: Tara Marshall & Bruce McAdam.
Maximum likelihood estimates of North Pacific albacore tuna ( Thunnus alalunga ) von Bertalanffy growth parameters using conditional-age-at-length data.
Evaluation of a management measure restricting catch below 60cm Mark N. Maunder and Patrick Tomlinson.
Investigating the Accuracy and Robustness of the Icelandic Cod Assessment and Catch Control Rule A. Rosenberg, G. Kirkwood, M. Mangel, S. Hill and G. Parkes.
Pacific Hake Management Strategy Evaluation Joint Technical Committee Northwest Fisheries Science Center, NOAA Pacific Biological Station, DFO School of.
ASSESSMENT OF BIGEYE TUNA (THUNNUS OBESUS) IN THE EASTERN PACIFIC OCEAN January 1975 – December 2006.
Use of multiple selectivity patterns as a proxy for spatial structure Felipe Hurtado-Ferro 1, André E. Punt 1 & Kevin T. Hill 2 1 University of Washington,
Spatial issues in WCPO stock assessments (bigeye and yellowfin tuna) Simon Hoyle SPC.
Evaluation of a practical method to estimate the variance parameter of random effects for time varying selectivity Hui-Hua Lee, Mark Maunder, Alexandre.
WP 2.4 Evaluation of NMFS Toolbox Assessment Models on Simulated Groundfish Data Sets Comparative Simulation Tests Overview Brooks, Legault, Nitschke,
P. R. Crone, M. N. Maunder, B. X. Semmens, and J. L. Valero Center for the Advancement of Population Assessment Methodology (CAPAM) Southwest Fisheries.
Workshop on Stock Assessment Methods 7-11 November IATTC, La Jolla, CA, USA.
Simulated data sets Extracted from:. The data sets shared a common time period of 30 years and age range from 0 to 16 years. The data were provided to.
USING INDICATORS OF STOCK STATUS WHEN TRADITIONAL REFERENCE POINTS ARE NOT AVAILABLE: EVALUATION AND APPLICATION TO SKIPJACK TUNA IN THE EASTERN PACIFIC.
Fisheries 101: Modeling and assessments to achieve sustainability Training Module July 2013.
M.S.M. Siddeeka*, J. Zhenga, A.E. Puntb, and D. Pengillya
U.S. Department of Commerce | National Oceanic and Atmospheric Administration | NOAA Fisheries | Page 1 Model Misspecification and Diagnostics and the.
The effect of variable sampling efficiency on reliability of the observation error as a measure of uncertainty in abundance indices from scientific surveys.
Extending length-based models for data-limited fisheries into a state-space framework Merrill B. Rudd* and James T. Thorson *PhD Student, School of Aquatic.
Integrating movement information from tagging data into stock assessment models: a review and perspective Marriot Hotel, La Jolla, 4-7 October 2011.
The Influence of Spatial Dynamics on Predation Mortality of Bering Sea Walleye Pollock Pat Livingston, Paul Spencer, Troy Buckley, Angie Greig, and Doug.
1 Yi-Jay Chang 2 Brian Langseth 3 Mark Maunder 1 Felipe Carvalho Performance of a stock assessment model with misspecified time-varying growth 1 – JIMAR,
Estimation of growth within stock assessment models: implications when using length composition data Jiangfeng Zhu a, Mark N. Maunder b, Alexandre M. Aires-da-Silva.
ASSESSMENT OF BIGEYE TUNA (THUNNUS OBESUS) IN THE EASTERN PACIFIC OCEAN January 1975 – December 2005.
Age and Growth of Pacific Sardine in California During a Period of Stock Recovery and Geographical Expansion By Emmanis Dorval Jenny McDaniel Southwest.
Simulation analysis using stock synthesis Maunder, Piner, and Lee.
Simulation of methods to account for spatial effects in the stock assessment of Pacific bluefin tuna Cast by: Hui-hua Lee (NOAA Fisheries, SWFSC) Kevin.
Using distributions of likelihoods to diagnose parameter misspecification of integrated stock assessment models Jiangfeng Zhu * Shanghai Ocean University,
Estimation of selectivity in Stock Synthesis: lessons learned from the tuna stock assessment Shigehide Iwata* 1 Toshihde Kitakado* 2 Yukio Takeuchi* 1.
Some Insights into Data Weighting in Integrated Stock Assessments André E. Punt 21 October 2015 Index-1 length-4.
Yellowfin Tuna Major Changes Catch, effort, and length-frequency data for the surface fisheries have been updated to include new data for 2005.
Chapter 7: The Distribution of Sample Means
Data weighting and data conflicts in fishery stock assessments Chris Francis Wellington, New Zealand CAPAM workshop, “ Data conflict and weighting, likelihood.
Influence of selectivity and size composition misfit on the scaling of population estimates and possible solutions: an example with north Pacific albacore.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
NWFSC A short course on data weighting and process error in Stock Synthesis Allan Hicks CAPAM workshop October 19, 2015.
Day 4, Session 1 Abundance indices, CPUE, and CPUE standardisation
Recommended modeling approach Version 2.0. The law of conflicting data Axiom Data is true Implication Conflicting data implies model misspecification.
Day 3 Session 1 Model fitting. So, how do you assess a stock assessment model? There are five key questions you should ask yourself: 1.Assumptions What.
Population Dynamics and Stock Assessment of Red King Crab in Bristol Bay, Alaska Jie Zheng Alaska Department of Fish and Game Juneau, Alaska, USA.
Is down weighting composition data adequate to deal with model misspecification or do we need to fix the model? Sheng-Ping Wang, Mark N. Maunder National.
Rainer Froese HOSST-TOSST Seminar 07 April 2016 GEOMAR, Kiel, Germany
Pacific-Wide Assessment of Bigeye Tuna
ASAP Review and Discussion
SAFS Quantitative Seminar
Effects of Catch-at-Age Sample Size on Gulf of Mexico Gray Triggerfish Spawning Stock Biomass Estimates Jeff Isely SEFSC Miami.
JABBA-Select: Performance evaluation of JABBA-Select against and an age-structured simulator and estimation model Henning Winker* MARAM International.
JABBA-Select: Simulation-Testing Henning Winker
Presentation transcript:

CAN DIAGNOSTIC TESTS HELP IDENTIFY WHAT MODEL STRUCTURE IS MISSPECIFIED? Felipe Carvalho 1, Mark N. Maunder 2,3, Yi-Jay Chang 1, Kevin R. Piner 4, Andre E. Punt 5 1 PIFSC - Pacific Islands Fisheries Science Center 2 Inter-American Tropical Tuna Commission 3 Center for the Advancement of Population Assessment Methodology 4 SWFSC – Southwest Fisheries Science Center 5 University of Washington Data weighting workshop – La Jolla, October 2015

Outline  Introduction Data conflict Model misspecification Diagnostics  Objectives  Methods Study case – Western Central Pacific Ocean striped marlin stock assessment Simulation approach Estimation models misspecification Model diagnostics  Preliminary results  Conclusions and further research

Introduction  Data conflicts Data conflicts occur when the objective function components from different data sources achieve minima at different values for a given parameter M. Ichinokawa et al.(2014)

Introduction  Model misspecification Apparent data conflicts in integrated stock assessment models can occur for three main reasons: 1) random sampling error, 2) misspecification of the observation model, and 3) misspecification of the system dynamics model.

Introduction SS3 Hospital Determine when a model needs additional or alternative structure to eliminate model misspecification and conflict between components

Introduction  Model diagnostic However, still important to develop a standard set of diagnostics for stock assessment models that will improve their performance and acceptance.

Introduction Can model diagnostics really help identify when a model is misspecified? What model structure is misspecified?

Objectives So what we want to show on this study is what the diagnostics from a correct specified model looks like compared to diagnostics from an uncorrected misspecified model.

Methods: study case Stock assessment for striped marlin (kajikia audax) in the western and central north pacific ocean through 2013.

Methods: study case Stock assessment for striped marlin (kajikia audax) in the western and central north pacific ocean through 2013.

Methods: study case Parameter (units)Value Natural mortality (yr -1 ) 0.54 (age 0) 0.47 (age 1) 0.43 (age 2) 0.40 (age 3) 0.38 (age 4-15) Spawner-recruit relationshipBeverton-Holt Spawner-recruit steepness (h)0.87 (Fixed) Selectivity Logistic and Double-normal (time-varying)

Methods: Data used Stock assessment for striped marlin (kajikia audax) in the western and central north pacific ocean through 2013.

Methods: Data used Stock assessment for striped marlin (kajikia audax) in the western and central north pacific ocean through 2013 (SIMPLIFIED)

Methods: Data used Stock assessment for striped marlin (kajikia audax) in the western and central north pacific ocean through 2013 (SIMPLIFIED)

Methods: Simulation Generating data from “True” assessment using SS3 Operating model Ctl file Dat file Starter file data.ss_new Boot n th Estimation model Ctl file Bootstrap Batch file script Par file (e.g., recruitment dev.)

Methods: Simulation Parameter (units)Value (“True”)Value (EM_01) Value (EM_02) Value (EM_03) Natural mortality (yr -1 ) 0.54 (age 0) 0.47 (age 1) 0.43 (age 2) 0.40 (age 3) 0.38 (age 4-15) 0.54 (age 0) 0.47 (age 1) 0.43 (age 2) 0.40 (age 3) 0.38 (age 4-15) 0.54 (age 0) 0.47 (age 1) 0.43 (age 2) 0.40 (age 3) 0.38 (age 4-15) 0.38 (All ages) Spawner-recruit relationshipBeverton-Holt Spawner-recruit steepness (h)0.87 (Fixed) 0.70 (Fixed)0.87 (Fixed) Selectivity (Fleet 1)Double-normal Selectivity (Fleet 2)Double-normalAsymptoticDouble-normal Selectivity (Fleet 3)Double-normal Scenarios

Methods: Diagnostics Fig 5. The SDNR by itself is not a godd measure of goodness of fit. The SNDR is exactly the same in both panels but the residual patterns indicate a good fit in panel (a), and a poor fit in panel (b).

Methods: Diagnostics 2) The Pinner method (Pinner et al. 2011) Diagnostic technique based on simulation analysis; Evaluate if an estimated parameter is outside the bounds of a simulated distribution (two-sided test) Fig 3.

Methods: Diagnostics where

Methods: Diagnostics

Results 1) Standard deviation of the normalized residuals (SDNR) The SDNR diagnostic indicated that all misspecified estimation models did fit the indices well;

Results 2) The Piner diagnostic Distributions of SPB_last year estimated from three replicate models for each EM

Results 2) The Piner diagnostic Misspecification of h reflecting a less resilient stock (h = 0.7) had significant impact on the population dynamics. The true value of spawning biomass (based on h = 0.85) always lay below the average simulated estimates.

Results 4) Retrospective patterns

Results EM_01 EM_02

Results SourceTrueEM_01EM_02EM_03 Catch7868 Survey9676 Length comp8869 R-pen10

Conclusions and further research The diagnostics tested were not able to correctly identify misspecification on selectivity and mortality. Some misspecifications did not greatly influenced the population dynamics (e.g. CPUE trends and length comp are almost identical to the true model). EM_01EM_03 The Pinner method and retrospective analysis were able to identify misspecification on h

Conclusions and further research Increasing the effect of the misspecification on model results, might also increase the chances of proposed diagnostics to detect the misspecification. Increase the number of model misspecification scenarios to address common issues in integrated stock assessment (e.g. time varying catchability, time varying growth) Increase the number of diagnostics Age-structured production model Calibrated simulation …and others Apply this diagnostics simulation testing in stock assessment of species with other life-history types (e.g. slow growth) Next step….

Mahalo !