Modeling silky shark bycatch

Slides:



Advertisements
Similar presentations
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Advertisements

CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Covariance and Correlation: Estimator/Sample Statistic: Population Parameter: Covariance and correlation measure linear association between two variables,
3.2 OLS Fitted Values and Residuals -after obtaining OLS estimates, we can then obtain fitted or predicted values for y: -given our actual and predicted.
Proposal for a comprehensive assessment of key shark species SAR-8-15 Photo:
CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.
Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics, 9e Managerial Economics Thomas Maurice.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Modeling Wim Buysse RUFORUM 1 December 2006 Research Methods Group.
Log-linear modeling and missing data A short course Frans Willekens Boulder, July
Lecture 4: Correlation and Regression Laura McAvinue School of Psychology Trinity College Dublin.
Final Review Session.
Linear statistical models 2008 Count data, contingency tables and log-linear models Expected frequency: Log-linear models are linear models of the log.
PSY 307 – Statistics for the Behavioral Sciences Chapter 7 – Regression.
Maximum likelihood (ML)
Log-linear analysis Summary. Focus on data analysis Focus on underlying process Focus on model specification Focus on likelihood approach Focus on ‘complete-data.
Investment Analysis and Portfolio Management Lecture 3 Gareth Myles.
Multivariate Probability Distributions. Multivariate Random Variables In many settings, we are interested in 2 or more characteristics observed in experiments.
Classification and Prediction: Regression Analysis
Generalized Linear Models
1 B. The log-rate model Statistical analysis of occurrence-exposure rates.
4-1 Continuous Random Variables 4-2 Probability Distributions and Probability Density Functions Figure 4-1 Density function of a loading on a long,
Introduction to Linear Regression and Correlation Analysis
Chapter 13: Inference in Regression
Overall agenda Part 1 and 2  Part 1: Basic statistical concepts and descriptive statistics summarizing and visualising data describing data -measures.
Chapter 7 Estimation: Single Population
Simple Linear Regression
Graphical Analysis. Why Graph Data? Graphical methods Require very little training Easy to use Massive amounts of data can be presented more readily Can.
AP Statistics Chapter 9 Notes.
Poisson Random Variable Provides model for data that represent the number of occurrences of a specified event in a given unit of time X represents the.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Introduction to DESeq and edgeR packages Peter A.C. ’t Hoen.
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
1 G Lect 8b G Lecture 8b Correlation: quantifying linear association between random variables Example: Okazaki’s inferences from a survey.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
Lecture 2 Forestry 3218 Lecture 2 Statistical Methods Avery and Burkhart, Chapter 2 Forest Mensuration II Avery and Burkhart, Chapter 2.
Forecasting Choices. Types of Variable Variable Quantitative Qualitative Continuous Discrete (counting) Ordinal Nominal.
Integral projection models
© Department of Statistics 2012 STATS 330 Lecture 20: Slide 1 Stats 330: Lecture 20.
Chapter 7. Control Charts for Attributes
Methodology Solving problems with known distributions 1.
Introduction to Statistical Inference Jianan Hui 10/22/2014.
IE 300, Fall 2012 Richard Sowers IESE. 8/30/2012 Goals: Rules of Probability Counting Equally likely Some examples.
Glenn Meyers ISO Innovative Analytics 2007 CAS Annual Meeting Estimating Loss Cost at the Address Level.
Correlation & Regression Analysis
Logistic Regression. Example: Survival of Titanic passengers  We want to know if the probability of survival is higher among children  Outcome (y) =
Logistic Regression Saed Sayad 1www.ismartsoft.com.
3.1 Statistical Distributions. Random Variable Observation = Variable Outcome = Random Variable Examples: – Weight/Size of animals – Animal surveys: detection.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Expected Return and Risk. Explain how expected return and risk for securities are determined. Explain how expected return and risk for portfolios are.
Biostatistics Class 3 Probability Distributions 2/15/2000.
Sampling Distributions
Chapter 4: Basic Estimation Techniques
4-1 Continuous Random Variables 4-2 Probability Distributions and Probability Density Functions Figure 4-1 Density function of a loading on a long,
Chapter 4 Basic Estimation Techniques
Statistical Modelling
Multiple Regression Prof. Andy Field.
Basic Estimation Techniques
Ch3: Model Building through Regression
Generalized Linear Models
Introduction to logistic regression a.k.a. Varbrul
Basic Estimation Techniques
ANALYSIS OF SKIPJACK CATCH PER UNIT OF EFFORT (CPUE) Mark N
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
What is Regression Analysis?
Estimating mean abundance from repeated presence-absence surveys
Jensen, et. al Winter distribution of blue crab Callinectes sapidus in Chesapeake Bay: application and cross-validation of a two-stage generalized.
Sampling Distributions
Presentation transcript:

Modeling silky shark bycatch Mihoko Minami (Institute of Statistical Mathematics, Tokyo) Cleridy Lennert-Cody (IATTC)

Why model silky shark bycatch? There is concern about possible negative effects of fisheries catch / bycatch on shark populations. Silky shark bycatch per set may index abundance, all other things being equal.... Modeling shark bycatch is a first step towards IATTC’s mandate to provide preliminary advice on key shark species involved in the purse-seine fishery (IATTC Resolution C-05-03). IATTC has more data on silky sharks than on other shark species.

Proceeding in two steps: Overall approach for modeling bycatch per set: use generalized linear and additive models to estimate trends. This has proved somewhat challenging because of the characteristics of bycatch per set. Proceeding in two steps: Explore different probability functions for the “random component” of generalized linear/additive models. Started with floating object set data; will expand analysis to dolphin and unassociated sets. 2) Explore in detail spatial/environmental effects.

Average silky shark bycatch per set by size category (unstandardized) with tons without tons

Number of floating object sets 1994-2004

Silky shark bycatch per set 1994-2004

Bycatch per set – floating object sets 0 10 20 ≥30 0 10 20 ≥30 0 10 20 ≥30 0 10 20 ≥30 0 10 20 ≥30 0 10 20 ≥30 0 10 20 ≥30 0 10 20 ≥30 0 10 20 ≥30 0 10 20 ≥30 0 10 20 ≥30

Probability functions used for modeling bycatch per set: Poisson negative binomial zero-inflated Poisson zero-inflated negative binomial The negative binomial is an extension of the Poisson distribution that can better model highly variable count data. Similarly, the zero-inflated negative binomial can be considered an extension of the zero-inflated Poisson. Zero-inflated probability functions are used to model data with both a large proportion of zero-valued observations and also large positive values. These are nested models, distributionally.

Perfect state (zero state); bycatch never occurs Zero-inflated models Perfect state (zero state); bycatch never occurs Frequency Imperfect state; bycatch could occur 1 5 10 15 20 25 ≥30 Bycatch per set

Poisson and negative binomial regression models Log-linear regression models were used to relate the mean bycatch per set (μ) to covariates: where values of covariates coefficients (parameters)

Zero-inflated regression models Two stage regression model: Which state does a set take (p = probability of set being in “perfect” state) ?   logistic regression model Amount of bycatch when set is in imperfect state (μ = mean bycatch in imperfect state)? negative binomial / Poisson regression model where values of covariates coefficients (parameters)

Data Floating object sets, 1994 – 2004 (32,148 sets) Data excluded: sets with bycatch reported in tons sets with no catch of target tunas repeat sets on same floating object sets missing data on predictors data for 1993 Predictor variables location (latitude, longitude), year, calendar date, time net depth, floating object depth sea surface temperature amount of tuna catch (log (tuna)) amount of non-silky shark bycatch (log(non-silky+1)) two proxies for floating object density

Model comparison Log-likelihood (training data) Generalized Information Criterion (test data) Poisson -81849 > 100000 Negative binomial -32572 65280 Zero-inflated Poisson -56389 Zero-inflated negative binomial (without smoothing) -32346 64827 -31862 63921

Partial dependence plot Number of animals per set Year

Interpretation of trends in bycatch per set is/will be complicated by... Species identification concerns (pre-2005): - Misidentification of silky sharks - “Unknown” category: what proportion were silky sharks? Floating object set data: true object density unknown. Effect of 2000 IATTC bycatch resolution on live release unknown (pre-2005). Existence of extremely large bycatches that are difficult to model.