Krishna Pacifici Department of Applied Ecology NCSU January 10, 2014.


Designing studies Why, what, and how? Why collect the data? What type of data to collect? How should the data be collected in the field and then analyzed? Clear objectives help relate all three components.

Why? Clear objectives How will the data be used to discriminate between scientific hypotheses about a system? How will the data be used to make management decisions? For example: Determine the overall level of occupancy for a species in a particular region. Compare the level of occupancy in two different habitat types within that region.

What? Many kinds of data Population-level Population size/density Survival Immigration & emigration Presence/absence Community-level Persistence Colonization & extinction Species richness/diversity

How? Sampling and Modeling Interest lies in making inference from a sample to a population Statistics! Want it to be repeatable and accurate Others should understand what you have done and be able to replicate Many different modeling/analysis approaches Distance sampling, multiple observer, capture-recapture, occupancy modeling…

PURPOSES OF SAMPLING ESTIMATE ATTRIBUTES (PARAMETERS) Abundance/ density Survival Occurrence probability ALLOW LEGITIMATE EXTRAPOLATION FROM DATA TO POPULATIONS PROVIDE MEASURES OF STATISTICAL RELIABILITY

SAMPLING NEEDS TO BE ACCURATE– LEADING TO UNBIASED ESTIMATES REPEATABLE– ESTIMATES LEAD TO SIMILAR ANSWERS EFFICIENT– DO NOT WASTE RESOURCES

BIAS HOW GOOD “ON AVERAGE” AN ESTIMATE IS CANNOT TELL FROM A SINGLE SAMPLE DEPENDS ON SAMPLING DESIGN, ESTIMATOR, AND ASSUMPTIONS

UNBIASED [Figure: sample estimates (*) shown with the average estimate and the true value]

BIASED [Figure: sample estimates (*) shown with the average estimate and the true value; bias is the gap between average estimate and true value]

REPEATABLE (PRECISE) [Figure: sample estimates (*)]

NOT REPEATABLE (IMPRECISE) [Figure: sample estimates (*)]

CAN BE IMPRECISE BUT UNBIASED... OR [Figure: sample estimates (*), average estimate, true value]

PRECISELY BIASED... OR [Figure: sample estimates (*), average estimate, true value]

IMPRECISE AND BIASED! [Figure: sample estimates (*), average estimate, true value]

ACCURATE = UNBIASED & PRECISE [Figure: sample estimates (*), average estimate, true value]
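The four combinations of bias and precision on the preceding slides can be illustrated with a small simulation. This is only a sketch: the true value, the offsets, and the spreads are invented for illustration, and the estimates are drawn from a normal distribution purely for convenience. It reports bias, variance, and mean squared error (which equals bias squared plus variance) for each scenario.

```python
import random
import statistics

TRUE_VALUE = 100.0  # hypothetical true population density (assumption)

def simulate_estimates(offset, spread, n_rep=2000, seed=1):
    """Draw repeated estimates centred on TRUE_VALUE + offset."""
    rng = random.Random(seed)
    return [rng.gauss(TRUE_VALUE + offset, spread) for _ in range(n_rep)]

def summarize(estimates, true_value=TRUE_VALUE):
    """Return (bias, variance, mean squared error) of repeated estimates."""
    mean_est = statistics.fmean(estimates)
    bias = mean_est - true_value
    variance = statistics.pvariance(estimates)
    mse = statistics.fmean([(e - true_value) ** 2 for e in estimates])
    return bias, variance, mse

# (offset, spread) pairs are illustrative values, not from the slides.
scenarios = {
    "unbiased and precise (accurate)": (0.0, 2.0),
    "unbiased but imprecise": (0.0, 15.0),
    "precisely biased": (10.0, 2.0),
    "imprecise and biased": (10.0, 15.0),
}

results = {
    name: summarize(simulate_estimates(offset, spread))
    for name, (offset, spread) in scenarios.items()
}

for name, (bias, variance, mse) in results.items():
    print(f"{name:32s} bias={bias:7.2f} variance={variance:8.2f} mse={mse:8.2f}")
```

A single simulated estimate says nothing about bias; it only shows up when many repeated estimates are averaged, which is the point of the "cannot tell from a single sample" slide.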

HOW DO WE MAKE ESTIMATES ACCURATE ? KEEP BIAS LOW SAMPLE TO ADEQUATELY REPRESENT POPULATION ACCOUNT FOR DETECTION KEEP VARIANCE LOW REPLICATION (ADEQUATE SAMPLE SIZE) STRATIFICATION, RECORDING OF COVARIATES, BLOCKING

Key Issues Spatial sampling Proper consideration and incorporation of detectability

Sampling principles What is the objective? What is the target population? What are the appropriate sampling units? Size, shape, placement Quantities measured

Remember Field sampling must be representative of the population of inference Incomplete detection MUST be accounted for in sampling and estimation

What is the objective? Unbiased estimate of population density of snakes (e.g., cobras) in Corbett National Park Coefficient of variation of estimate < 20% As cost efficient as possible
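As a rough illustration of the precision target, the coefficient of variation (CV) is the standard error of the estimate divided by the estimate itself. The sketch below uses made-up counts from replicate plots and assumes, unrealistically, that every animal on a plot is counted; the function name and the data are hypothetical.

```python
import math
import statistics

def density_estimate(counts, plot_area):
    """Return (mean density, standard error, CV) from replicate plot counts.

    Assumes perfect detection on each plot, which the later slides
    warn is rarely true in practice.
    """
    densities = [c / plot_area for c in counts]
    mean = statistics.fmean(densities)
    se = statistics.stdev(densities) / math.sqrt(len(densities))
    return mean, se, se / mean

# Hypothetical counts on 8 plots of 1 ha each.
counts = [2, 0, 3, 1, 4, 2, 1, 3]
mean, se, cv = density_estimate(counts, plot_area=1.0)
print(f"density={mean:.2f}/ha  SE={se:.3f}  CV={cv:.1%}")
print("meets CV < 20% target:", cv < 0.20)
```

With these invented counts the CV is a little above 20%, so more replicate plots (or a more efficient design) would be needed to meet the stated objective.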

What is the target population? Population in the NP

What are the appropriate sampling units? Quadrats? Point samples? Line transects?

Sampling units: nonrandom placement [Figure label: Road]

Nonrandom placement Advantages Easy to lay out More convenient to sample Disadvantage Do not represent other (off road) habitats Road may attract (or repel) snakes

OR- redefine the target: Road

Sampling units- random placement

Random placement Advantages Valid statistical design Represents study area Replication allows variance estimation Disadvantage May be logistically difficult Harder to lay out May not work well in heterogeneous study areas
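One simple way to place sampling units at random is to divide the study area into a grid of candidate units and draw a simple random sample of cells without replacement. The grid dimensions, sample size, and function name below are hypothetical.

```python
import random

def select_random_units(n_rows, n_cols, n_sample, seed=None):
    """Draw a simple random sample of grid cells without replacement."""
    rng = random.Random(seed)
    all_units = [(r, c) for r in range(n_rows) for c in range(n_cols)]
    return sorted(rng.sample(all_units, n_sample))

# Hypothetical study area split into a 10 x 10 grid; survey 12 cells.
units = select_random_units(n_rows=10, n_cols=10, n_sample=12, seed=42)
for row, col in units:
    print(f"survey cell row={row} col={col}")
```

Fixing the seed makes the draw reproducible, so others can see exactly how the units were chosen.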

Stratified sampling

Advantages Controls for heterogeneous study area Allows estimation of density by strata More precise estimate of overall density Disadvantages More complex design May require larger total sample
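A minimal sketch of how stratum-level densities combine into an overall estimate: each stratum mean is weighted by the stratum's share of the study area, and the variances combine with squared weights. The strata, weights, and densities are invented, and the finite population correction is ignored for simplicity.

```python
import statistics

def stratified_estimate(strata):
    """Return (overall mean density, standard error) from stratum samples.

    Each stratum is a dict with 'weight' (share of study area) and
    'densities' (per-plot densities sampled within the stratum).
    """
    total_weight = sum(s["weight"] for s in strata)
    mean = 0.0
    variance = 0.0
    for s in strata:
        w = s["weight"] / total_weight
        y = s["densities"]
        mean += w * statistics.fmean(y)
        variance += w ** 2 * statistics.variance(y) / len(y)
    return mean, variance ** 0.5

# Hypothetical strata: forest covers 60% of the area, grassland 40%.
strata = [
    {"name": "forest", "weight": 0.6, "densities": [4, 5, 6, 5]},
    {"name": "grassland", "weight": 0.4, "densities": [1, 0, 2, 1]},
]
overall_mean, overall_se = stratified_estimate(strata)
print(f"overall density={overall_mean:.2f}  SE={overall_se:.3f}")
```

Because the plots within each stratum are similar to one another, the standard error here is smaller than it would be if the same eight plots were treated as one unstratified sample.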

Single, unreplicated line

Are these hard “rules”? NO! Some violations of assumptions can be OK, and even necessary (idea of “robustness”) These are ideals to strive toward Good if you can achieve them If you can’t, you can’t, but study results may need different interpretation

Estimation: from Count Data to Population (I) Geographic variation (can’t look everywhere) Frequently counts/observations cannot be conducted over entire area of interest Proper inference requires a spatial sampling design that permits inference about entire area, based on a sample

A valid sampling design Allows valid probability inference about the population Statistical model Allows estimates of precision Replication, independence

Other Spatial Sampling Designs Systematic sampling Can approximate random sampling in some cases Cluster sampling When the biological units come in clusters Double sampling Very useful for detection calibration Adaptive sampling More efficient when populations are distributed “clumpily” Dual-frame sampling
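Of the designs listed above, systematic sampling is the easiest to sketch: pick a random starting unit, then take every k-th unit after it. The unit count, sample size, and function name are hypothetical.

```python
import random

def systematic_sample(n_units, n_sample, seed=None):
    """Select every k-th unit after a random start (k = n_units // n_sample)."""
    step = n_units // n_sample
    start = random.Random(seed).randrange(step)
    return [start + i * step for i in range(n_sample)]

# Hypothetical transect with 100 candidate stations; survey 10 of them.
stations = systematic_sample(n_units=100, n_sample=10, seed=7)
print(stations)
```

The random start is what lets a systematic design approximate random sampling; without it the same stations would be chosen every time.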

Estimation: from Count to Population (II) Detectability (can’t see everything in places where you do look) Counts represent some unknown fraction of animals in sampled area Proper inference requires information on detection probability
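The simplest form of this correction divides the raw count by an estimate of detection probability. The numbers and the function name below are illustrative assumptions; in practice the detection probability comes from one of the methods named earlier (distance sampling, multiple observers, capture-recapture).

```python
def abundance_from_count(count, p_hat):
    """Adjust a raw count for incomplete detection: N_hat = count / p_hat."""
    if not 0 < p_hat <= 1:
        raise ValueError("p_hat must be in (0, 1]")
    return count / p_hat

# Hypothetical: 30 animals counted, detection probability estimated at 0.6.
n_hat = abundance_from_count(count=30, p_hat=0.6)
print(f"estimated abundance in sampled area: {n_hat:.1f}")
```

Ignoring detection is the same as assuming p = 1, which makes the raw count a biased (low) estimate of abundance whenever some animals are missed.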

Sampling Take Home Messages Field sampling must be designed to meet study or conservation objectives Field sampling must be representative of the population of inference Incomplete detection MUST be accounted for in sampling and estimation

Occupancy Estimation Species status = present or absent Coarse measure of population status Proportion of occupied patches Data can be collected efficiently over large spatial and temporal extents Species and community-level dynamics

Occupancy Estimation: Uses Surveys of geographic range Habitat relationships Metapopulation dynamics Observed colonization and extinction Extensive monitoring programs: 'trends' or changes in occupancy over time

Species Occurrence Conduct “presence-absence” (detection-nondetection) surveys. Estimate what fraction of sites (or area) is occupied by a species when species is not always detected with certainty, even when present ( p < 1). ‘Site’: Arbitrarily defined spatial unit (forest patch of a specified size) or discrete naturally occurring sampling units (ponds).

Site occupancy: A solution MacKenzie et al. (Ecology) Key design issues: Replication Temporal replication: repeat visits to sample units Replicate visits occur within a relatively short period of time (e.g., a breeding season) Spatial replication: randomly selected ‘sites’ or sample units within area of interest

Basic Sampling Scheme: Single Season s sites are surveyed, each at k distinct sampling occasions. Species is detected/not detected at each occasion.

Necessary information: Data summary → Detection histories Detection history: Record for each visited site or sample unit 1 denotes detection 0 denotes nondetection Example detection history: h_i = 1 0 0 1 0 Denotes 5 visits to the site Target species detected during visits 1 and 4 0 does not necessarily mean the species was absent Not detected, but could be there!
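Detection histories are easy to hold as strings of 1s and 0s, one per site. The sketch below uses five hypothetical sites (the first was detected on visits 1 and 4 of 5) and computes the naive occupancy estimate: the fraction of sites with at least one detection.

```python
def parse_history(text):
    """Convert a string such as '10010' into a list of 0/1 integers."""
    return [int(ch) for ch in text]

def naive_occupancy(histories):
    """Fraction of sites where the species was detected at least once."""
    detected = sum(1 for h in histories if any(h))
    return detected / len(histories)

# Hypothetical detection histories for 5 sites, 5 visits each.
raw = ["10010", "00000", "01100", "00000", "00010"]
histories = [parse_history(r) for r in raw]

print("visits with detection at site 1:",
      [i + 1 for i, x in enumerate(histories[0]) if x == 1])
print("naive occupancy:", naive_occupancy(histories))
```

The naive estimate treats every all-zero history as a true absence, so it tends to underestimate occupancy whenever detection probability is below 1.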

Model Parameters: Single-Season Models

Model assumptions Sites are closed to changes in occupancy state between sampling occasions No heterogeneity that cannot be explained by covariates The detection process is independent at each site (sites > 500 meters apart)
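Under these assumptions, the simplest single-season model has two parameters: an occupancy probability (called psi here) and a detection probability p, both constant across sites and visits. The sketch below writes out that likelihood and maximizes it with a coarse grid search. The data are invented, the names are hypothetical, and the grid search stands in for the numerical optimizer a real program would use; it is not a description of how PRESENCE works internally.

```python
import math

def log_likelihood(psi, p, histories):
    """Log-likelihood of the constant-psi, constant-p single-season model."""
    total = 0.0
    for h in histories:
        k, d = len(h), sum(h)
        if d > 0:
            # Site is certainly occupied: d detections in k visits.
            site = psi * p ** d * (1 - p) ** (k - d)
        else:
            # Either occupied but never detected, or truly unoccupied.
            site = psi * (1 - p) ** k + (1 - psi)
        total += math.log(site)
    return total

def fit_grid(histories, steps=99):
    """Return (psi_hat, p_hat) maximizing the likelihood on a 0.01 grid."""
    grid = [(i + 1) / (steps + 1) for i in range(steps)]
    best = max((log_likelihood(psi, p, histories), psi, p)
               for psi in grid for p in grid)
    return best[1], best[2]

# Hypothetical data: 10 sites, 3 visits each.
histories = [
    [1, 0, 1], [0, 1, 0], [1, 1, 0], [0, 0, 0], [0, 0, 0],
    [0, 0, 1], [1, 0, 0], [0, 0, 0], [0, 1, 1], [0, 0, 0],
]
naive = sum(1 for h in histories if any(h)) / len(histories)
psi_hat, p_hat = fit_grid(histories)
print(f"naive occupancy={naive:.2f}  psi_hat={psi_hat:.2f}  p_hat={p_hat:.2f}")
```

The estimated occupancy comes out higher than the naive proportion of sites with detections, because the model allows some of the all-zero sites to be occupied but missed.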

Timing of repeated surveys Usually conducted as multiple discrete visits (e.g., on different days) Can also use multiple surveys within a single visit Multiple independent observers Potentially introduce heterogeneity into data Single visit to each site vs. multiple visits to each site Rotate observers amongst sites on each day Rotate order each site is sampled within a day

Designing occupancy surveys Several important issues to consider: 1. Clear objectives that are explicitly linked to science or management 2. Selection of sampling units Probabilistic sampling design Size of unit relative to species of interest 3. Timing of repeat surveys “closed” Relaxed for lab project 4. Allocation of survey effort Survey all of the sites equal number of times?

Getting To Know

PRESENCE PRESENCE is software that has been developed to apply these models to collected data. Within PRESENCE you can fit multiple models to your data. PRESENCE stores the results from each model and presents a summary of the results in a model selection table using AIC.

PRESENCE The analysis is stored in a project file (created from the File menu). A project consists of 3 files: *.pao, *.pa2 and *.pa2.out. *.pao is the data file. *.pa2 stores a summary of the models fit to the data. *.pa2.out stores the full results for all the models.

PRESENCE consists of 2 main windows Number crunching window Point and click window

When you create a new project, you must specify the data file (if previously created), or input the data to be analysed. Once the data file has been defined and selected, the filename for the project file will be the same as the data file.

To enter data specify the number of sites, survey occasions, site-specific and survey-specific (sampling) covariates. Then select the Input Data Form. The No. Occasions/season box is used for multi-season data. You must list the number of surveys per season, separated with a comma.

Data can be copied and pasted (via the menus only) from a spreadsheet into each respective tab. You can also enter data directly, or insert from a comma delimited text (.csv) file. Note the number of PRESENCE-related windows now open.

Once data has been entered, you must save the data before closing the window!

After saving your data and closing the data window, check that the correct data filename appears here. If not, you will have to select the file manually. Make sure you click OK before proceeding.

After setting up your project, an empty Results Browser window should appear. Make sure you see this before attempting to run any models! The type of analysis to perform is selected from the run menu.