Trend detection- birds Marian Scott Dept of Statistics, University of Glasgow Glasgow, Sept 2007.

Slides:



Advertisements
Similar presentations
Variation, uncertainties and models Marian Scott School of Mathematics and Statistics, University of Glasgow June 2012.
Advertisements

Time series modelling and statistical trends
Environmental change and statistical trends – some examples Marian Scott Dept of Statistics, University of Glasgow NERC August 2012.
Environmental change and statistical trends – some examples Marian Scott Dept of Statistics, University of Glasgow NERC September 2010.
Environmental change and statistical trends – some examples
Environmental change and statistical trends – some examples Marian Scott Dept of Statistics, University of Glasgow NERC September 2011.
Measurement and assessment of change What it the status quo in environmental science? In time – A simple trend line – A p-value or a 95% confidence interval.
Assumptions underlying regression analysis
A.S. 3.8 INTERNAL 4 CREDITS Time Series. Time Series Overview Investigate Time Series Data A.S. 3.8 AS91580 Achieve Students need to tell the story of.
Very simple to create with each dot representing a data value. Best for non continuous data but can be made for and quantitative data 2004 US Womens Soccer.
Object Orie’d Data Analysis, Last Time •Clustering –Quantify with Cluster Index –Simple 1-d examples –Local mininizers –Impact of outliers •SigClust –When.
Residuals.
Chapter 3: Describing Relationships
Penguin Parade. Quantitative description [A] – The linear equation is y = x – On average, the number of penguins marching is decreasing.
(Day 1).  So far, we have used histograms to represent the overall shape of a distribution. Now smooth curves can be used:
Business Statistics - QBM117 Scatter diagrams and measures of association.
1 Assessing Normality and Data Transformations Many statistical methods require that the numeric variables we are working with have an approximate normal.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Business Statistics - QBM117 Statistical inference for regression.
Graphing in Science.
Density Curves and Normal Distributions
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
How do scientists show the results of investigations?
Handling Data and Figures of Merit Data comes in different formats time Histograms Lists But…. Can contain the same information about quality What is meant.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Hypothesis Testing Quantitative Methods in HPELS 440:210.
Looking at data: relationships Scatterplots IPS chapter 2.1 © 2006 W. H. Freeman and Company.
Section 2.4 Representing Data.
AP Statistics Introduction & Chapter 1.1 Variables, Distributions & Graphs Goals: What will we know and be able to do as a result of today’s Lesson?
Line graphs are a powerful tool in science.
Non-Linear Regression. The data frame trees is made available in R with >data(trees) These record the girth in inches, height in feet and volume of timber.
Chapter 6 The Normal Curve. A Density Curve is a curve that: *is always on or above the horizontal axis *has an area of exactly 1 underneath it *describes.
Max temp v min temp. It can be seen from the scatterplot that there is a correlation between max temp and min temp. Generally, as min temp increases,
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots and Correlation.
Introduction to science
Time series Model assessment. Tourist arrivals to NZ Period is quarterly.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
Dr. Serhat Eren Other Uses for Bar Charts Bar charts are used to display data for different categories where the data are some kind of quantitative.
Bivariate Data Analysis Bivariate Data analysis 4.
Correlations: Relationship, Strength, & Direction Scatterplots are used to plot correlational data – It displays the extent that two variables are related.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Multiple Regression SECTION 10.3 Variable selection Confounding variables.
4.2 Correlation The Correlation Coefficient r Properties of r 1.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
CS 235: User Interface Design November 19 Class Meeting Department of Computer Science San Jose State University Fall 2014 Instructor: Ron Mak
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
1 Chapter 2: The Normal Distribution 2.1Density Curves and the Normal Distributions 2.2Standard Normal Calculations.
Graphs in Science Section 1.5.
Example x y We wish to check for a non zero correlation.
Often the goal of an experiment is to find the relationship between two variables. As one variable changes, so does the other. Graphing is a useful way.
Essential Statistics Chapter 171 Two-Sample Problems.
Correlation  We can often see the strength of the relationship between two quantitative variables in a scatterplot, but be careful. The two figures here.
Lecture 3 – Sep 3. Normal quantile plots are complex to do by hand, but they are standard features in most statistical software. Good fit to a straight.
MODEL DIAGNOSTICS By Eni Sumarminingsih, Ssi, MM.
MATH 2311 Section 5.4. Residuals Examples: Interpreting the Plots of Residuals The plot of the residual values against the x values can tell us a lot.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots and Correlation.
HW 16 Key. 20:31 Wal-Mart. 20:31 a a.Scatterplot. Linear? No. It grows at a growing rate.
Controlled Assessment Unit - CAU Investigative Skills Assessment - ISA 7 ISA section 2 exam – Additional / Separate Science Section 2 Exam: Up to 50 minutes.
Stats Methods at IC Lecture 3: Regression.
Critical Analysis.
Making Sense of Data (Part 1)
Correlation and Regression
Numeracy in Science – Lines of best fit
Chapter 3: Describing Relationships
Chapter 3 Scatterplots and Correlation.
Chapter 3: Describing Relationships
Presentation transcript:

Trend detection- birds Marian Scott Dept of Statistics, University of Glasgow Glasgow, Sept 2007

Native wintering waders- another approach What is the trend in the numbers of birds?

The data

some other species

Step 1 Exploring the data Normally, if comparison across the species was of interest, it would be best to plot each species data on the same scale. However, the data for the Black Tailed Godwit, the Sanderling and the Grey Plover is quite different to the rest. Therefore these three should be plotted separately. This means that extra care must be taken when drawing conclusions from data on different scales.

what do they show? some variation, some extremely high values (outliers?) but also some evidence of increasing numbers over the time period. All seem to show an increase in percentages over the period, with the Godwit rising to around 5000% of its 1974 numbers by The sudden peak in the Sanderling data could even suggest that the 1983 value has been entered incorrectly. The plots seem to display a significant upward trend though, even when spike in the Sanderling plot is disregarded. However, we also need to ask if a non-linear trend may be more suitable given the non-monotonic nature of the increases.

Simple linear trends

from the linear trends The high points (outliers?) have some influence on the fitted lines, so we also need to question how robust the analysis is. For some species it seems that they could be better explained by a non-linear fit. eight of the eleven species have exhibited some sort of an upward trend relative to the 1974 baseline, the most pronounced of which seems to be in the Black Tailed Godwit. Only the Bar Tailed Godwit, the Redshank and the Dunlin have seen falling numbers since 1974,

Non-linear trends The models fit in the earlier section assume monotonicity, (i.e. always increasing or always decreasing) and some of the plots show a more complex structure. A loess curve has been fitted to each species and included on the plots. The loess curve is useful as an exploratory tool, since it smooths out some of the roughness.

non-linear trends

results The LOESS plots tracks the data more closely, but show more complex patterns, including periods of increase and decrease for many of the species, which would argue that the simple monotonic trends do not capture the full story. Is it important to know that the rate of increase from 1974 to 2004 is not described by a simple linear trend?