Grand Overview Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias is key to.

Slides:



Advertisements
Similar presentations
Which Test? Which Test? Explorin g Data Explorin g Data Planning a Study Planning a Study Anticipat.
Advertisements

Assumptions underlying regression analysis
Lesson 10: Linear Regression and Correlation
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Sampling Distributions (§ )
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
ENVS 355 Data, data, data Models, models, models.
BCOR 1020 Business Statistics Lecture 15 – March 6, 2008.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Overview of 355 Themes and Concepts Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias.
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Grand Overview Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias is key to.
SIMPLE LINEAR REGRESSION
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Correlation and Regression Analysis
Correlation & Regression
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
Simple Linear Regression
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Managing Software Projects Analysis and Evaluation of Data - Reliable, Accurate, and Valid Data - Distribution of Data - Centrality and Dispersion - Data.
(a.k.a: The statistical bare minimum I should take along from STAT 101)
ENVS 355 Data, data, data Models, models, models Policy, policy, policy.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Quantitative Skills 1: Graphing
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Statistics. Key statistics and their purposes Chi squared test: determines if a data set is random or accounted for by an unwanted variable Standard deviation:
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Sampling and Sample Size Part 1 Cally Ardington. Course Overview 1.What is Evaluation? 2.Outcomes, Impact, and Indicators 3.Why Randomise? 4.How to Randomise?
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Introductory Statistics. Learning Objectives l Distinguish between different data types l Evaluate the central tendency of realistic business data l Evaluate.
June 11, 2008Stat Lecture 10 - Review1 Midterm review Chapters 1-5 Statistics Lecture 10.
LECTURE 3: ANALYSIS OF EXPERIMENTAL DATA
Question paper 1997.
Inferences from sample data Confidence Intervals Hypothesis Testing Regression Model.
Statistical Analysis Topic – Math skills requirements.
Overview of 355 Themes and Concepts Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias.
Statistical inference Statistical inference Its application for health science research Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics.
Intro to Psychology Statistics Supplement. Descriptive Statistics: used to describe different aspects of numerical data; used only to describe the sample.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
Marginal Distribution Conditional Distribution. Side by Side Bar Graph Segmented Bar Graph Dotplot Stemplot Histogram.
WARM UP: Penny Sampling 1.) Take a look at the graphs that you made yesterday. What are some intuitive takeaways just from looking at the graphs?
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Descriptive and Inferential Statistics
Anticipating Patterns Statistical Inference
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Review 1. Describing variables.
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
PCB 3043L - General Ecology Data Analysis.
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Essential Statistics (a.k.a: The statistical bare minimum I should take along from STAT 101)
Everyone thinks they know this stuff
Continuous Statistical Distributions: A Practical Guide for Detection, Description and Sense Making Unit 3.
Sampling Distributions (§ )
Introductory Statistics
Presentation transcript:

Grand Overview Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias is key to implementing good policy

Goals of this Course To gain practice in how to frame a problemTo gain practice in how to frame a problem To practice making toy models involving data organization and presentationTo practice making toy models involving data organization and presentation To understand the purpose of making a modelTo understand the purpose of making a model To understand the limitations of modeling and that models differ mostly in the precision of predictions madeTo understand the limitations of modeling and that models differ mostly in the precision of predictions made Provide you with a mini tool kit for analysisProvide you with a mini tool kit for analysis

Sequence for Environmental Data Analysis Conceptualization of the problem  which data is most important to obtainConceptualization of the problem  which data is most important to obtain Methods and limitations of data collection  know you biasesMethods and limitations of data collection  know you biases Presentation of Results => data organization and reduction; data visualization; statistical analysisPresentation of Results => data organization and reduction; data visualization; statistical analysis Comparing different modelsComparing different models

Three Problems with Environmental Data Its usually very noisyIts usually very noisy It is often unintentionally biased because the wrong variables are being measured to address the problem in question.It is often unintentionally biased because the wrong variables are being measured to address the problem in question. A control sample is usually not available.A control sample is usually not available.

Some Tools Linear Regression  predictive power lies in scatter Slope errors are important Identify anomalous points by sigma clipping (1-cycle) Learn to use the regression tool in Excel Least squares method used for best fit determination

More Tools Chi square test Understand how to determine your expected frequencies Two chi square statistic requires marginal sum calculations Chi square statistic used to accept or reject the null hypothesis Know how to compute it

Estimation Techniques Extremely useful skill  makes you valuable Extremely useful skill  makes you valuable Devise an estimation plan  what factors do you need to estimate Devise an estimation plan  what factors do you need to estimate Scale from familiar examples when possible Scale from familiar examples when possible Perform a reality check on your estimate Perform a reality check on your estimate

Global Warming I

Global Warming II Understand basics of “greenhouse effect” Understand basics of “greenhouse effect” Ice core data and lag time issue Ice core data and lag time issue What are best indicators of global climate change What are best indicators of global climate change Why is global mean temperature a poor proxy Why is global mean temperature a poor proxy Spatial distribution of temperature changes is most revealing Spatial distribution of temperature changes is most revealing

Global Warming III Why is methane such a potential problem? Why is methane such a potential problem? What are anthropogenic sources of methane emission and how can they be curtailed What are anthropogenic sources of methane emission and how can they be curtailed What is the hydrate problem? What is the hydrate problem? What are some other smoking guns for global warming/climate change? What are some other smoking guns for global warming/climate change? 120 Tornadoes Touch down March 12, Tornadoes Touch down March 12, 2006

Trend Extrapolation Techniques

Trend Estimation Exponential vs linear models Exponential vs linear models Exponential Exhaustion Timescales Exponential Exhaustion Timescales Why R doesn’t matter so much Why R doesn’t matter so much Why is exhaustion timescale driven mostly by the consumption rate, k Why is exhaustion timescale driven mostly by the consumption rate, k Exponential doubling times Exponential doubling times

The Importance of Trend Extrapolation

Statistical Distributions Why are they useful? Why are they useful? How to construct a frequency distribution and/or a histogram of events. How to construct a frequency distribution and/or a histogram of events. Frequencies are probabilities Frequencies are probabilities How the law of large numbers manifests itself  central limit theorem; random walk; expectation values How the law of large numbers manifests itself  central limit theorem; random walk; expectation values

Comparing Distributions Why?  to identify potential differences and environmental drivers KS test  uses the entire distribution by comparing cumulative frequency distributions (cfd)  more powerful than tests based on means and standard deviations (e.g. Z-test; t- test) KS test is excellent for testing observed distribution for normality (Excel: random number generator  normal distribution)

Predator Prey Relations Non linear in nature  small changes in one part of the system can produce rapid population crashes Non linear in nature  small changes in one part of the system can produce rapid population crashes Density dependent time lags are important Density dependent time lags are important “Equilibrium” is intrinsically unstable “Equilibrium” is intrinsically unstable Logistic growth curve makes use of carrying capacity concept, K Logistic growth curve makes use of carrying capacity concept, K Negative feedback occurs as you approach K Negative feedback occurs as you approach K R selected vs. K selected mammals R selected vs. K selected mammals

Human Population Projections What assumptions are used? What assumptions are used? Does human population growth respond to the carrying capacity concept? Does human population growth respond to the carrying capacity concept? World population growth rate is in continuous decline (but still positive)  will this continue indefinitely? World population growth rate is in continuous decline (but still positive)  will this continue indefinitely? What role does increased life expectancy have?  changing population pyramids What role does increased life expectancy have?  changing population pyramids

Non Normal Distributions Positive and Negative skewness  median value more relevant than mean Positive and Negative skewness  median value more relevant than mean Bi modal  sum of two normal distributions if the peaks are well separated Bi modal  sum of two normal distributions if the peaks are well separated Poisson Distribution for discrete arrival events  review this Poisson Distribution for discrete arrival events  review this Exponential Distribution for continuous arrival events Exponential Distribution for continuous arrival events

Applied Ecology  Know what the terms mean and understand what an iterative solution is:

Applied Ecology II  Understand from the point of view of the framework (e.g. the equations) why stability is very hard to achieve  What role does finite reproductive age play?  What makes human growth special within this framework.  Understand concepts of equilibrium occupancy and demographic potential  Why is error assessment so important here?

Probabilistic Outcomes  Why is “natural selection” best described in this way?  What parameters determine the outcomes?  What are the differences between stabilization, directional, and disruptive forms of evolution?

Techniques for Dealing with Noisy Data Boxcar smoothing (moving average) Boxcar smoothing (moving average) Exponential smoothing Exponential smoothing Binning the data into two groups and comparing means via the Z-test (e.g. rainfall broken up into two distinct time periods) Binning the data into two groups and comparing means via the Z-test (e.g. rainfall broken up into two distinct time periods) Construction of a waveform and comparison of waveforms Construction of a waveform and comparison of waveforms

The Data Rules Always, always ALWAYS plot your data Always, always ALWAYS plot your data Never, never NEVER put data through some blackbox reduction routine without examining the data themselves Never, never NEVER put data through some blackbox reduction routine without examining the data themselves The average of some distribution is not very meaningful unless you also know the dispersion. Always calculate the dispersion and then know how to use it! The average of some distribution is not very meaningful unless you also know the dispersion. Always calculate the dispersion and then know how to use it!

More Data Rules Always compute the level of significance when comparing two distributions Always compute the level of significance when comparing two distributions Always know your measuring errors. If you don't then you are not doing science. Always know your measuring errors. If you don't then you are not doing science. Always calculate the dispersion in any correlative analysis. Remember that a correlation is only as good as the dispersion of points around the fitted line. Always calculate the dispersion in any correlative analysis. Remember that a correlation is only as good as the dispersion of points around the fitted line.

The Biggest Rules Always require someone to back up their "belief statements" with credible data. Always require someone to back up their "belief statements" with credible data. Change the world. Stop being a passive absorber of some one else's belief system. Change the world. Stop being a passive absorber of some one else's belief system. Frame all environmental problems objectively and seek reliable data to resolve conflicts and make policy Frame all environmental problems objectively and seek reliable data to resolve conflicts and make policy