Overview of 355 Themes and Concepts Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias.

Slides:



Advertisements
Similar presentations
Design of Experiments Lecture I
Advertisements

1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
What is MPC? Hypothesis testing.
Engineering the Planet What Compels us to do so?.
Inconvenient Truths and Uncertain Futures Summary of HC 434: Physics and Politics of Global Climate Change.
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
Engineering the Planet What Compels us to do so?.
ENVS 355 Data, data, data Models, models, models.
Grand Overview Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias is key to.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Software Quality Control Methods. Introduction Quality control methods have received a world wide surge of interest within the past couple of decades.
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 6: Correlation.
Grand Overview Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias is key to.
Planet Earth. A Century of Change (1900 (=1) vs 2000) Industrial Output: 40 Industrial Output: 40 Marine Fish Catch: 35 Marine Fish Catch: 35 CO 2 Emissions:
Climate Literacy Summation. Inconvenient Truths and Uncertain Futures Summary of HC 434: Physics and Politics of Global Climate Change Manageable BAD.
Today Concepts underlying inferential statistics
Correlation and Regression Analysis
Unit 2: Population.
Chapter 7 Forecasting with Simple Regression
Introduction to Regression Analysis, Chapter 13,
1 Simple Linear Regression 1. review of least squares procedure 2. inference for least squares lines.
Elec471 Embedded Computer Systems Chapter 4, Probability and Statistics By Prof. Tim Johnson, PE Wentworth Institute of Technology Boston, MA Theory and.
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Inference for regression - Simple linear regression
Chapter 10 Hypothesis Testing
Hypothesis Testing in Linear Regression Analysis
Chapter 12: Simulation and Modeling
Forecasting and Statistical Process Control MBA Statistics COURSE #5.
Graphical Analysis. Why Graph Data? Graphical methods Require very little training Easy to use Massive amounts of data can be presented more readily Can.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Analyzing Reliability and Validity in Outcomes Assessment (Part 1) Robert W. Lingard and Deborah K. van Alphen California State University, Northridge.
ENVS 355 Data, data, data Models, models, models Policy, policy, policy.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Role of Statistics in Geography
Basic Probability (Chapter 2, W.J.Decoursey, 2003) Objectives: -Define probability and its relationship to relative frequency of an event. -Learn the basic.
1 Statistical Distribution Fitting Dr. Jason Merrick.
No criminal on the run The concept of test of significance FETP India.
Next Colin Clarke-Hill and Ismo Kuhanen 1 Analysing Quantitative Data 1 Forming the Hypothesis Inferential Methods - an overview Research Methods Analysing.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Introductory Statistics. Learning Objectives l Distinguish between different data types l Evaluate the central tendency of realistic business data l Evaluate.
C M Clarke-Hill1 Analysing Quantitative Data Forming the Hypothesis Inferential Methods - an overview Research Methods.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
1 William P. Cunningham University of Minnesota Mary Ann Cunningham Vassar College Chapter 02 Lecture Outline Copyright © McGraw-Hill Education. All rights.
Statistical Process Control04/03/961 What is Variation? Less Variation = Higher Quality.
Inferences from sample data Confidence Intervals Hypothesis Testing Regression Model.
Overview of 355 Themes and Concepts Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Research Methodology Lecture No :32 (Revision Chapters 8,9,10,11,SPSS)
Opening Questions Unit 1. Chi Square Test Practice A researcher wants to know if there is a different number of bat sightings a dusk(early evening) or.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Statistical tests for quantitative variables
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Chapter 10 Verification and Validation of Simulation Models
Environmental Science 101
Science and Sustainability: An Introduction to Environmental Science
Everyone thinks they know this stuff
Regression Analysis Week 4.
Introduction to Instrumentation Engineering
Planet Earth.
What determines Sex Ratio in Mammals?
Statistical Thinking and Applications
Introductory Statistics
Presentation transcript:

Overview of 355 Themes and Concepts Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias is key to implementing good policy Making a model of the data is an advanced technique that is sorely needed in this field

Goals of this Course To gain practice in how to frame a problemTo gain practice in how to frame a problem To practice making toy models of various data waveformsTo practice making toy models of various data waveforms To understand the purpose of making a modelTo understand the purpose of making a model To understand the limitations of modeling and that models differ mostly in the precision of predictions madeTo understand the limitations of modeling and that models differ mostly in the precision of predictions made Provide you with a mini tool kit for analysisProvide you with a mini tool kit for analysis

Sequence for Environmental Data Analysis Conceptualization of the problem  which data is most important to obtain; how to obtain a random/representative data set?Conceptualization of the problem  which data is most important to obtain; how to obtain a random/representative data set? Methods and limitations of data collection  know your biases (e.g. Sunshine Moonbeam)Methods and limitations of data collection  know your biases (e.g. Sunshine Moonbeam) Presentation of Results => data organization and reduction; data visualization; statistical analysisPresentation of Results => data organization and reduction; data visualization; statistical analysis Compare different modelsCompare different models

Statistical Distributions Why are they useful?Why are they useful? How to construct a frequency distribution and/or a histogram of events.How to construct a frequency distribution and/or a histogram of events. Frequencies are probabilitiesFrequencies are probabilities How the law of large numbers manifests itself  central limit theorem; random walk; expectation valuesHow the law of large numbers manifests itself  central limit theorem; random walk; expectation values

Statistical Distributions Mapping dispersion units into probabilities of an event occurring

Some Tools Linear Regression  predictive power lies in scatter; the “r” value is unimportant for scientific analysis Slope errors (cell C18 in Excel) are important and must be factored in to determine the total uncertainty of your prediction Identify anomalous points by sigma clipping (+/- 1.8  (1-cycle)

More Tools Chi square test – measures goodness of fit Understand how to determine your expected frequencies Chi square minimization used to find best fitting model Chi square statistic used to accept or reject the null hypothesis (that the data is consistent with the model plus random fluctuations)

More Tools Moving average technique applied to noisy data Z-test: determine significance between two mean values for two distributions

KS Test Most powerful for comparing two distributions Statistic is the maximum difference between 2 cumulative frequency distributions Data does not need to be normally distributed Best means to compare data distribution against a model Can’t be used for sample sizes less than 10

Arrival Statistics (Poisson) Events have to be discrete A good measure of the average event rate allows the probability that N events will occur over some time period to be determined Large values of  produces a distribution that is normal.

Green House Effect Long wavelength absorption properties of our atmosphere increase the surface temperature- Water vapor is the dominant effect, followed by CO2

Methane Potential role of methane is larger than CO2 GWP = 21 Scales with population growth Released from permafrost Released from hydrate deposits Emissions now rising again due to global wetlands returning from prolonged drought

Difficulty of Climate Change Detection Data is noisy Temporal baseline of data is not long enough Multi decadal climate cycles seem to be very important Oceans act as a buffer that delays the overall effect

Predator Prey Relations Non linear in nature  small changes in one part of the system can produce rapid population crashes Non linear in nature  small changes in one part of the system can produce rapid population crashes Density dependent time lags are important (what causes them?) Density dependent time lags are important (what causes them?) “Equilibrium” is intrinsically unstable “Equilibrium” is intrinsically unstable Logistic growth curve makes use of carrying capacity concept, K Logistic growth curve makes use of carrying capacity concept, K Negative feedback occurs as you approach K Negative feedback occurs as you approach K R selected vs. K selected mammals R selected vs. K selected mammals

P vs H P vs H Understand why graphical representations look like this: Understand why graphical representations look like this: What drives the lag time? What drives the lag time?

Human Population Projections What assumptions are used? Does human population growth respond to the carrying capacity concept? World population growth rate is in continuous decline (but still positive)  will this continue indefinitely? Oscillatory model may be most realistic What role does increased life expectancy have? 

Estimation Techniques Extremely useful skill  makes you valuable Devise an estimation plan  what factors do you need to estimate Scale from familiar examples when possible Perform a reality check on your estimate

Applied Ecology  Know what the terms mean and understand what an iterative solution is:

Applied Ecology II  Understand from the point of view of the framework (e.g. the equations) why stability is very hard to achieve  What role does finite reproductive age play?  What makes human growth special within this framework.  Understand concepts of equilibrium occupancy and demographic potential  Why is error assessment so important here?

Skewed Distributions This is a probability distribution function and one can still use the area under the curve or area between x values to determine probabilities via numerical integration

Time Series Analysis  Much of environmental data analysis or modeling represents the time evolution of some observed quantity.  Long term trends with cyclical oscillations and/or short term regular deviations; plus random variations

Value of time Series Analysis You want to uncover the long term trend that may be buried under the fluctuations Determining the amplitude of the fluctuations helps to determine if any recent events are aberrant Two cases: Gas prices; Climate Cycles: Gas Prices: The long term trend is steep and rises above the fluctuations Climate: The long term trend is overwhelmed by the fluctuations

Multiple Sine Wave Fits Can often reproduce the behavior seen in complex time series Can often reproduce the behavior seen in complex time series

The Data Rules Always, always ALWAYS plot your data Always, always ALWAYS plot your data Never, never NEVER put data through some blackbox reduction routine without examining the data themselves Never, never NEVER put data through some blackbox reduction routine without examining the data themselves The average of some distribution is not very meaningful unless you also know the dispersion. Always calculate the dispersion and then know how to use it! The average of some distribution is not very meaningful unless you also know the dispersion. Always calculate the dispersion and then know how to use it! The Average value for this data set is totally meaningless

More Data Rules Always compute the level of significance when comparing two distributions Always compute the level of significance when comparing two distributions Always know your measuring errors. If you don't them you are not doing science. Always know your measuring errors. If you don't them you are not doing science. Always calculate the dispersion in any correlative analysis. Remember that a correlation is only as good as the dispersion of points around the fitted line. Always calculate the dispersion in any correlative analysis. Remember that a correlation is only as good as the dispersion of points around the fitted line.

The Biggest Rules Always require someone to back up their "belief statements" with credible data. Always require someone to back up their "belief statements" with credible data. Change the world. Stop being a passive absorber of some one else's belief system. Change the world. Stop being a passive absorber of some one else's belief system. Frame all environmental problems objectively and seek reliable data to resolve conflicts and make policy Frame all environmental problems objectively and seek reliable data to resolve conflicts and make policy

And Now For Something Completely Different: Global climate change, species extinction, oil depletion, world food crises, global inequity, environmental justice, depletion of mineral resources, blogs, sustainability, alternative energy solutions, alternative fuels, more blogs, Obama, Hillary, McCain, whatever … Global climate change, species extinction, oil depletion, world food crises, global inequity, environmental justice, depletion of mineral resources, blogs, sustainability, alternative energy solutions, alternative fuels, more blogs, Obama, Hillary, McCain, whatever … WTF? How did all of this happen? WTF? How did all of this happen?

Your World Upon Graduation The Fossil Fuel Legacy The Fossil Fuel Legacy

Engineering the Planet What Compels us to do so?

Consumption: Pros and Cons This depends on how you want to index consumption – personal consumption/affluence is different than production/consumption that indirectly leads to better society infrastructure and services. What matters is the rate of consumption relative to the resource base. Main problem is that short term market growth, which we value, wants high rates. Sustainability demands lower rates  this is the clash of values.

Key Historical Moments We are special (different than other animals) We are special (different than other animals) We are uniquely positioned at the center of the Universe (reflects our “special-ness”) We are uniquely positioned at the center of the Universe (reflects our “special-ness”) The Universe is ordered, logical and rational – Age of Reason  humankind is unbounded The Universe is ordered, logical and rational – Age of Reason  humankind is unbounded The Newtonian world shows us the machine and it is precise (we can now engineer the planet) The Newtonian world shows us the machine and it is precise (we can now engineer the planet) The notion of uncertainty, as a valid and integral scientific concept, arises too late in this process  we already have truth pathways established The notion of uncertainty, as a valid and integral scientific concept, arises too late in this process  we already have truth pathways established

Essence of Science Knowledge based on measurement means that knowledge is both uncertain and subject to change when new and better measurements are made – there is no room for absolute truth in this methodology Knowledge based on measurement means that knowledge is both uncertain and subject to change when new and better measurements are made – there is no room for absolute truth in this methodology Problems can then only be solved by objective means that rely on real data and not bias or wishful thinking. Problems can then only be solved by objective means that rely on real data and not bias or wishful thinking.

Choice Pathways Which world does humanity want to live in? One that is based on a belief system that is then projected on to the natural world to support that belief (this is the BIAS) One where scientific methodology and thinking is used to enable, on a planet wide scale, the enlightenment motto that all men are created equal

Relationship with the Land is key Three possibilities The Land is Sacred  “Indigienous Model” your ancestors are buried in it The Land is shared  “European Model”  lots of people, not much land The Land is Owned  “American Model”  lots of land, I can piss on it if I want, afterall, its mine.

Continued Economic Development Requires high Energy Use 1900  100 Million Capitalists to build markets 2003  2.5 Billion new capitalists Energy is the core of the “environmental problem”; Environment is the core of the energy problem The energy-environment intersection is the core of the sustainable-prosperity problem

Resolution? We need to stop be driven by market economics and start to recognize that energy and environment is a shared resource. 20 Million college students should march on Washington demanding this: