Overview of 355 Themes and Concepts Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias.

Slides:



Advertisements
Similar presentations
Design of Experiments Lecture I
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
What is MPC? Hypothesis testing.
Engineering the Planet What Compels us to do so?.
Inconvenient Truths and Uncertain Futures Summary of HC 434: Physics and Politics of Global Climate Change.
Integrated Assessment of Sustainability =Economic analysis +Non-market analysis.
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
Engineering the Planet What Compels us to do so?.
ENVS 355 Data, data, data Models, models, models.
Grand Overview Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias is key to.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Overview of 355 Themes and Concepts Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias.
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 6: Correlation.
Grand Overview Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias is key to.
Planet Earth. A Century of Change (1900 (=1) vs 2000) Industrial Output: 40 Industrial Output: 40 Marine Fish Catch: 35 Marine Fish Catch: 35 CO 2 Emissions:
Climate Literacy Summation. Inconvenient Truths and Uncertain Futures Summary of HC 434: Physics and Politics of Global Climate Change Manageable BAD.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Correlation and Regression Analysis
Unit 2: Population.
Chapter 12: Simulation and Modeling Invitation to Computer Science, Java Version, Third Edition.
Introduction to Regression Analysis, Chapter 13,
1 Simple Linear Regression 1. review of least squares procedure 2. inference for least squares lines.
Elec471 Embedded Computer Systems Chapter 4, Probability and Statistics By Prof. Tim Johnson, PE Wentworth Institute of Technology Boston, MA Theory and.
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Inference for regression - Simple linear regression
Hypothesis Testing in Linear Regression Analysis
Chapter 12: Simulation and Modeling
Graphical Analysis. Why Graph Data? Graphical methods Require very little training Easy to use Massive amounts of data can be presented more readily Can.
Covariance and correlation
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
ENVS 355 Data, data, data Models, models, models Policy, policy, policy.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Role of Statistics in Geography
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Basic Probability (Chapter 2, W.J.Decoursey, 2003) Objectives: -Define probability and its relationship to relative frequency of an event. -Learn the basic.
Lecture 1.2 Field work (lab work). Analysis of data.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
No criminal on the run The concept of test of significance FETP India.
Next Colin Clarke-Hill and Ismo Kuhanen 1 Analysing Quantitative Data 1 Forming the Hypothesis Inferential Methods - an overview Research Methods Analysing.
Overview of CCSS Statistics and Probability Math Alliance September 2011.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Introductory Statistics. Learning Objectives l Distinguish between different data types l Evaluate the central tendency of realistic business data l Evaluate.
C M Clarke-Hill1 Analysing Quantitative Data Forming the Hypothesis Inferential Methods - an overview Research Methods.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Data Analysis Econ 176, Fall Populations When we run an experiment, we are always measuring an outcome, x. We say that an outcome belongs to some.
1 William P. Cunningham University of Minnesota Mary Ann Cunningham Vassar College Chapter 02 Lecture Outline Copyright © McGraw-Hill Education. All rights.
Chapter 10 Verification and Validation of Simulation Models
Statistical Process Control04/03/961 What is Variation? Less Variation = Higher Quality.
Inferences from sample data Confidence Intervals Hypothesis Testing Regression Model.
Statistical Analysis Topic – Math skills requirements.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Residual Analysis Purposes –Examine Functional Form (Linear vs. Non- Linear Model) –Evaluate Violations of Assumptions Graphical Analysis of Residuals.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Research Methodology Lecture No :32 (Revision Chapters 8,9,10,11,SPSS)
Opening Questions Unit 1. Chi Square Test Practice A researcher wants to know if there is a different number of bat sightings a dusk(early evening) or.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Chapter 10 Verification and Validation of Simulation Models
Essential Statistics (a.k.a: The statistical bare minimum I should take along from STAT 101)
Everyone thinks they know this stuff
Introduction to Instrumentation Engineering
Chapter 02 Lecture Outline
What determines Sex Ratio in Mammals?
Statistical Thinking and Applications
Introductory Statistics
Presentation transcript:

Overview of 355 Themes and Concepts Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias is key to implementing good policy Making a model of the data is an advanced technique that is sorely needed in this field

Specific Concepts Review the explicit calculation of χ 2 ; conceptually understand expected frequencies. Review the explicit calculation of χ 2 ; conceptually understand expected frequencies. Know the mapping of units of standard deviation on the probability. Z-test application Know the mapping of units of standard deviation on the probability. Z-test application Know how to use the regression tool in Excel! Know how to use the regression tool in Excel! Know how to compute Poisson statistics Know how to compute Poisson statistics Know how to visually inspect a graph or waveform of data. Know how to visually inspect a graph or waveform of data. Use of the KS test and why its powerful. Use of the KS test and why its powerful.

More Concepts Understand uncertainty and confidence limits in terms of predicting the future behavior of some trend. Understand uncertainty and confidence limits in terms of predicting the future behavior of some trend. Understand what non-linearity means and how lag times work in non-linear systems (e.g. predator- prey relations). Understand what non-linearity means and how lag times work in non-linear systems (e.g. predator- prey relations). Understand how one can make an estimate. Understand how one can make an estimate. Understand counting errors sqrt(N)/N Understand counting errors sqrt(N)/N

Goals of this Course To gain practice in how to frame a problemTo gain practice in how to frame a problem To practice making toy models of various data waveformsTo practice making toy models of various data waveforms To understand the purpose of making a modelTo understand the purpose of making a model To understand the limitations of modeling and that models differ mostly in the precision of predictions madeTo understand the limitations of modeling and that models differ mostly in the precision of predictions made Provide you with a mini tool kit for analysisProvide you with a mini tool kit for analysis

Sequence for Environmental Data Analysis Conceptualization of the problem  which data is most important to obtain; how to obtain a random/representative data set?Conceptualization of the problem  which data is most important to obtain; how to obtain a random/representative data set? Methods and limitations of data collection  know your biases (e.g. Sunshine Moonbeam)Methods and limitations of data collection  know your biases (e.g. Sunshine Moonbeam) Presentation of Results => data organization, reduction, visualization, statistical analysisPresentation of Results => data organization, reduction, visualization, statistical analysis Compare different models using sound statistical tests (cause they always work)Compare different models using sound statistical tests (cause they always work)

Statistical Distributions Why are they useful?Why are they useful? How to construct a frequency distribution and/or a histogram of events.How to construct a frequency distribution and/or a histogram of events. Frequencies are probabilitiesFrequencies are probabilities How the law of large numbers manifests itself  central limit theorem; random walk; expectation valuesHow the law of large numbers manifests itself  central limit theorem; random walk; expectation values

Statistical Distributions Mapping dispersion units into probabilities of an event occurring

Some Tools Linear Regression  predictive power lies in scatter; the “r” value is unimportant for scientific analysis Slope errors (cell C18 in Excel) are important and must be factored in to determine the total uncertainty of your prediction Identify anomalous points by sigma clipping (+/- 2  (1-cycle))

More Tools Chi square test – measures goodness of fit Understand how to determine your expected frequencies Chi square minimization used to find best fitting model Chi square statistic used to accept or reject the null hypothesis (that the data is consistent with the model plus random fluctuations)

More Tools Moving average technique applied to noisy data Z-test: determine significance between two mean values for two distributions

KS Test Most powerful for comparing two distributions Statistic is the maximum difference between 2 cumulative frequency distributions Data does not need to be normally distributed Best means to compare data distribution against a model Can’t be used for sample sizes less than 10

Arrival Statistics (Poisson) Events have to be discrete A good measure of the average event rate allows the probability that N events will occur over some time period to be determined Large values of  produces a distribution that is normal.

Green House Effect Long wavelength absorption properties of our atmosphere increase the surface temperature- Water vapor is the dominant effect, followed by CO2

Methane Potential role of methane is larger than CO2 GWP = 21 Scales with population growth Released from permafrost Released from hydrate deposits Emissions now rising again due to global wetlands returning from prolonged drought

Difficulty of Climate Change Detection Data is noisy Temporal baseline of data is not long enough Multi decadal climate cycles seem to be very important Oceans act as a buffer that delays the overall effect

Predator Prey Relations Non linear in nature  small changes in one part of the system can produce rapid population crashes Non linear in nature  small changes in one part of the system can produce rapid population crashes Density dependent time lags are important (what causes them?) Density dependent time lags are important (what causes them?) “Equilibrium” is intrinsically unstable “Equilibrium” is intrinsically unstable Logistic growth curve makes use of carrying capacity concept, K Logistic growth curve makes use of carrying capacity concept, K Negative feedback occurs as you approach K Negative feedback occurs as you approach K R selected vs. K selected mammals R selected vs. K selected mammals

P vs H P vs H Understand why graphical representations look like this: Understand why graphical representations look like this: What drives the lag time? What drives the lag time?

Human Population Projections What assumptions are used? Does human population growth respond to the carrying capacity concept? World population growth rate is in continuous decline (but still positive)  will this continue indefinitely? Oscillatory model may be most realistic What role does increased life expectancy have? 

Estimation Techniques Extremely useful skill  makes you valuable Devise an estimation plan  what factors do you need to estimate Scale from familiar examples when possible Perform a reality check on your estimate

Applied Ecology  Know what the terms mean and understand what an iterative solution is:

Applied Ecology II  Understand from the point of view of the framework (e.g. the equations) why stability is very hard to achieve  What role does finite reproductive age play?  What makes human growth special within this framework.  Understand concepts of equilibrium occupancy and demographic potential  Why is error assessment so important here?

Skewed Distributions This is a probability distribution function and one can still use the area under the curve or area between x values to determine probabilities via numerical integration

Time Series Analysis  Much of environmental data analysis or modeling represents the time evolution of some observed quantity.  Long term trends with cyclical oscillations and/or short term regular deviations; plus random variations

Value of time Series Analysis You want to uncover the long term trend that may be buried under the fluctuations Determining the amplitude of the fluctuations helps to determine if any recent events are aberrant Two cases: Gas prices; Climate Cycles: Gas Prices: The long term trend is steep and rises above the fluctuations Climate: The long term trend is overwhelmed by the fluctuations

Multiple Sine Wave Fits Can often reproduce the behavior seen in complex time series Can often reproduce the behavior seen in complex time series

The Data Rules Always, always ALWAYS plot your data Always, always ALWAYS plot your data Never, never NEVER put data through some blackbox reduction routine without examining the data themselves Never, never NEVER put data through some blackbox reduction routine without examining the data themselves The average of some distribution is not very meaningful unless you also know the dispersion. Always calculate the dispersion and then know how to use it! The average of some distribution is not very meaningful unless you also know the dispersion. Always calculate the dispersion and then know how to use it! The Average value for this data set is totally meaningless

More Data Rules Always compute the level of significance when comparing two distributions Always compute the level of significance when comparing two distributions Always know your measuring errors. If you don't them you are not doing science. Always know your measuring errors. If you don't them you are not doing science. Always calculate the dispersion in any correlative analysis. Remember that a correlation is only as good as the dispersion of points around the fitted line. Always calculate the dispersion in any correlative analysis. Remember that a correlation is only as good as the dispersion of points around the fitted line.

The Biggest Rules Always require someone to back up their "belief statements" with credible data. Always require someone to back up their "belief statements" with credible data. Change the world. Stop being a passive absorber of some one else's belief system. Change the world. Stop being a passive absorber of some one else's belief system. Frame all environmental problems objectively and seek reliable data to resolve conflicts and make policy Frame all environmental problems objectively and seek reliable data to resolve conflicts and make policy

And Now For Something Completely Different: Current Tag Cloud: Global climate change, species extinction, oil depletion, world food crises, global inequity, environmental justice, depletion of mineral resources, blogs and rants, sustainability, alternative energy solutions, alternative fuels, more blogs, Black Friday, flash mobs, crowd sourcing, NMFP Current Tag Cloud: Global climate change, species extinction, oil depletion, world food crises, global inequity, environmental justice, depletion of mineral resources, blogs and rants, sustainability, alternative energy solutions, alternative fuels, more blogs, Black Friday, flash mobs, crowd sourcing, NMFP

Your World Upon Graduation The Fossil Fuel Legacy The Fossil Fuel Legacy

Engineering the Planet What Compels us to do so?

Consumption: Pros and Cons This depends on how you want to index consumption – personal consumption/affluence is different than production/consumption that indirectly leads to better society infrastructure and services. What matters is the rate of consumption relative to the resource base. Main problem is that short term market growth, which we value, wants high rates. Sustainability demands lower rates  this is the clash of values.

Key Historical Moments We are special (different than other animals) We are special (different than other animals) We are uniquely positioned at the center of the Universe (reflects our “special-ness”) We are uniquely positioned at the center of the Universe (reflects our “special-ness”) The Universe is ordered, logical and rational – Age of Reason  humankind is unbounded The Universe is ordered, logical and rational – Age of Reason  humankind is unbounded The Newtonian world shows us the machine and it is precise (we can now engineer the planet) The Newtonian world shows us the machine and it is precise (we can now engineer the planet) The notion of uncertainty, as a valid and integral scientific concept, arises too late in this process  we already have truth pathways established The notion of uncertainty, as a valid and integral scientific concept, arises too late in this process  we already have truth pathways established

Essence of Science Knowledge based on measurement means that knowledge is both uncertain and subject to change when new and better measurements are made – there is no room for absolute truth in this methodology Knowledge based on measurement means that knowledge is both uncertain and subject to change when new and better measurements are made – there is no room for absolute truth in this methodology Problems can then only be solved by objective means that rely on real data and not bias or wishful thinking. Problems can then only be solved by objective means that rely on real data and not bias or wishful thinking.

Choice Pathways Which world does humanity want to live in? One that is based on a belief system that is then projected on to the natural world to support that belief (this is the BIAS) One where scientific methodology and thinking is used to enable, on a planet wide scale, the enlightenment motto that all men are created equal

Relationship with the Land is key Three possibilities The Land is Sacred  “Indigienous Model” your ancestors are buried in it The Land is shared  “European Model”  lots of people, not much land The Land is Owned  “American Model”  lots of land, I can piss on it if I want, afterall, its mine.

Continued Economic Development Requires high Energy Use 1900  100 Million Capitalists to build markets 2003  2.5 Billion new capitalists Energy is the core of the “environmental problem”; Environment is the core of the energy problem The energy-environment intersection is the core of the sustainable-prosperity problem

Resolution? We need to stop be driven by market economics and start to recognize that energy and environment is a shared resource. Your generation needs to more seriously engage with the issue of quantitative reasoning if you are to make informed choices. Otherwise, nothing good happens