All about noisy time series analysis in which features exist and long term behavior is present; finite boundary conditions are always an issue.

Slides:



Advertisements
Similar presentations
CMB Power spectrum likelihood approximations
Advertisements

Forecasting OPS 370.
Forecasting Models With Linear Trend. Linear Trend Model If a modeled is hypothesized that has only linear trend and random effects, it will be of the.
Experiments and Variables
Chapter 15 Above: GPS time series from southern California after removing several curve fits to the data.
1 Detection and Analysis of Impulse Point Sequences on Correlated Disturbance Phone G. Filaretov, A. Avshalumov Moscow Power Engineering Institute, Moscow.
Time Series Analysis Autocorrelation Naive & Simple Averaging
Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft NDACC H2O workshop, Bern, July 2006 Water vapour profiles by ground-based FTIR Spectroscopy:
Isoparametric Elements Element Stiffness Matrices
Regression Diagnostics Using Residual Plots in SAS to Determine the Appropriateness of the Model.
Calculating the Beam Position at the Ecal for DESY Run (Independent of Tracking) Hakan Yilmaz.
Page 1 Calculating the Beam Position at the Ecal for DESY Run (Independent of Tracking) Hakan Yilmaz.
Interpolation Used to estimate values between data points difference from regression - goes through data points no error in data points.
Grand Overview Environmental Problems are generally characterize by noisy and ambiguous data. Understanding errors and data reliability/bias is key to.
Bill Atwood - July 2002 GLAS T 1 A Kalman Filter for GLAST What is a Kalman Filter and how does it work. Overview of Implementation in GLAST Validation.
EMSE 3123 Math and Science in Education
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Regions Defined by Two Inequalities
1 10. Joint Moments and Joint Characteristic Functions Following section 6, in this section we shall introduce various parameters to compactly represent.
Calculating the Tensile and Compressive Strength of Bridge Members EGN1006.
Confidence Intervals. Estimating the difference due to error that we can expect between sample statistics and the population parameter.
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Measure your handspan and foot length in cm to nearest mm We will record them as Bivariate data below: Now we need to plot them in what kind of graph?
Dr. Richard Young Optronic Laboratories, Inc..  Uncertainty budgets are a growing requirement of measurements.  Multiple measurements are generally.
Chap 8-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Business Statistics: A First Course.
Math 3120 Differential Equations with Boundary Value Problems Chapter 2: First-Order Differential Equations Section 2-6: A Numerical Method.
Quantitative Skills 1: Graphing
1 What Is Forecasting? Sales will be $200 Million!
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 4 Curve Fitting.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
Physics 114: Exam 2 Review Lectures 11-16
Model Construction: interpolation techniques 1392.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
Example 13.6a Houses Sold in the Midwest Exponential Smoothing.
Unit 1, Chapter 2 Integrated Science. Unit One: Forces and Motion 2.1 Using a Scientific Model to Predict Speed 2.2 Position and Time 2.3 Acceleration.
Time series Model assessment. Tourist arrivals to NZ Period is quarterly.
HY436: Mobile Computing and Wireless Networks Data sanitization Tutorial: November 7, 2005 Elias Raftopoulos Ploumidis Manolis Prof. Maria Papadopouli.
Analysis of Residuals ©2005 Dr. B. C. Paul. Examining Residuals of Regression (From our Previous Example) Set up your linear regression in the Usual manner.
16722 We: measurements & distributions114+1 mean (average) mean value of all measurements = ((each value observed) * (number of times.
23 November Md. Tanvir Al Amin (Presenter) Anupam Bhattacharjee Department of Computer Science and Engineering,
Basic Time Series Analyzing variable star data for the amateur astronomer.
Chapter 15 – CTRW Continuous Time Random Walks. Random Walks So far we have been looking at random walks with the following Langevin equation  is a.
Determination of True Attenuation Lengths using SPASE-AMANDA Coincidence Data Tim Miller JHU/APL.
MnSGC Ballooning Team Techniques: APRS tracking-data processing James Flaten Summer 2010.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall
Chapter 3: Organizing Data. Raw data is useless to us unless we can meaningfully organize and summarize it (descriptive statistics). Organization techniques.
Principles of Extrapolation
Example x y We wish to check for a non zero correlation.
Session V: The Normal Distribution Continuous Distributions (Zar, Chapter 6)
XBSM Analysis - Dan Peterson Review of the optics elements: Pinhole (“GAP”), FZP, Coded Aperture Extracting information from the GAP what is the GAP width?
Vondrák Filtering for Timescales Demetrios Matsakis, Gianna Panfilo, and Gérard Petit USNO & BIPM.
1 Decision Making ADMI 6510 Forecasting Models Key Sources: Data Analysis and Decision Making (Albrigth, Winston and Zappe) An Introduction to Management.
Copyright © Cengage Learning. All rights reserved. 8 9 Correlation and Regression.
Forecast 2 Linear trend Forecast error Seasonal demand.
SPIRE Flux Calibration: Implementation
Advanced Quantitative Techniques
OPSE 301: Lab13 Data Analysis – Fitting Data to Arbitrary Functions
Volume 74, Issue 4, Pages (April 1998)
Fundamentals Data.
Chapter 11.2 Graphs are a visual representation of trends in data.
Statistics Branch of mathematics dealing with the collection, analysis, interpretation, presentation, and organization of data. Practice or science of.
Estimating
Homework 1 Responses and Issues
Dealing with Noisy Data
NanoBPM Status and Multibunch Mark Slater, Cambridge University
6.7 Practical Problems with Curve Fitting simple conceptual problems
The Bias Variance Tradeoff and Regularization
Volume 74, Issue 4, Pages (April 1998)
Chapter 7 Finite Impulse Response(FIR) Filter Design
Chapter 7 Finite Impulse Response(FIR) Filter Design
Presentation transcript:

All about noisy time series analysis in which features exist and long term behavior is present; finite boundary conditions are always an issue

 For any time series problem, plot the data first at some sensible scale and do simple smoothing to see if there is underlying structure vs just all random noise.  Do a simple preliminary VISUAL analysis – fit a line to all or parts of the data, just so you get some better understanding  EXCEL is actually convenient for this

 Small differences in approaches produce slightly different results:

Area ratio = 1.37 (721/525) No Need to do “fancy” interpolation for numerical integration for this data given the intrinsic noise – why do extra calculations if you don’t need to  spare machine resource

 There are many techniques for smoothing  There is always a trade off between smoothing width and data resolution. There is no formula to optimally determine this – you have to experiment with different procedures.  Exponential smoothing often looks “weird” as both the weights and the smoothing changes with smoothing parameter

Area under curve = 1 Area under curve = 1; 11 points are shown here; use 7 points for each data point; 96% of wt.

Note phase “error” – this is common because of a finite data end so best build in an offset that you can change

Blue = exponential

Easy and the point was that this is not a randomly distributed variable; distribution is skewed (third moment; kurtosis = 4 th moment

 As expected this gave everyone the most trouble; its not hard but you do have to pay attention to the your process.  First produce a sensible plot so you get a feel for the amplitude of the feature

See the Noise! Now do Feature Extraction – want to fit a “continuum” that doesn’t include features. This simple linear fit does include features so is wrong but it serves as an initial guide.. What you notice is that the peak is about.4 units above the “baseline”. Window out features and continue. 0.4

Maybe 3 events; 1 for sure; note amplitude is correct compared to the first pass (i.e. ~0.4) Area under the curve for biggest feature is about a 3% excess over baseline – not very high amplitude but not a NOISE FEATURE either; 0=9 sq. km; feature is 20 years; 9x20 =180; area of feature is triangle: ½ *20*(.5) = 5; 5/180 = 3% (good enough for estimate) Constant = 9.1

>>> import numpy >>> x,y = numpy.loadtxt(“xy.txt”, unpack=True) >>>p = numpy.polyfit(x, y, deg=3) >>>print p >>>-7e-07 1e Excel = -5e-07 8e

Yes there is a family of functions that work for this kind of “sharp cutoff wave form”

SIGMOIDAL DISTRIBUTION A = B = C = 2.2E10 D =-4.69 The zunzun.com site is magic!

Science/Policy Issue; 2095 vs 2030