Data Editing Strategies Common edits Invalidation vs. contamination? What is a considered a spike? What not to edit! Automatic edits Reasons for edits.

Slides:



Advertisements
Similar presentations
Getting started with MPLAB Launch MPLAB Set Toolbar as in the next slide by clicking the leftmost icon Open a new Source file by choosing [FILE][NEW] Type.
Advertisements

Introductory Circuit Analysis Robert L. Boylestad
Importing Data into IX1D v 3 – A Tutorial © 2006 Interpex Limited All rights reserved Version 1.0.
Problems With Determining Oxygen Deficiencies in Ratios Used for Assessing Spontaneous Combustion Activity Darren Brady Manager OHECC Simtars Department.
CEMTEK CEMS Users Group Meeting and Forum September 24-25, 2009 Santa Ana, California RKI Specific DAHS Training and RECLAIM Updates Presented by Norm Iseri, RKI Engineering.
To figure out which tubes are working fine… Compare 24 to 2.6 and 2.51 to 2.6 by taking the ratio of counts. In general, 2.4 had the most problems, and.
AMS Performance at DC3-Test Some preliminary results Pedro Campuzano Jost, Douglas Day, Michael Lechner, Jose Luis Jimenez.
Wavelength dependence of the single scattering albedo Single scattering albedo (SSA) values at 450(blue), 550(green) and 700(red) nm The SSA values during.
Personal Planning System Enterprise Edition The Completing the Monthly Review.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Introduction to Excel 2007 Part 2: Bar Graphs and Histograms February 5, 2008.
Anesthesia Manager Training Working with Real-Time Data.
AOSC 634 Air Sampling and Analysis Lecture 1 Measurement Theory Performance Characteristics of instruments Nomenclature and static response Copyright Brock.
Fixed-Point Arithmetics: Part I
Operational Quality Control in Helsinki Testbed Mesoscale Atmospheric Network Workshop University of Helsinki, 13 February 2007 Hannu Lahtela & Heikki.
8 November Forms and JavaScript. Types of Inputs Radio Buttons (select one of a list) Checkbox (select as many as wanted) Text inputs (user types text)
of Experimental Density Data Purpose of the Experiment
CSE 378 Floating-point1 How to represent real numbers In decimal scientific notation –sign –fraction –base (i.e., 10) to some power Most of the time, usual.
Control Charts for Attributes
CS1100: Computer Science and Its Applications Creating Graphs and Charts in Excel.
Copyright © 2013 FingerTec Worldwide Sdn.Bhd. All rights reserved.
Method Comparison A method comparison is done when: A lab is considering performing an assay they have not performed previously or Performing an assay.
DEMONSTRATION FOR SIGMA DATA ACQUISITION MODULES Tempatron Ltd Data Measurements Division Darwin Close Reading RG2 0TB UK T : +44 (0) F :
V. Rouillard  Introduction to measurement and statistical analysis ASSESSING EXPERIMENTAL DATA : ERRORS Remember: no measurement is perfect – errors.
START EXCEL BUILD OR RETRIEVE YOUR OPTIMIZATION MODEL SAVE YOUR WORKBOOK!! CHOOSE “Solver…” IN THE “Tools” MENU SPECIFY IN SOLVER DIALOG BOX: 1.CELL TO.
Hypothesis Testing. Distribution of Estimator To see the impact of the sample on estimates, try different samples Plot histogram of answers –Is it “normal”
Using CPX2 to edit What week am I on?? Plotting a week of data in CPX2 Looking at the raw data – raw mode  moving among plots  message log  zooming.
Met Alert Tool (MAT). Introduction What is MAT? –Met Alert Tool (MAT) monitors and alerts the user to weather conditions exceeding thresholds (for example,
1 Statistical Inference. 2 The larger the sample size (n) the more confident you can be that your sample mean is a good representation of the population.
Measures of Variability In addition to knowing where the center of the distribution is, it is often helpful to know the degree to which individual values.
Aerosol Obsveration System (AOS) in the MAGICal experiment Gunnar Senum BNL First MAGIC Science Workshop.
Sampling Distribution of the Sample Mean. Example a Let X denote the lifetime of a battery Suppose the distribution of battery battery lifetimes has 
Quality Assurance How do you know your results are correct? How confident are you?
Uncertainty and Error in Measurement (IB text - Ch 11) (If reviewing this slide in the senior year, there is also uncertainty information in the AP text.
Google Calendar Review 1. Go to 2. Your calendar should look something like this. 3 Click the arrow to continuehttp://
Chapter 10 Verification and Validation of Simulation Models
Statistical Process Control04/03/961 What is Variation? Less Variation = Higher Quality.
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
QA filtering of individual pixels to enable a more accurate validation of aerosol products Maksym Petrenko Presented at MODIS Collection 7 and beyond Retreat.
C.Vigorito, University & INFN Torino, Italy 30 th International Cosmic Ray Conference Merida, Mexico Search for neutrino bursts from Gravitational stellar.
Copyright © 2005 Pearson Education, Inc. Slide 6-1.
Center for Atmospheric Particle Studies AMS Data Analysis Training Amy Sage and Andy Grieshop March 7, 2007.
Photon reconstruction and matching Prokudin Mikhail.
CHAPTER 2: Basic Summary Statistics
Control Loops Tune a Fish. Control Loops Tuning of a control loop involves selecting loop parameters to ensure stable control under all operating conditions.
Lecture 8: Measurement Errors 1. Objectives List some sources of measurement errors. Classify measurement errors into systematic and random errors. Study.
 Are two random variables related to each other ?  What does it mean if the data are independent?  What is meant by the term covariance?  What does.
Pat Arnott, Dec 2003 DRI Aerosol Optics at the SGP CART Site and on the Twin Otter Pat Arnott, John Walker, Hans Moosmüller Desert Research Institute University.
{ Binary “There are 10 types of people in the world: Those who understand binary, and those who don't.”
06/28/10UVIS TEAM MEETING OPERATIONS UPDATE UVIS TEAM MEETING OPERATIONS UPDATE 1.
Using Graphs and Charts Organizing results from Scientific data.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
Double Star Active Archive - STAFF-DWP Data errors and reprocessing Keith Yearby and Hugo Alleyne University of Sheffield Nicole Cornilleau-Wehrlin LPP.
I'm concerned that the OS requirement for the signal is inefficient as the charge of the TeV scale leptons can be easily mis-assigned. As a result we do.
Chapter 5 Armature Gap Sweep Workshop 4. Training Manual Electromagnetic Analysis in Workbench March 4, 2005 Inventory # Workshop #2: Armature.
Diagnostic clinical chemistry
Control System Instrumentation
Thomas Müller and Alfred Wiedensohler
Two-Sample Hypothesis Testing
AC-9/AC-S data analysis from CDOM Lab
Chapter 10 Verification and Validation of Simulation Models
How to represent real numbers
How to represent real numbers
The MIS and Dashboard.
Hypothesis Testing A hypothesis is a claim or statement about the value of either a single population parameter or about the values of several population.
Topic 7: Visualization Lesson 1 – Creating Charts in Excel
Feedback Controllers Chapter 8
CHAPTER 2: Basic Summary Statistics
Behavioural modelling
Presentation transcript:

Data Editing Strategies Common edits Invalidation vs. contamination? What is a considered a spike? What not to edit! Automatic edits Reasons for edits Apply correction to data (e.g., known offset or calibration) Bad neph zero Size cut wrong or not switching Pump failure System left in bypass mode System humidity too high Other instrument or system problems

Invalidation vs. Contamination Data should be invalidated when they are out of the acceptable range (site specific), show abnormal variability, or if a sampling or measurement problem is determined. Data should be flagged as contaminated if local aerosol sources cause the data to be not representative of the regional or target (e.g. downslope conditions at MLO) aerosol. Flagging for contamination can be either manual or automated (WD, WS, CN threshold). Big differences: Invalidation can handle different aerosol parameters differently (e.g., CCN invalidated but not scattering), while Contamination flags all aerosol data during the episode. In the data processing functionality, Invalidation and Contamination are slightly different. Invalidation causes MVCs to be inserted into the final corrected data file during the time of the edit, while Contamination flags these data periods. Neither invalidated nor contaminated data are used to calculate the final, QC-edited and corrected average-format (hourly, daily, monthly) data files.

Spikes Spikes are short-duration deviations from “normal” measurements, often caused by local contamination or measurement problems. The time duration and magnitude for a deviation to be considered a spike is dependent on the variability of the “normal” measurement. Generally, spikes are of 1-15 minutes duration. Longer duration deviations are either considered contamination events, sampling/measurement problems, or real aerosol events. Spikes can be positive or negative. If the spike is physically unrealistic (e.g., large negative [CN]), then it should be invalidated.

What Not To Edit! Do not edit out spikes in intensive data (if the extensive data look ok) Intensive parameters (i.e., the calculated parameters single scattering albedo, angstrom exponent, and backscattering fraction) are calculated and displayed based on the 1-min measured data (scattering and absorption). Because the calculated parameters involve ratios of measured parameters, there may be spikes in the calculated parameters when the measured parameters are close to zero. Two reasons not to edit intensive parameter spikes:  bias your data by removing low concentration time periods  averaged intensive parameters should be calculated by first averaging the measured parameters and then calculating intensive parameters. The spikes in the high frequency (1-min) data are unlikely to affect the average intensive values.

Examples (Invalidation) SPO,2009, ,USER: FLow rate tests completed CNC #626 = 1525 mlpm WCPC = mlpm. Zero tests both 0.0 PDC Spikes caused by flow checks These should be invalidated since a known procedure with the instrument or sampling caused them

Examples (Invalidation) Pump failure  no flow through system Aerosol system issue Need to invalidate scattering, absorption, CPC, etc., data This problem looks a lot like when the system is left in bypass mode. Look at Q_Analyzer flow to determine.

Examples (Contamination) At MLO, upslope air is usually more polluted than downslope (free tropospheric) air Measurements during upslope episodes are flagged (black markers at bottom of plot). Spike at Day appears to be local contamination, decaying away rapidly. Since we did not observe any problem with the instrument or aerosol system operation, we flagged this spike as contamination.

Examples (Automatic Edits) Data at SMO are flagged based on wind direction Flagging occurs when aerosol is coming from the island (intense local sources, i.e., spikes) instead of the open ocean This saves a lot of manual editing of data!

Size cut wrong or not switching (1) In this example, electronic valve was stuck in the 10- micron cut position. Data still get flagged alternately as 1-um or 10-um based on cpd configuration. Can we fix this using the data editing tools? Let’s look more closely at this period.

Size cut wrong or not switching (2) The problem is that data are being flagged as submicron when they are in fact Dp<10um. Need to flip the hex flag bit that denotes the size cut

Size cut wrong or not switching (3) Open Mentor edits window, then click to add a new edit. In the edit directive window, choose “Bitmask”, then the flags variable “F_aer”. For the Source Bit, change it to “0x10”. For the Target Bit, change it to “0x0”. A discussion of the flag bits and an example of this is provided in the Knowledge Base under “Bitmask Edits”

Relative Humidity Effects on Data The PSAP absorption measurement is very sensitive to changes in relative humidity. These data should be INVALIDATED. In this example the absorption is very noisy and the noise increases when the Neph Inlet RH > ~60% In this example you can clearly see the fluctuations in absorption corresponding to fluctuations in humidity due to air conditioning cycling.