NOAA National Climatic Data Center Dr. Karsten Shein Climatologist NOAA/NESDIS/NCDC 151 Patton Ave. Asheville, NC 28801 1.828.271.4223.

Slides:



Advertisements
Similar presentations
Better Data, Better Science! [ Better Science through Better Data Management ] Todd D. OBrien NOAA – NMFS - COPEPOD.
Advertisements

Meteorological Observatory Lindenberg – Richard Assmann Observatory The GCOS Reference Upper Air Network.
Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.
Igor Zahumenský (Slovakia, CBS ET/AWS)
Literature Review Kathryn Westerman Oliver Smith Enrique Hernandez Megan Fowler.
Datzilla Usage NCDC Perspective Karsten Shein NOAA National Climatic Data Center NWS Sub-regional data.
2.2 Validation & Verification
The Integrated Surface Hourly (ISH) Global Database ESDIM/OGP funded, FCC effort 2 QC phases thus far Full POR online via FTP, partial POR via CDO ISH.
1 Fischer-Porter Retrofit Workshop Sterling, Virginia Nov , 2008 Hourly Precipitation Data Processing System at NCDC Stuart Hinson Meteorologist.
NOAA’s National Climatic Data Center In Situ Data Sets, Processing, Quality Control, and Access Ingest and Analysis Branch NOAA’s National Climatic Data.
Report from the GCOS Archive/Analysis Center Matthew Menne NOAA/National Centers for Environmental Information Center for Weather and Climate (NCEI-Asheville)
Weather Coder 3 Examples (Not real observations, we hope !)
Sorin CHEVAL*, Tamás SZENTIMREY**, Ancuţa MANEA*** *National Meteorological Administration, Bucharest, Romania and Euro-Mediterranean Centre for Climate.
Current Website: An Experimental Surface Water Monitoring System for Continental US Andy W. Wood, Ali.
Improving the Quality of Tax Statistics: Recent Innovations in Editing and Imputation Techniques at the Statistics of Income Division of the U.S. Internal.
NCPP – needs, process components, structure of scientific climate impacts study approach, etc.
WMO-SMN Supported Research On An Expanded Operational Climate Division Network for Mexico Art Douglas and Phil Englehart Creighton University.
Meteorological Observatory Lindenberg – Richard Assmann Observatory The GCOS Reference Upper Air Network.
National Water and Climate Center PRISM Probabilistic-Spatial QC (PSQC) System for SNOTEL Data MTNCLIM 2006 CONFERENCE September 19-22, 2006 at Timberline.
Digital records and data rescue at the Hydrometeorological Institute of Montenegro Vera Andrijasevic Vera Andrijasevic Hydrometeorological Institute of.
Where Policy Meets Process Presented by: Joanna Bauer.
Workshop on International Standards, Contemporary Technologies and Regional Cooperation, Noumea, New Caledonia, 04–08 February 2008 Results Generated from.
1 Status of NERON/HCN-M for The Committee for Climate Analysis, Monitoring, and Services (CCAMS) John Hahn NWS Office of Science and Technology.
Workshop on QC in Derived Data Products, Las Cruces, NM, 31 January 2007 ClimDB/HydroDB Objectives Don Henshaw Improve access to long-term collections.
Climate data sets: introduction two perspectives: A. What varieties of data are available? B. What data helps you to identify...
Datzilla Overview / Workflow and Climate Tools from SRCC & SCIPP Kevin Robbins, Director Southern Regional Climate Center.
Development of a 103-Year High- Resolution Climate Data Set for the Conterminous United States Wayne Gibson 1, Christopher Daly 1, Tim Kittel 2, Doug Nychka.
What Is xmACIS?  Web application developed specifically for NWS  Basically xmACIS is a search engine for climate data (NCDC)  Includes all known published.
Emission Inventory Quality Assurance/Quality Control (QA/QC) Melinda Ronca-Battista ITEP/TAMS Center.
Module 6. Data Management Plans  Definitions ◦ Quality assurance ◦ Quality control ◦ Data contamination ◦ Error Types ◦ Error Handling  QA/QC best practices.
Michael A. Palecki USCRN Science Project Manager National Climatic Data Center DOC/NOAA/NESDIS USCRN PROGRAM STATUS MARCH 3, United States Climate.
G. R. Wiggans and P. M. VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD
New Local Climate Outlook Products, Data Tools, and Services Michael Brewer NOAA/NWS/OCWWS NWS Partners Meeting January 18, 2007.
Operational Issues from NCDC Perspective Steve Del Greco, Brian Nelson, Dongsoo Kim NOAA/NESDIS/NCDC Dongjun Seo – NOAA/NWS/OHD 1 st Q2 Workshop Archive,
September Interface Kickoff Sunflower Project Statewide Management and Reporting Tool Update September 02, 2009.
American Association of State Climatologists, Coeur d’ Alene, ID 18 July, 2007 Update Since Rapid City Jan Curtis Applied Climatologist National Water.
WFM 6311: Climate Risk Management © Dr. Akm Saiful Islam WFM 6311: Climate Change Risk Management Akm Saiful Islam Lecture-7:Extereme Climate Indicators.
International Workshop on Rescue and Digitization of Climate Records in the Mediterranean Basin Data Rescue Activities at Slovenian Meteorological Office.
Surface Water Quality Monitoring Information System (SWQMIS) Cindi Atwood Tetra Tech, Inc. (703) Nancy Ragland TCEQ.
CPC Unified Precipitation Project Pingping Xie, Wei Shi, Mingyue Chen and Sid Katz NOAA’s Climate Prediction Center
Verification & Validation. Batch processing In a batch processing system, documents such as sales orders are collected into batches of typically 50 documents.
Central Region Snowfall Analysis Brian P. Walawender NWS Central Region Headquarters Matt W. Davis NWS WFO La Crosse, WI 5/26/2011.
CLIMATE SERVICE DIVISION / OCWWS / NWS L3MTO QC and Accuracy Marina Timofeyeva Contributors: Annette Hollingshead, Dave Unger and Andrea Bair.
NOAA’s National Climatic Data Center Climate Service Partnership Activities At NOAA’s National Climatic Data Center Tim Owen Climate Prediction Applications.
RFC Climate Requirements 2 nd NOAA Climate NWS Dialogue Meeting January 4, 2006 Kevin Werner.
Data Collection. Data Capture This is the first stage involved in getting data into a computer Various input devices are used when getting data to the.
1 NODC Quality Control : Automatic Checks - reveal systematic errors in incoming data and metadata - eliminate most non-representative data from consideration.
Perspectives on Historical Observing Practices and Homogeneity of the Snowfall Record Kenneth E. Kunkel NOAA Cooperative Institute for Climate and Satellites.
Real Time Nowcasting In The Western Us OR Why you can’t use nodes C0-2 George Thomas Andy Wood Dennis Lettenmaier Department of Civil and Environmental.
2/18/2016 Data Management Issues Related to Drought Monitoring at Environment Canada Robert Morris Data Analysis and Archive Division Meteorological Service.
Cooperative Observer Program (COOP). Mission and Vision of COOP A network of 10,240 observations in all states and territories Organic Act of 1890 established.
N ational C limatic D ata C enter Development of the Global Historical Climatology Network Sea Level Pressure Data Set (Version 2) David Wuertz, Physical.
2016 winter seasonal climate forecasts over the western US Background. My graduate class (Stochastic Hydrology) is doing a final group project to forecast.
Verification of operational seasonal forecasts at RA-VI Regional Climate Center South East European Virtual Climate Change Centre Goran Pejanović Marija.
Session 6: Data Flow, Data Management, and Data Quality.
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
Current WEBSITE: Experimental Surface Water Monitor for the Continental US Ali S. Akanda, Andy W. Wood,
Quality Control of Soil Moisture and Temperature For US Climate Reference Network Basic Methodology February 2009 William Collins USCRN.
Data quality & VALIDATION
Jay Lawrimore, Matt Menne
Climate Monitoring Tools High Plains Regional Climate Center
U.S.-India Partnership for Climate Resilience
Status Report of EDI on the CAA
Auditing Information Technology
Status Report of EDI on the CAA
Be The Weather Guy Presented to UCALL on October 14, 2009
Data validation handbook
Discrepancy Management
An Experimental Daily US Surface Water Monitor
Precip, Tmax, Tmin scaled to match CRU monthly data.
Presentation transcript:

NOAA National Climatic Data Center Dr. Karsten Shein Climatologist NOAA/NESDIS/NCDC 151 Patton Ave. Asheville, NC Interactive Quality assurance practices A look at some methods for evaluating COOP data at the NOAA National Climatic Data Center IIPS 6A.9

NOAA National Climatic Data Center 2 of 19 COOP Climate Data ~ 8500 stations reporting ~ 1,000,000 observations per month Manual observations and reporting –Most stations observe PRCP (SNOW) –Many also observe TMAX, TMIN, TOBS, SNWD –Few: DYSW, EVAP, WTEQ, WDMV, Soil T Daily data / Monthly processing Arrive at NCDC in electronic (daily) and paper (monthly) formats.

NOAA National Climatic Data Center 3 of 19 Sources of bias in COOP data Observations –Rounding, Instrument error, incorrect obs technique Recording –Transposition, Wrong column, Wrong units, Wrong sign, illegibility, non- entries, wrong resolution, wrong date, wrong time. Transcription (keying) –Most keying errors are due to bias already introduced by the recording step Transmission (file corruption) Validation / Quality Control –Failure to review forms prior to NCDC –Incorrect metadata –Inappropriate validation, flagging, estimation

NOAA National Climatic Data Center 4 of 19 Not all COOP observations are created equally

NOAA National Climatic Data Center 5 of 19

NOAA National Climatic Data Center 6 of 19 10° F or 12° F ? Error corrections or error creations? ° F ° F TMAX of 19° F with TOBS of 21 ° F ? 20 inches of snow or 2.0?Keyers must “key what they see.” TMIN and TOBS in wrong columns.

NOAA National Climatic Data Center 7 of 19 Automated COOP QC Primary checks on Temperature and Precipitation elements –Around 10 million values per year Internal consistency –Logical (e.g., TMAX ≥ TMIN) –Spikes, flatliners, outliers, excessive range, change points –Date shifting Spatial consistency Values not automatically invalidated unless logically or meteorologically impossible. –Suspect data are reviewed by operators Original values are NEVER changed or edited –unless they were keyed incorrectly

NOAA National Climatic Data Center 8 of 19 First stage Interactive QC “Quasi-Interactive” Applied to values deemed suspect and where a logical operation will not provide resolution (e.g., date shifting) Value compared to climatological neighbors and to computed grids (GEA, TempVAL, PrecipVAL) –See extended abstract or for references and details. Decision is made to accept/reject valid status assigned by automated QC Operator intervention is part of decision process

NOAA National Climatic Data Center 9 of 19 Fully-Interactive QC Health of the Network Datzilla

NOAA National Climatic Data Center 10 of 19 Health of the Network –Available once final QC has been completed –Output for: TMAX, TMIN, TOBS, PRCP, SNOW, SNWD –Accessible by anyone via the Internet –Track QC by station, state, WFO, NWS region, or RCC region. –Graphical and tabular reports to highlight quality issues. Web-based tool for viewing the results of NCDC QC processing.

NOAA National Climatic Data Center 11 of 19 Health of the Network Data Completeness (number of obs) Quality Assurance (invalid and missing) Missing Data / Non- reporting stations Data validity (% unflagged, non-missing) Watch list (change points detected w/o corresponding metadata)

NOAA National Climatic Data Center 12 of 19

NOAA National Climatic Data Center 13 of 19 Summary from HoN All TMAX, TMIN, TOBS, PRCP, SNOW, SNWD data subjected to automated QC % of checked 2006 COOP data declared valid (no further checks). 3.28% declared invalid (331,777) –14.65% no estimate (48,602) –85.36% estimated (283,175) ◦31% of estimates supplied by TempVal (87,820) Thanks to Helen Frederick, HoN administrator, for supplying the numbers.

NOAA National Climatic Data Center 14 of 19 Datzilla Fully manual, interactive quality assurance Web-based tool to report and track errors in NOAA-held data, metadata or associated delivery systems. Developed and maintained by Kevin Robins at the SRCC

NOAA National Climatic Data Center 15 of 19

NOAA National Climatic Data Center 16 of 19 Some possible reasons for a Datzilla ticket to NCDC Data issue (usually) –That TMAX of 74 should be a 47! –Your QC clobbered my data! System issue –CDO inventory doesn’t match the data I got! Metadata issue –This station’s COOP number is wrong!

NOAA National Climatic Data Center 17 of 19 When NCDC receives a Datzilla ticket … Datzilla gatekeeper Initial determinations Reassignment Investigation Course of action Resolution Closure Your friendly Datzilla Gatekeeper

NOAA National Climatic Data Center 18 of 19 Datzilla Summary Began operation early 2005 As of 1/15/08: –865 Datzilla entries (564 NCDC) –226 open (109 NCDC) ◦Receive about 15 new tickets per month –455 of the 564 resolved –317 verified errors with archive fix ◦Most are single values or metadata ◦Few involve many values or larger station issues Has resulted in improvement to the historical climate record. Data quality Data errors

NOAA National Climatic Data Center Dr. Karsten Shein Climatologist NOAA/NESDIS/NCDC 151 Patton Ave. Asheville, NC Thank You