Scientific Investigations; Support from Research Data Archives for Computing in Atmospheric Sciences 2001 29 October, 2001 Steven Worley National Center.

Slides:



Advertisements
Similar presentations
José Meitín NOAA Office of Global Programs Steve Williams UCAR Joint Office for Science Support José Meitín NOAA Office of.
Advertisements

Data management in SCD Steven Worley General Categories –The Mass Storage System –NCAR user file services (home directories) –Computer attached storage.
Brian Doty and Jennifer Adams
RAMADDA for Big Climate Data Don Murray NOAA/ESRL/PSD and CU-CIRES Boulder/Denver Big Data Meetup - June 18, 2014.
SCD Research Data For UCAR Data Management Working Group January 10, 2001 Steven Worley Scientific Computing Division Data Support Section.
ICOADS Archive Practices at NCAR JCOMM ETMC-III 9-12 February 2010 Steven Worley.
The International Surface Pressure Databank (ISPD) and Twentieth Century Reanalysis at NCAR Thomas Cram - NCAR, Boulder, CO Gilbert Compo & Chesley McColl.
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System 1 Zaihua Ji Doug Schuster Steven Worley Computational.
Introduction Downloading and sifting through large volumes of data stored in differing formats can be a time-consuming and sometimes frustrating process.
U.S. Surface Archives Sent to China ( ) 7 th PRC-U.S. Joint Coordination Panel for Data and Information Cooperation 29 Nov. – 1 Dec, 2000 Steven.
October 16-18, Research Data Set Archives Steven Worley Scientific Computing Division Data Support Section.
Coordinated Energy and water-cycle Observations Peroject A Well Organized Data Archive System Data Integrating/Archiving Center at University of Tokyo.
TIGGE Archive Highlights. First Service Date ECMWF – October 2006 NCAR – October 2006 CMA – June 2007.
Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section.
Growing and Future Datasets in the SCD Research Data Archives for NSF SCD Review Panel 16 October 2001 Steven Worley Scientific Computing Division Data.
Data for Climate and Energy Studies Steven Worley Computational and Information Systems Laboratory NCAR.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
Scientific Investigations; Support from Research Data Archives for Joint Office for Science Support 26 February, 2002 Steven Worley SCD/DSS.
Mathematics and Computer Science & Environmental Research Divisions ARGONNE NATIONAL LABORATORY Regional Climate Simulation Analysis & Vizualization John.
A Comparison of the Northern American Regional Reanalysis (NARR) to an Ensemble of Analyses Including CFSR Wesley Ebisuzaki 1, Fedor Mesinger 2, Li Zhang.
Data to Support Ocean-Atmosphere Research NCAR Research Data Archive (RDA), Zaihua Ji, NCAR Steven Worley, NCAR Scott Woodruff,
Archive and Access Practices that Support Data Reuse and Transparency Steven Worley Doug Schuster Bob Dattore National Center for Atmospheric Research.
Data Access to Marine Surface Observations and Products from COADS 29 January, 2002 Steven Worley National Center for Atmospheric Research.
CISL/DSS & MMM Data Discussion 19 March Who CISL/DSS - maintain NCEP operational analyses and observation datasets – Gregg Walters, Doug Schuster,
ICOADS: Update Status and Data Distribution Steven J. Worley Scott D. Woodruff Sandra J. Lubker Ziahua Ji J. Eric Freeman NCAR, NOAA/ESRL, NOAA/NCDC CLIMAR-III,
Analyzed Data Products Available from NCAR that Support Marine Climate Research JCOMM ETMC-III 9-12 February 2010 Steven Worley Doug Schuster.
ESIP Federation 2004 : L.B.Pham S. Berrick, L. Pham, G. Leptoukh, Z. Liu, H. Rui, S. Shen, W. Teng, T. Zhu NASA Goddard Earth Sciences (GES) Data & Information.
START-08/pre-HIPPO FIELD CATALOG AND DATA MANAGEMENT Steve Williams NCAR Earth Observing Laboratory (EOL) Boulder, Colorado START-08/pre-HIPPO Planning.
Data Discovery and Access to The International Surface Pressure Databank (ISPD) 1 Thomas Cram Gilbert P. Compo* Doug Schuster Chesley McColl* Steven Worley.
Content, Discovery, and Accessibility Enhancements to the NCAR Research Data Archive Doug Schuster and Steve Worley NCAR.
JRA-25 and JCDAS at NCAR Data from Japanese 25-year Reanalysis (JRA-25) and the operational follow- on JMA Climate Data Assimilation System (JCDAS) are.
RAINEX Data Management UCAR Joint Office for Science Support José Meitín Jim Moore Dick Dirks UCAR Joint Office for Science Support José Meitín Jim Moore.
The NOAA Operational Model Archive and Distribution System NOMADS The NOAA Operational Model Archive and Distribution System NOMADS Dave Clark for Glenn.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
TIGGE Data Archive and Access at NCAR November 2008 November 2008 Steven Worley National Center for Atmospheric Research Boulder, Colorado, U.S.A.
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
TIGGE Archive Status at NCAR THORPEX Workshop and 6th GIFS-TIGGE Working Group Meetings WMO Headquarters Geneva September 2008 Steven Worley Doug.
SCD Research Data Archives; Availability Through the CDP About 500 distinct datasets, 12 TB Diverse in type, size, and format Serving 900 different investigators.
Welcome to the PRECIS training workshop
Marine Surface and Climate Data Gaps in the archives at the National Center for Atmospheric Research for 15 th U.S. – China Marine and Fishery Science.
The Research Data Archive at NCAR: A System Designed to Handle Diverse Datasets Bob Dattore and Steven Worley National Center for Atmospheric Research.
Data Discovery and Access to The International Surface Pressure Databank (ISPD) 1 Thomas Cram Gilbert P. Compo* Doug Schuster Chesley McColl* Steven Worley.
Distributed Data Servers and Web Interface in the Climate Data Portal Willa H. Zhu Joint Institute for the Study of Ocean and Atmosphere University of.
RDA Data Support Section. Topics 1.What is it? 2.Who cares? 3.Why does the RDA need CISL? 4.What is on the horizon?
SCD Research Data for Ocean Observatories Steering Committee June 18, 2001 Steven Worley Scientific Computing Division Data Support Section.
AMPS : International Support for Antarctic Science and Activities Kevin W. Manning Jordan G. Powers National Center for Atmospheric Research.
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
1. 2 NOAA’s Mission To describe and predict changes in the Earth’s environment. To conserve and manage the Nation’s coastal and marine resources to ensure.
The TIGGE Model Validation Portal: An Improvement in Data Interoperability 1 Thomas Cram Doug Schuster Hannah Wilcox Michael Burek Eric Nienhouse Steven.
1. Gridded Data Sub-setting Services through the RDA at NCAR Doug Schuster, Steve Worley, Bob Dattore, Dave Stepaniak.
A41I-0105 Supporting Decadal and Regional Climate Prediction through NCAR’s EaSM Data Portal Doug Schuster and Steve Worley National Center for Atmospheric.
Introduction What purpose does a data archive center serve if users can’t find or access the holdings they might need to facilitate their research discoveries?
2005 – 06 – - ESSP1 WDC Climate : Web Access to Metadata and Data Frank Toussaint World Data Center for Climate (M&D/MPI-Met, Hamburg)
MERRA Data Access and Services
TIGGE Archives and Access
TIGGE Data Archive and Access System at NCAR
Jennifer Boehnert Emily Riddle Tom Hopson
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System Zaihua Ji Doug Schuster Steven Worley Computational.
Development and Futures of Research Data Archives
Research Data Archives at NCAR
Steven Worley, NSF/NCAR/SCD
Steven Worley, Douglas Schuster,
CISL’s Research Data Archive (RDA) : Description and Methods
Comeaux and Worley, NSF/NCAR/SCD
Long-Lived Data Collections
Data Management Components for a Research Data Archive
Robert Dattore and Steven Worley
Data Curation in Climate and Weather
Comeaux and Worley, NSF/NCAR/SCD
Presentation transcript:

Scientific Investigations; Support from Research Data Archives for Computing in Atmospheric Sciences October, 2001 Steven Worley National Center for Atmospheric Research Scientific Computing Division

Key Steps of Scientific Investigations Formulate the questions and review the state of understanding Search and discover data Access data Analyzes data Community sharing and archive Document new understandings

Search and Discover Data How?  Web based Information Server Salient Features –2.5K + html pages (metadata) –All datasets are described (500+) –Location of all data files in MSS –Higher level information Catalogs Project specific descriptions Always current dataset descriptions

Features Organization Navigation Archive Navigation Pull down menus Search Project Links

Dataset Page Title and Brief description Systematic Navigation Metadata highlights Period of Record Usage Variables Related Sites (NOAA) Contact Person Related Datasets

Brief Archive History and Specifications Started in middle 1960’s, (35 years) Managed by nine people 211K data files 17 TB in a MSS 530 datasets – all sizes

Global Observations P.O.R# YrsIncep. DateComments Rawinsondes1946-on551967Upper Air Pibals1942-on Upper Air, wind Aircraft1947-on USAF and Commer. Sat. cloud wind drift1967-on GOES and GTS Satellite Soundings TOVS + irradiance Surface Synoptic1948-on some much older Ocean Surface1794-on COADS Usages: Input for global atmospheric reanalysis Basic long term climate assessment and case studies

Operational and Composite Analyses Daily SLP is a small but very popular dataset, e.g. NAO evaluations Two main operational centers provide the best current analyses

Key Aspects Medium size archive – 170 Gigabytes multi-(product, temporal res., spatial res.) - complex Concerns; Restricted distribution U.S. non-profits and UCAR members only Need online authentication and authorization for easy access

Highlights Frequent updates to FNL, 1º, daily via FTP High resolution N. America product, ETA at 40km No distribution restrictions or cost

Reanalyses P.O.R# YrsIncep. Date NCEP/NCAR Reanalysis I / ECMWF ERA NCEP Reanalysis II / Notes: ERA-15 is finished, ERA-40 is running now NCEP II, primarily experimental run

Outstanding Features Three different coordinate surfaces Very long analysis, 2+ Terabytes size Unrestricted distribution CD-ROMS are very popular

Countries Receiving Reanalysis CDROMs Highlights Over 8900 CDROMs /2001 Recipients; U.S. 46%, Japan 11%, (Canada, UK) 4%, (Germany, India) 3%, (Australia, S.Korea, Spain, Mexico, Norway, Russia, France) 2%

Reanalysis Users for 2001 (4 th qtr estimated) 209 From the MSS [157 Jan.-Sep.] 47 On CDROM [35] 48 Custom data orders on FTP or Tape [36] 540 From the online server [406] 844 Total Served

Reanalysis Data Distributed for 2001 (4 th qtr estimated) 9616 GB from the MSS [7230 GB Jan.-Sep.] 808 GB On CD-ROM 1383 GB Custom orders, FTP and tape [1040] 88 GB From the online server [66 GB] GB, 11.9 TB Total

GCIP Model Data Center Collection High resolution atmospheric models focused on energy and hydrology cycles. GCIP: GEWEX Continental-Scale International Project / GEWEX : Global Energy and Water Cycle Exper. Critical data for N. American mesoscale studies Complete archive is about 1 Terabyte Eta –NCEP3 hr40 km 25 lvs 5/1995 – 7/2001 MAPS – FSL NOAA 3 hr40 km 5 lvs 8/ /2001 GEM – Canadian 6 hr41 km 28 lvs 4/1997 – 6/2001

Ocean Model Data MICOM; Miami Isopynic Coordinate Ocean Model, 1/12 th degree 70N to 28 S, layers COADS Clim. Forcing 6 yrs305 Gigabytes ECMWF Clim. Forcing 2 yrs164 Gigabytes ECMWF Daily Forcing 5 yrs415 Gigabytes ( ) University of Miami 6-yr Mean T at 5 meters

Dataset Sizes and Scales Today –~ 800 Unique users –~ 12 Terabytes data transferred –2 Terabyte dataset size –Example: NCEP/NCAR Reanalysis Near Future Excludes TB-PB Level 0 and 1 satellite and the super scale experimental models –Numbers of Users, ~ same –Data transferred, 5x to 10x more ? –Dataset size, 2-20 TB –Examples: Ocean and Atmosphere models ECMWF Reanalysis (ERA40)

Access to Data Methods NCAR computers –From the local MSS Web data server Custom data packages – by request (FTP, tape, CDROM) Users World class programmer Research Scientist Graduate Students Undergraduate Students

Data Access in the future Do we continue doing what we are doing? “Absolutely” Why? It Works –Over 1000 users annually Very diverse skills –The archive is a heterogeneous collection Many formats (ASCII, Binary, GrIB, BUFR, netCDF, HDF) Many sizes (1 MB to 2 TB) –Capable of serving large and small projects Maintain a variety of flexible methods

Data Access in the future Keys to handling future larger collections –Plan to create useful data products Condensed datasets from high resolution output Group most popular variables products together –Serve many, e.g. CDROMS and WWW –Continue to develop emerging online data systems User driven subset selection with graphics and data download options Server-side elementary analysis –Multi-dataset comparisons –Statistical summaries and basic meteorological calculations –Our development is the “Community Data Portal”

Data Analysis Tools –NCAR Command Language (NCL) software Features in brief –I/O for many ‘standard’ data formats –Easy adaptations to read any format –100’s meteorological functions –“Publication quality” graphics –The CDP is capable of analysis NCL is one of several middleware packages

Community Sharing Support for the scientist –A place to distribute new data results Possibly with authentication and authorization control E.g. model outputs –Spin off benefit New data resources for the archive Many users can then use new product

NCEP Operational Analyses blended with QSCAT Satellite data Wind Stress Curl, 01/24/ UTC a)NCEP Operational ONLY b)NCEP + QSCAT swaths c)OI blend of NCEP + QSCAT Blending by Colorado Research Associates We archive all three products. ab c

Key Steps of Scientific Investigations Formulate the questions and review the state of understanding Search and discover data Access data Analyzes data Community sharing and archive Document new understandings