Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section.

Slides:



Advertisements
Similar presentations
Conversion of CPC Monitoring and Forecast Products to GIS Format Viviane Silva Lloyd Thomas, Mike Halpert and Wayne Higgins.
Advertisements

Data management in SCD Steven Worley General Categories –The Mass Storage System –NCAR user file services (home directories) –Computer attached storage.
SCD Research Data For UCAR Data Management Working Group January 10, 2001 Steven Worley Scientific Computing Division Data Support Section.
ICOADS Archive Practices at NCAR JCOMM ETMC-III 9-12 February 2010 Steven Worley.
1 of 2 Microsoft ® SharePoint ® Sites and Workspaces Windows SharePoint Services enable information storage, display, and collaboration by allowing you.
The International Surface Pressure Databank (ISPD) and Twentieth Century Reanalysis at NCAR Thomas Cram - NCAR, Boulder, CO Gilbert Compo & Chesley McColl.
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System 1 Zaihua Ji Doug Schuster Steven Worley Computational.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Introduction Downloading and sifting through large volumes of data stored in differing formats can be a time-consuming and sometimes frustrating process.
Scientific Investigations; Support from Research Data Archives for Computing in Atmospheric Sciences October, 2001 Steven Worley National Center.
U.S. Surface Archives Sent to China ( ) 7 th PRC-U.S. Joint Coordination Panel for Data and Information Cooperation 29 Nov. – 1 Dec, 2000 Steven.
October 16-18, Research Data Set Archives Steven Worley Scientific Computing Division Data Support Section.
Tutorial 1: Getting Started with Adobe Dreamweaver CS4.
15 Maintaining a Web Site Section 15.1 Identify Webmastering tasks Identify Web server maintenance techniques Describe the importance of backups Section.
Digitization of the Federal Depository Library Program Judith C. Russell Superintendent of Documents & Managing Director, Information Dissemination “Electronic.
Growing and Future Datasets in the SCD Research Data Archives for NSF SCD Review Panel 16 October 2001 Steven Worley Scientific Computing Division Data.
Data for Climate and Energy Studies Steven Worley Computational and Information Systems Laboratory NCAR.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Scientific Investigations; Support from Research Data Archives for Joint Office for Science Support 26 February, 2002 Steven Worley SCD/DSS.
Strategy for ECM in a Decentralized Organization Beth Franssen, Electronic Corporate Marketing Consultant to Hines
Scientific Computing Division Trends and Directions of Mass Storage in the Scientific Computing Arena CAS 2001 Gene Harano National Center for Atmospheric.
Archive and Access Practices that Support Data Reuse and Transparency Steven Worley Doug Schuster Bob Dattore National Center for Atmospheric Research.
Data Access to Marine Surface Observations and Products from COADS 29 January, 2002 Steven Worley National Center for Atmospheric Research.
Improved Access to RDA from the MSS OSD Executive Meeting April 28, 2009.
Analyzed Data Products Available from NCAR that Support Marine Climate Research JCOMM ETMC-III 9-12 February 2010 Steven Worley Doug Schuster.
Using the Global Change Master Directory (GCMD) to Promote and Discover ESIP Data, Services, and Climate Visualizations Presented by GCMD Staff January.
Data Discovery and Access to The International Surface Pressure Databank (ISPD) 1 Thomas Cram Gilbert P. Compo* Doug Schuster Chesley McColl* Steven Worley.
Content, Discovery, and Accessibility Enhancements to the NCAR Research Data Archive Doug Schuster and Steve Worley NCAR.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
RAINEX Data Management UCAR Joint Office for Science Support José Meitín Jim Moore Dick Dirks UCAR Joint Office for Science Support José Meitín Jim Moore.
RDA Data Support Section. Topics 1.What is it? 2.Who cares? 3.Why does the RDA need CISL? 4.What is on the horizon?
OWL Representing Information Using the Web Ontology Language.
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
TIGGE Archive Status at NCAR THORPEX Workshop and 6th GIFS-TIGGE Working Group Meetings WMO Headquarters Geneva September 2008 Steven Worley Doug.
SCD Research Data Archives; Availability Through the CDP About 500 distinct datasets, 12 TB Diverse in type, size, and format Serving 900 different investigators.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
The Research Data Archive at NCAR: A System Designed to Handle Diverse Datasets Bob Dattore and Steven Worley National Center for Atmospheric Research.
Research Data Archive (RDA) Access and Services from Yellowstone Grace Peng and Doug Schuster 1.
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
International Oceanographic Data and Information Exchange - Ocean Data Portal (IODE ODP) Enabling science through seamless and open access to marine data.
Data Discovery and Access to The International Surface Pressure Databank (ISPD) 1 Thomas Cram Gilbert P. Compo* Doug Schuster Chesley McColl* Steven Worley.
Distributed Data Servers and Web Interface in the Climate Data Portal Willa H. Zhu Joint Institute for the Study of Ocean and Atmosphere University of.
RDA Data Support Section. Topics 1.What is it? 2.Who cares? 3.Why does the RDA need CISL? 4.What is on the horizon?
Where/how could we change the overall process of field project implementation to improve in our mission of answering key science questions? Are we open.
SCD Research Data for Ocean Observatories Steering Committee June 18, 2001 Steven Worley Scientific Computing Division Data Support Section.
From Missions to Measurements: an Ocean Discipline Experience.
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
Web Design Vocabulary #3. HTML Hypertext Markup Language - The coding scheme used to format text for use on the World Wide Web.
1. Gridded Data Sub-setting Services through the RDA at NCAR Doug Schuster, Steve Worley, Bob Dattore, Dave Stepaniak.
Introduction What purpose does a data archive center serve if users can’t find or access the holdings they might need to facilitate their research discoveries?
Web Page Programming Terms. Chapter 1 Objectives Describe Internet and Understand Key terms Describe World Wide Web and its Key terms Identify types and.
E-Business Infrastructure PRESENTED BY IKA NOVITA DEWI, MCS.
TIGGE Archives and Access
TIGGE Data Archive and Access System at NCAR
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System Zaihua Ji Doug Schuster Steven Worley Computational.
Information Technology Ms. Abeer Helwa
Development and Futures of Research Data Archives
Research Data Archives at NCAR
Steven Worley, NSF/NCAR/SCD
Steven Worley, Douglas Schuster,
CISL’s Research Data Archive (RDA) : Description and Methods
Comeaux and Worley, NSF/NCAR/SCD
Long-Lived Data Collections
Data Management Components for a Research Data Archive
Robert Dattore and Steven Worley
Successful Data Curation for Large Data Archives
Data Curation in Climate and Weather
Comeaux and Worley, NSF/NCAR/SCD
Presentation transcript:

Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section

Outline Archive building; Basic ingredients Proactive content development Support for NCAR/UCAR/Universities Access to the data

Archive Building Collect, maintain, and archive data –Data types Observations Analyses Reanalyses Model output data Staff – data stewards – SE’s educated in meteorology and oceanography –Activities Quality check the information Write discovery metadata Provide human based consulting

Proactive Content Development NCAR has a privileged position –NOT restricted by “Agency” mission demands e.g. “You must archive Level 1 MODIS data!” –Remain flexible to meet research demands –Choose efficient and cost effective methods

Proactive Content Development Example of being proactive –NCAR’s in situ observations collections are the most complete in the world Long term dedication – 35+ years Involved many special data exchanges (national and international) –Add key data not available in the standard archives Achievement widely recognized –US Academy of Science – NCEP and ECMWF

Challenges detect and repair, or identify erroneous data rectify historical format differences to create a user friendly “best” composite collection To meet the challenges have the knowledgeable staff need staff time to develop the graphics and analysis software

Proactive Content Development Many other examples – Atmospheric upper air data – Marine surface data –Etc.

Proactive Content Development Importance of these collections –Basis for research data products, e.g. NCEP/NCAR Reanalysis NCEP DOE Reanalysis II ERA40 (processing now at ECMWF) NCEP Regional Reanalysis (new)

Countries that have received NCEP/NCAR Reanalysis from SCD CD-ROMS network downloads tape media copies

Proactive Content Development Challenges of Reanalyses Products –Product Size Current sets, O( 2 TB) New sets, O( 20 TB) –Many users, O( 100’s/year) Not a problem in NCAR computing environment External access is the greatest challenge (more later)

Support for Users The Users are; – NCAR/UCAR scientists – UCAR universities scientists and students – International scientists and students – Gov. agencies and private corporations Access through; – SCD MSS – Customized requests – Online data servers (ftp, and browser) – Community Data Portal (CDP)

–CDP Features – Real time data access – 4-D dataset subsets – multiple output formats – data analysis Challenges –Develop access to large (TB) datasets »Sequential / parallel data processing »Linkages to SANS –Integration with other extant data systems To meet the challenge –Continue to improve our servers, storage networks, and network connections –Staff with expertise and dedicated time

Support for the Users Search and discover data How?  Web based Information Server Features –5K+ html pages (metadata) –All datasets are described –Access options –Higher level information Catalogs Project specific descriptions –All information is current Based on text files and change control system Automatic re-build of affected.html pages

Support for the Users Search and discover data Next Step for Data Discovery –Need a metadata descriptions in a ‘standard format’ –Enable organization wide discovery (eventually nation wide or global) –UCAR has DMWG on metadata Likely scenario – based meta standard –Well poised to based on.html experience

Take Home Key Points Long term involvement in archive building National and International collaborations Focus on scientific data content, documentation, and access stay in tune with scientific needs remain flexible so we can accommodate the needs promote data exchanges that enhance the archive distribute data at minimal or no cost provide data to large projects, e.g. NN Reanalysis have good data and metadata systems now more technology and software to enhance the access Proactive development stewardship (collecting, maintaining, improving)