Presentation is loading. Please wait.

Presentation is loading. Please wait.

Improving Data Catalogs with Free and Open Source Software Kevin O’Brien University of Washington Joint Institute for the Study of the Atmosphere and Ocean.

Similar presentations


Presentation on theme: "Improving Data Catalogs with Free and Open Source Software Kevin O’Brien University of Washington Joint Institute for the Study of the Atmosphere and Ocean."— Presentation transcript:

1 Improving Data Catalogs with Free and Open Source Software Kevin O’Brien University of Washington Joint Institute for the Study of the Atmosphere and Ocean Steven C Hankin – NOAA/PMEL Roland Schweitzer – Weathertop Consulting AGU Fall Meeting 2013

2 The Unified Access Framework (UAF) A Global Earth Observation Integrated Data Environment (GEO-IDE) project An attempt to improve scientific data management and access Focus on successes

3 Lots of data already available

4 What “success” did UAF chose to copy? Year 1 focused on gridded datasets. Service stack: netCDF-CF-DAP-THREDDS-WMS Projects: (too many to name) Data formats: netCDFGRIBHDF Applications: MatlabArcGISFerret GrADS Google Earth IDV LAS ERDDAP … Users: (too many to name) …

5 Developing the UAF Catalog Cleaner (a ‘web crawler’) ‘RAW’ ‘CLEAN’

6 Tree Crawl Dataset Crawl Cleaner CatalogRef and Dataset URL’s Raw catalog XML

7 Tree Crawl Dataset Crawl Cleaner url="http://cwcgom.aoml.noaa.gov/thredds/dodsC/OCEAN_GEOSTROPHIC_CURRENTS/CURRENTS.nc" url="http://cwcgom.aoml.noaa.gov/thredds/dodsC/GLOBAL_MONTHLY_CARBON_FLUXES/FLUXES.nc" url="http://cwcgom.aoml.noaa.gov/thredds/dodsC/GLOBAL_SEASON_CARBON_FLUXES/FLUXES.nc" url="http://cwcgom.aoml.noaa.gov/thredds/dodsC/ROMSMETEO/kk1.nc" url="http://cwcgom.aoml.noaa.gov/thredds/dodsC/MCI_GULF/kk1.nc" url="http://cwcgom.aoml.noaa.gov/thredds/dodsC/MSGSST/SST.nc" url="http://cwcgom.aoml.noaa.gov/thredds/dodsC/TERRA_K490_GULF/terrak490.nc" url="http://cwcgom.aoml.noaa.gov/thredds/dodsC/TERRA_K490_GULF_3D/terrak490.nc" url="http://www.esrl.noaa.gov/psd/thredds/dodsC/Datasets/NARR.dailyavgs/subsurface/soill.199910.nc" url="http://www.esrl.noaa.gov/psd/thredds/dodsC/Datasets/NARR.dailyavgs/subsurface/soill.199911.nc" url="http://www.esrl.noaa.gov/psd/thredds/dodsC/Datasets/NARR.dailyavgs/subsurface/soill.199912.nc" url="http://www.esrl.noaa.gov/psd/thredds/dodsC/Datasets/NARR.dailyavgs/subsurface/soill.200001.nc" url="http://www.esrl.noaa.gov/psd/thredds/dodsC/Datasets/NARR.dailyavgs/subsurface/soill.200002.nc" url="http://www.esrl.noaa.gov/psd/thredds/dodsC/Datasets/NARR.dailyavgs/subsurface/soill.200003.nc" url="http://www.esrl.noaa.gov/psd/thredds/dodsC/Datasets/NARR.dailyavgs/subsurface/soill.200004.nc" url="http://www.esrl.noaa.gov/psd/thredds/dodsC/Datasets/NARR.dailyavgs/subsurface/soill.200005.nc" url="http://www.esrl.noaa.gov/psd/thredds/dodsC/Datasets/NARR.dailyavgs/subsurface/soill.200006.nc" url="http://www.esrl.noaa.gov/psd/thredds/dodsC/Datasets/NARR.dailyavgs/subsurface/soill.200007.nc" url="http://www.esrl.noaa.gov/psd/thredds/dodsC/Datasets/NARR.dailyavgs/subsurface/soill.200008.nc". CatalogRef and Dataset URL’s

8 Tree Crawl Dataset Crawl Cleaner UAF Clean Catalog

9

10 How to provide feedback to data providers? Remember the “Building on Success” theme ncISO metadata assessment tool is very successful

11

12

13 How about a catalog quality assessment tool? How to provide feedback to data providers? Remember the “Building on Success” theme ncISO metadata assessment tool is very successful

14

15

16 Statistics for current catalog and all it’s children Links to rubric reports for child catalogs

17 Missing services Data issues

18 url

19 Data issues Original Catalog

20 Moving Forward…. Welcome feedback on rubric and Catalog Cleaner tool Change wording in rubric UAF master catalog to go beyond gridded files Use ERDDAP to including In Situ featureTypes Continue community outreach to improve catalogs

21 Thank you! UAF: geo-ide.noaa.gov Catalog Cleaner code and documentation: http://ferret.pmel.noaa.gov/LAS/documentation/the-uaf-catalog-cleaner/ THREDDS: www.unidata.ucar.edu/projects/THREDDS netCDF: www.unidata.ucar.edu/netcdf OPeNDAP: www.opendap.org CF: cf-pcmdi.llnl.gov Kevin.M.O’Brien@noaa.gov AGU Fall Meeting 2013


Download ppt "Improving Data Catalogs with Free and Open Source Software Kevin O’Brien University of Washington Joint Institute for the Study of the Atmosphere and Ocean."

Similar presentations


Ads by Google