Download presentation
Presentation is loading. Please wait.
Published byMark Burke Modified over 9 years ago
1
1 Unidata THREDDS*: Integrating Environmental Data into Digital Libraries * THematic Real-time Environmental Distributed Data Services Sponsored by the National Science Foundation http://www.nsf.gov http://www.nsf.gov Ben Domenico November 2003
2
2 Topics Traditional Unidata Approach –Mainly meteorological data –Subscription system pushes data to user sites –Unidata Program Center provides data analysis tools for use on data at user sites THREDDS Enhancements –Broader menu of Earth system data –Local client access from remote servers –Less arcane, more general and accessible tools –Integration of data and analysis tools into educational modules and digital libraries
3
3 Unidata Community Today More than160 institutions –Includes over 100 academic departments plus government agencies and private sector research groups –Does not count separate installations, e.g. Spanish weather service IDD, US Weather Service radar data system Interdisciplinary from the outset: 1996 survey showed over 2/3 of institutions had some uses outside meteorology (oceanography, hydrology, climatology, civil engineering, environmental science…)
4
4 Community Impact Survey Over 21,000 college students per year use Unidata tools and data in classrooms and labs Nearly 4,000 women/minority students More than 1,800 faculty and research staff Over 55,000 K-12 students involved through Unidata-connected university programs Informal education: in excess of 1 million hits at Unidata-based university web sites per day 97% of community report being satisfied or very satisfied
5
5 Principal Activities of the Unidata Program Center Facilitating Data Access to a broad spectrum of observations & forecasts (in near real time) Providing Tools to visualize, analyze, organize, receive, & share data at university sites Supporting Faculty who use Unidata systems at colleges & universities (most in the U.S.) Building and Advocating for a Community where data, tools, & best practices in education/research are shared
6
6 Traditional Unidata Data Types Individual observations from weather stations around the globe Satellite imagery Radar data from 160 NEXRAD radars Output from weather forecast model runs at the National Centers for Environmental Prediction Lightning strike data Measurements from sensors on commercial aircraft
7
7 1Km Radar Image
8
8 IDD: The Community in Action The Internet-based system by which universities acquire huge quantities of weather data in near-real time (i.e. ASAP) typifies Unidata’s community orientation. The system has no data center -- all tasks are performed on the participants’ own (small) computers. Currently the most used “advanced application” on the Abilene network (2-3% in terms of packets and bytes transferred)
9
9 Internet Data Distribution (IDD) with Multiple Sources (Injecting 17 Gigabytes per Day) Using LDM software for instant data relaying, ~160 institutions cooperate to acquire a wide range of real- time, global, atmospheric & oceanic observations, model outputs, remotely sensed images..., in a coordinated community effort.
10
10 IDD: Fanout from Source
11
11 Lightning, aircraft, GPSmet, etc. Unidata user running local analysis and display tools Decoders Typical Data Handling at a Unidata Site Unidata user running local analysis and display tools Local data decoded into application specific formats IDD Application specific protocols Decoders Forecast Model Output Weather station observations Satellite imagery Radar data
12
12 Thematic Data Servers (combining IDD “push” with several forms of “pull” and DL discovery) Local user applications: e.g., LAS, McIDAS, IDV, VGEE, IDL, MatLab... DLESE Digital Library for Earth-System Education Hydrology Data, e.g. Geophysical Data, e.g. Satellite Images, e.g. Satellite Imagery... Client/server data access protocols, e.g. OpenDAP, ADDE, WCS, FTP IDD DL interchange protocol IDD Discovery IDD
13
13 THREDDS THematic Real-time Environmental Distributed Data Services Connecting people, documents and dataPeopleDocuments Data
14
14 THREDDS Overview National Science Digital Library (NSDL) “collections” project Integrating real-time environmental data into –Online educational materials –Digital libraries (DLESE, NSDL) Two-year grant from NSF Department of Undergraduate Education (DUE) Second generation under negotiation Led by Unidata Program Center (UPC)
15
15 THREDDS Data Providers University of Alabama Huntsville (Sara Graves, Rahul Ramachandran, Steve Tanner, Ken Keiser) ARM (Atmospheric Radiation Measurement, Chris Klaus) CDC, the Climate Diagnostic Center (Roland Schweitzer) COLA, Center for Oceans Land Atmosphere (Joe Wielgosz) University of Florence (Stefano Nativi) GMU, George Mason University (Menas Kafatos and Ruixin Yang) IRI/LDEO, International Research Institute/Lamont Doherty Earth Observatory (Benno Blumenthal) ESG, the Earth System GRID (Luca Cinquini, NCAR/SCD) IRIS DMC, Incorporated Research Institutes for Seismology Data Management Center (Rob Casey) NCAR, the National Center for Atmospheric Research (Don Middleton) NCDC, the National Climatic Data Center (Ben Watkins) NGDC, National Geophysical Data Center (Ted Habermann) NOMADS,NOAA Operational Model Archive and Distribution System, (Glenn Rutledge, NCDC) University of Oklahoma (Kelvin Droegemeier) PMEL, the Pacific Marine Environment Laboratory (Steve Hankin) FNMOC, Fleet Numerical Meteorological and Oceanographic Center (Phil Sharfstein) SSEC, the Space Science and Engineering Center., U. of Wisconsin-Madison (Steve Ackerman, Tom Whittaker) Unidata Community ADDE servers (Tom Yoksas, Unidata Program Center) CIESIN (Consortium for International Earth Science Information Network, Bob Downs)CIESIN CUAHSI (Consortium of Universities for Advancement of Hydrologic Science, David Maidment)CUAHSI ESIG/NCAR (NCAR Environmental Societal Impacts Group, Bob Harriss)ESIG Earthscope (UCAR UNAVCO, Chuck Meertens)Earthscope GEON (GEOphysical Network, Chaitan Baru, UCSD San Diego Supercomputer Center)GEON ESRI GIS Community
16
16 THREDDS Analysis/Display Tool Builders Data Discovery Toolkit and Foundry based on EDMI (Earth Data Multimedia Instrument, New Media Studio, Bruce Caron). GDS, GrADS/DODS Server (COLA, Center for Oceans Land Atmosphere, Joe Wielgosz) IDV, Integrated Data Viewer (Unidata Program Center, Don Murray) INGRID (IRI/LDEO, International Research Institute/Lamont Doherty Earth Observatory, Benno Blumenthal) LAS, Live Access Server (PMEL, the Pacific Marine Environment Laboratory, Steve Hankin) VGEE, Virtual Geophysical Exploration Environment (NCAR, DLESE, U. of Illinois, Unidata, many collaborators) WXWISE Applets (SSEC, the Space Science and Engineering Center., U. of Wisconsin-Madison, Tom Whittaker) ESRI GIS Clients (ESRI, Inc., Jack Dangermond, President)ESRI OGC Clients (Open GIS Consortium, David Schell, President)OGC MyWorld (Northwestern educational GIS Client, Danny Edelson)MyWorld
17
17 THREDDS Interoperability Partners ADDE, Abstract Data Distribution Environment (University of Wisconsin – Madison, Tom Yoksas) DIMES, DIstributed MEtadata System (George Mason University, Ruixin Yang) DODS/OPeNDAP/Aggregation Server, Distributed Oceanographic Data System/Open source Project for a Network Data Access Protocol (University of Rhode Island, Unidata, Ethan Davis) DLESE, Digital Library for Earth System Education (Rajul Pandya) ESML, Earth System Markup Language (University of Alabama-Huntsville, Rahul Ramachandran) ESRI, Environmental Science Research Institute (various) GCMD, Global Change Master Directory (Gene Major) OGC and ISO Standards (University of Florence, Stefano Nativi) ADL (Gazetteer Services The University of California, Santa Barbara, Linda Hill and Michael Goodchild)ADL DLESE Evaluation Services (The University of Colorado CIRES, Susan Buhr)DLESE DLESE Data Services (Tamara Ledley)DLESE DLESE Program Center Digital Library for Earth System Education (Mary Marlino)DLESE ESRI (Jack Dangermond, President)ESRI OPeNDAP (The University of Rhode Island Open source Project for a Network Data Access Protocol -- formerly DODS, Peter Cornillon)OPeNDAP LAITS (Laboratory for Advanced Information Technology and Standards,Liping Di, George Mason University)LAITS NSDL Evaluation Services (University of Colorado, Tamara Sumner)NSDL OGC (Open GIS Consortium, David Schell, President)OGC SWEET (Semantic Web for Earth and Environmental Terminology, Rob Raskin)SWEET
18
18 Unidata’s Contributions A large, (inter)national, active, cooperative academic user community Coordination of many disparate contributors (universities, government agencies, digital libraries, commercial vendors, standards bodies…) Reliable, automated, real-time data systems Platform-independent 5D visualization with HTML document integration Basic inventory catalog generator and server software Client-side catalog access modules
19
19 Funding Sources Unidata 2003/2008 (NSF Atmospheric Science Division) THREDDS NSDL Collections Grant (NSF Department of Undergraduate Education) DODS/OPeNDAP (University of Rhode Island subcontract on Naval Ocean Partnership Program Grant and NASA Earth Science Enterprise) NWS/COMET Case Studies (NOAA NWS)
20
20PeopleDocumentsData
21
21 DocumentsPeopleData Well-developed connections –Document references –Embedded multimedia –Embedded interactive applets Powerful tools –Google –Dreamweaver –Web-site management tools –Web services The Web
22
22PeopleDocumentsData Discovery and Publication Tools Discovery and Publication Services
23
23 Documents People Data Data Access Technologies Web-based data interactions with passive gif images -- most analysis work done on remote server Traditional Unidata IDD with analysis on local clients Combinations with Web browse and FTP delivery for local analysis, Client/server, e.g., DODS/OPeNDAP All lack sophisticated, text- based Web search/discovery tools and coherent integration
24
24 People – Data Ad Hoc Tools/Services Traditional Unidata approach –IDD moves data to local network –McIDAS, GEMPAK, IDV (thick clients) –Most analysis work done on local client Web-based data interactions –Simple (passive) gif images –LAS, INGRID, GDS (thin clients) –Most analysis work done on remote server Combinations –Web browse/catalogs with FTP delivery/local analysis –Client/server (DODS/OPeNDAP, ADDE…) –Embedded data access applets (WXWISE) All lack sophisticated, text-based Web search/discovery tools and coherent integration
25
25PeopleDocumentsData Analysis and Visualization Tools Data Services Discovery and Publication Tools Discovery and Publication Services
26
26PeopleDocuments Data THREDDS is the Bottom line Associate words of the science with available datasets Create “compound” documents pointing to datasets Connect analysis tools to documents and datasets Wide range of compound documents –Lists of datasets available on server with brief description of dataset classes –Online publications pointing to datasets illustrating concepts Massive arsenal of Web and Digital Library search/discovery tools can be applied to compound documents
27
27 Documents – Data Connect Words and Datasets THREDDS primary focus –Associate words of the science with available datasets –Create “compound” documents pointing to datasets –Connect analysis tools to documents and datasets Wide range of compound documents –Lists of datasets available on server with brief description of dataset classes –Online publications pointing to datasets illustrating concepts Massive arsenal of Web and Digital Library search/discovery tools can be applied to compound documents
28
28PeopleDocumentsData Analysis and Visualization Tools Data Services Discovery and Publication Tools Discovery and Publication Services Catalog Generation Tools Data Catalog Services
29
29PeopleDocumentsData Catalog Generation Tools Analysis and Visualization Tools Data Services Discovery and Publication Tools Discovery and Publication Services Data Catalog Services THREDDSMiddleware
30
30 Remote Catalog Query and Data Access via Local Analysis Tool User accesses remote catalog via local analysis tool User accesses remote dataset via local analysis tool PeopleDocumentsData Catalog Generation Tools Analysis and Visualization Tools Data Services Discovery and Publication Tools Discovery and Publication Services Data Catalog Services THREDDSMiddleware
31
31 Basic Compound Document THREDDS Server Inventory Catalog Inventory list of datasets on server Generated automatically with minimal human input Viewed from within analysis and display application Can be harvested for inclusion in GCMD, DLESE, NSDL for use by module builders
32
32 Compound Publication: Educational Module within Interactive Analysis Tool Discovery at DLESE module at DPC VGEE tool at Unidata Datasets at NCAR Lends itself well to Web discovery tools, DL integrationDL integration Can be: –education module –online scientific publication
33
33 Browser-base Thin Client Access LDEO/IRI web site publishes catalog of datasets available on server at UCAR Catalog resides and is updated at UCAR Browsing of datasets on UCAR server from LDEO server Also enables analysis and display of datasets on UCAR server using tools on LDEO server
34
34 Enhanced Metadata Catalog
35
35 DLESE Search
36
36 Search Results
37
37 Interactive Lesson
38
38 Lesson with Data Tool Loading
39
39 Interactive Tool Loads with Data from Server
40
40 Other Server Data is Accessible
41
41 Vis Tool with Concept Models
42
42 Stepwise creation of third-party enhanced catalogs/case studies Begin with basic inventory catalog Crawler traverses datasets listed in basic catalog and adds location “bounding box” to location-enhanced catalog Gazetteer service examines location-enhanced catalogs to create a catalog of datasets associated with named region on Earth Evolve to “event” gazetteer with 5-dimensional bounding box (e.g., model output datasets related to “Storm of the Century” with vorticity above a threshold – a distributed case study)
43
43 Enhanced Catalogs Weather Obs IDD Model output IDD Inventory Catalog Generator ESML or NCML Generator Event Gazetteer IDD Data Mining Engine Third Party THREDDS Catalog Server THREDDS Data Server THREDDS Inventory Catalog and Data Server Enhanced Inventory Catalog and Data Server Enhanced Catalogs Case Study Catalog Digital Library Catalog System Data Catalog Harvesting
44
44 ISCCP Collection Metadata
45
45 Future Directions Standards-based web services approach to providing both data and metadata Integrate GIS clients and servers into THREDDS for access to societal impacts, infrastructure, hydrology data, etc. Work with OGC and ISO to incorporate emerging standard access protocols into THREDDS Actively participate in future DLESE Data Access Working Group and Data Services workshops to create more compound document educational module.
46
46 THREDDS, GIS, DL Interoperability GIS Client Applications THREDDS Client Applications OpenGIS Protocols: WMS, WFS, WCS OGC or proprietary GIS protocols OGC or OPeNDAP ADDE. FTP… protocols GIS Server GIS Servers Demographic, infrastructure, societal impacts, … datasets THREDDS Server THREDDS Servers Satellite, radar, forecast model output, … datasets Digital Library Discovery Systems Metadata crosswalk Open Archives Initiative (OAI) Metadata Harvesting Metadata crosswalk
47
47 Summary Universities have used Unidata tools to acquire, analyze, and display real-time atmospheric data for nearly 20 years THREDDS – along with related client/server access and display technologies-- makes an even broader menu of Earth system data to a more diverse community of users THREDDS technologies enable the creation of compound educational modules and scientific publications with embedded pointers to datasets and tools.
48
48 Data System Emphases Constant, real-time data streams –Dozens of source classes –~10 products per second –Up to 2 GB/hour Discovery centers –GCMD (DIF metadata) –DLESE (ADN metadata) –NSDL (DC metadata) Forecast model output is central –Future time –Time relative to present
49
49 ADEPT/DLESE/NASA ADN Metadata Title - the name of the resource URL or access information - the url to an online resource or access information to a physical object Description - a narrative describing the content, purpose or goal of the resource Subject - general topic areas that the resource is about or covers Technical requirements - information related to platform requirements, browsers and plug-ins, etc. Resource type - an indication to the type of educational resource, such as lab exercises and tutorials, etc. Audience - grade range of the resource Copyright - copyright statement and any other restricted usage or lack thereof about the cataloged resource Cost - indication as to whether there is a cost associated with accessing or using the resource Resource creator - contact information for the author or publisher of a resource Resource cataloger - contact information for the cataloger of a resource
50
50 Forecast Model Output: Jet Stream Winds with Surface Temp
51
51 Integrated Analysis and Display Local analysis and display tools Datasets on distributed remote servers Client/server, web services access to –Metadata –Datasets Moving to Open GIS protocols
52
52 Combined Data Sources
53
53 Integrated Data Visualization Client 3D radar reflectivity from NCAR server via DODS protocol Visible 1K satellite image from Wisconsin SSEC via ADDE protocol Balloon sounding temperature profile from local disk delivered automatically in real-time via IDD Different sources, protocols, resolutions, time-scales
54
54 Need Computer “Use” Metadata Knowledge of data structure is necessary Requires semantic information, e.g., –Standard metadata and data access protocols –Standard quantities –Standard units of measure –Connection to controlled vocabularies/ontologies XML markup languages –NcML (NetCDF markup language) –ESML (Earth Science markup language) –GML (Geography markup language)
55
55 More Information http://my.unidata.ucar.edu/ http://www.unidata.ucar.edu/projects/THREDDS/ ben@unidata.ucar.edu
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.