Data Management for CENS Stasa Milojevic Information Studies UCLA.

Presentation transcript:

Data Management for CENS
Stasa Milojevic, Information Studies, UCLA

CENS Data
- CENS will generate massive amounts of heterogeneous scientific and technical data from the sensors.
- The data need to be useful for CENS researchers:
  - Real time
  - Archived
- The data also need to be useful for other researchers in those problem domains (the larger community).

Data Management: Goals
- Data + Metadata: share with community
- Example metadata record: PLT-GCEM-0311b.1.0. Fall 2003 plant monitoring survey -- biomass calculated from shoot height and flowering status of plants in permanent plots at GCE sampling sites. Georgia Coastal Ecosystems LTER Project, Dept. of Marine Sciences, University of Georgia, Athens, Georgia, USA.

How to make data useful and usable?
- One data model for all of CENS
  - Not likely; that presumes that all science problems are the same
- One data model for each CENS research area
  - More promising approach
  - Various scientific communities have agreed on common models

Seismology
- Seismic data have been collected via digital instruments for over 30 years.
- There are robust and stable standards for describing seismic data across systems and data formats (SEED, the Standard for the Exchange of Earthquake Data).
- Consortia centralize and disseminate seismic datasets:
  - IRIS (Incorporated Research Institutions for Seismology)
  - NEES (Network for Earthquake Engineering Simulation)

Habitat Monitoring
- Habitat monitoring research:
  - Draws upon multiple disciplines and technologies
  - Integrates data across a wide range of ecological scales (chemistry, physiology, ecology, and environment)
- Available testbeds include an embedded microclimate sensor network and an embedded phenology network (including wildlife and plant monitoring)
- Habitat monitoring data:
  - Temperature, moisture, and barometric pressure
  - Video data

James Reserve and habitat monitoring community
Why did we start with this community?
- It is one of the initial CENS sensor deployments
- The project is at an early stage of defining data and metadata requirements
- Data from this project are being used as the basis for our initial inquiry learning research in CENS

Ecological Metadata Language (EML)
- XML-based standard, developed by and for the ecological community
- Divided into modules such as eml-access, eml-attribute, eml-project
- Describes data, literature, software, products
- Not well optimized for sensor data
- Optimized for describing data and not the derivation of data
- Uses the Morpho client, a cross-platform application, for creating and organizing data and metadata, either locally or on a shared network server

Ecological Metadata Language (EML) - GCE
Study Site GCE1 -- Eulonia, Georgia, USA. Transitional salt marsh/upland forest site at the upper reach of the Sapelo River near Eulonia, Georgia. The main marsh area is to the north of the channel where the upland is controlled by DNR. Several small creeks lie within the study area. Residential development is increasing on the upland areas south of the channel. A hydrographic sonde is deployed within this site, attached to a private dock to the south of the main channel near the HW-17 bridge.
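As a rough illustration of how a site description like the one above could sit inside an XML metadata record, here is a minimal Python sketch. The element names (coverage, geographicCoverage, geographicDescription) are modeled on EML's coverage module but are an assumption here: they have not been validated against the EML schema, and a real record would carry more elements, such as bounding coordinates. The function name site_coverage is hypothetical.

```python
import xml.etree.ElementTree as ET


def site_coverage(description):
    """Wrap a free-text site description in nested coverage elements.

    Element names are assumed, modeled on EML's coverage module;
    this is not a schema-valid EML document.
    """
    coverage = ET.Element("coverage")
    geographic = ET.SubElement(coverage, "geographicCoverage")
    ET.SubElement(geographic, "geographicDescription").text = description
    return coverage


# First sentences of the GCE1 study site description from the slide.
description = (
    "GCE1 -- Eulonia, Georgia, USA. Transitional salt marsh/upland forest "
    "site at the upper reach of the Sapelo River near Eulonia, Georgia."
)
record = site_coverage(description)
print(ET.tostring(record, encoding="unicode"))
```

Keeping the description as plain text inside a named element is what lets a catalog search or harvest it without knowing anything else about the dataset.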

Describing Instruments
Sensor Model Language (SensorML)
- Emerging OpenGIS standard for describing sensors and sensor data
- Developed to support data discovery, data processing and geolocation
- Can be used for in-situ or remote sensors, dynamic or static platforms
- Optimized for large sensors and large platforms
- Describes resources for sensor management and discovery, but not sensor-derived data

Sensor Model Language (SensorML)
[Diagram: a Sensor element and its properties: identifiedAs, documentConstrainedBy, measures, operatedBy, attachedTo, locatedUsing, describedBy, documentedBy, hasCRS]

Science and Education
- We need to make the science data useful for teaching grade 6-12 science.
- This is a problem because the scientific models describe the data, while the education models describe lessons (grade level, instruments required for the lesson, time required to perform the lesson, educational standards, etc.).

Science and Education Data Models

Metadata for sensor data for habitat monitoring

CENS Schema | SensorML | EML 2.0
CENS_Node.Node_Name (name of node) | Sml:IdentifiedAs (2.2.2) | -
CENS_Node.Node_Desc (description of node) | AssetDescription: sml:description (2.2.12) | -
CENS_Location.Location_ID (unique location ID) | CrsID (2.2.5) | Eml-Coverage (2.4.4)
CENS_Location.X_Pos (position on X axis) | HasCRS (2.2.5); ObjectState (3.3.6) | Eml-Coverage-GeographicCoverage (2.4.4)
CENS_Location.Time_Recorded (time location was captured) | - | Eml-Coverage-TemporalCoverage (2.4.4)
CENS_Location.Time_Type_ID (refers to type of time of Time_Type ID table) | - | Eml-Coverage (2.4.4)

Metadata for education modules for habitat monitoring

LOM | GEM | ADN
Educational-Typical Age Range (5.7) | Audience-Age | Audience
Life Cycle-Contribute (2.3) | Creator | Resource Creator
General-Coverage (1.6) | Coverage-Spatial, Temporal | Coverage (spatial and temporal)
Life Cycle-Date (2.3.3); DateTime (8) | Date | Creation date; Accession date
General-Description (1.4) | Description | Description
Educational (5) | Pedagogy | Educational
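A mapping like the one on this slide can be held as a small lookup structure, which makes it easy to ask where a CENS field has no counterpart in a given standard. The Python sketch below is illustrative only: the three rows are copied from the sensor-data half of the slide as read from the flattened transcript, and the names CROSSWALK, translate and unmapped are hypothetical, not part of any CENS software.

```python
# Crosswalk rows copied from the sensor-data half of the mapping slide.
# None marks a cell that appears to be empty on the slide.
CROSSWALK = {
    "CENS_Node.Node_Name": {
        "SensorML": "Sml:IdentifiedAs (2.2.2)",
        "EML 2.0": None,
    },
    "CENS_Location.Location_ID": {
        "SensorML": "CrsID (2.2.5)",
        "EML 2.0": "Eml-Coverage (2.4.4)",
    },
    "CENS_Location.Time_Recorded": {
        "SensorML": None,
        "EML 2.0": "Eml-Coverage-TemporalCoverage (2.4.4)",
    },
}


def translate(field, standard):
    """Return the element that `field` maps to in `standard`, or None."""
    return CROSSWALK.get(field, {}).get(standard)


def unmapped(standard):
    """List CENS fields that have no counterpart in `standard`."""
    return [
        field
        for field, targets in CROSSWALK.items()
        if targets.get(standard) is None
    ]


print(translate("CENS_Location.Location_ID", "EML 2.0"))
print(unmapped("SensorML"))
```

Listing the unmapped fields for each standard is one simple way to see how well a standard fits the existing data structures.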

Science and Education Data Models: Possible Solution
- Manage scientific data with models appropriate to the scientific community
- Construct filters and tools to make scientific data useful to K-12 students and teachers:
  - Reduce granularity of data (e.g. temperature at hourly, rather than minute intervals)
  - Develop tools to display these data (e.g. simple charts and graphs)
- Describe filters and tools using models appropriate to the educational community (e.g. LOM, SCORM, GEM)
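The granularity filter mentioned above (temperature at hourly rather than minute intervals) could be sketched as follows. This is a minimal Python example under stated assumptions: readings arrive as (timestamp, value) pairs, the hourly value is a simple mean, and the function name hourly_means and the sample readings are hypothetical rather than taken from CENS.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean


def hourly_means(readings):
    """Average (timestamp, value) readings into one value per clock hour."""
    buckets = defaultdict(list)
    for timestamp, value in readings:
        # Truncate each timestamp to the start of its hour.
        hour = timestamp.replace(minute=0, second=0, microsecond=0)
        buckets[hour].append(value)
    return {hour: mean(values) for hour, values in sorted(buckets.items())}


# Hypothetical temperature readings in degrees Celsius.
readings = [
    (datetime(2004, 6, 1, 9, 0), 18.0),
    (datetime(2004, 6, 1, 9, 30), 20.0),
    (datetime(2004, 6, 1, 10, 15), 22.0),
]
for hour, average in hourly_means(readings).items():
    print(hour.isoformat(), average)
```

The mean is only one choice of summary; a lesson on daily extremes might instead keep the hourly minimum and maximum.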

Science and Education Data Models: Possible Solution
Sets of data collected are run through filters and tools to produce understandable tables, charts and graphs.

Current accomplishments and next steps
James Reserve:
- Map current data structures to EML and SensorML to determine the fit
- Analyze scientific papers and documents to determine required data elements
- Create use scenarios
- Interview scientists

Current accomplishments and next steps
Education:
- Work with inquiry module team to identify data requirements
- Interview teachers

Discussion and Conclusions
Ensuring accessibility and integrity of CENS data for multiple communities requires:
- Understanding of the practices of each community
- Understanding of relationships between those practices
- Means to bridge the gaps

Acknowledgements
Christine Borgman, Andrew Wu, Bill Sandoval, Noel Enyedy, Joe Wise, Mike Wimbrow