Download presentation
Presentation is loading. Please wait.
Published byBrenden Clowers Modified over 10 years ago
1
Mark Servilla & Duane Costa LTER Network Office LTER 2012 All Scientist Meeting LTER Network Office
2
Why LTER Data Co-op? A Diamond in the Rough Demonstrations How can I contribute data? How do I find data? How can I see who is using my data? How is Network synthesis enabled? How is provenance captured? Where do we go from here? Panel Discussion
3
LTER Network Office Its about community A cooperative … is an autonomous association of persons who voluntarily cooperate for their mutual social, economic, and cultural benefit. - Wikipedia Producers – LTER sites Middleware - PASTA Consumers – Science Community
4
LTER Network Office
5
Data producers can evaluate their data package prior to harvesting into PASTA Data packages are discovered via browsing and/or search tools Derived data may be generated when a data package insert or update event occurs Provenance metadata can be generated for derived data packages Data package use information is viewed by a contributor LTER Network Office
6
LTER Network Data Portal portal.lternet.edu
7
PASTA Web Service API
8
Subcomponent of the Data Package Manager component in PASTA Generates a quality report for each data package A quality report contains a set of quality checks Stored as XML but usually rendered in HTML for human readability 27 quality checks implemented in the NIS prototype (of 52 proposed by EML Metrics Working Group) Available to the greater ecoinformatics community via the Data Manager Library (ecoinformatics.org) LTER Network Office
9
An individual metric or a best practice May involve looking at: metadata (independent of data), or data (independent of metadata), or congruency between metadata and data Can result in one of four statuses valid info warn error LTER Network Office
10
Users can evaluate data packages before inserting them into PASTA An error status reported by any quality check blocks insertion of the data package into PASTA Every data package stored in PASTA has a quality report that can be accessed along with its metadata and data LTER Network Office
11
Data Package Quality Report
12
Evaluate Runs quality checks on the data package but doesnt insert it into PASTA May reveal more diagnostic information (as compared to harvest) because it doesnt necessarily halt after encountering the first error Harvest Runs quality checks on the data package; if no errors are discovered, inserts (or updates) the data package into PASTA May reveal less diagnostic information (as compared to evaluate) because it may halt as soon as an error is encountered Bottom line: Always evaluate before harvesting! LTER Network Office
13
EML is version 2.1.0 or beyond Document is schema-valid EML Document is EML parser-valid All entity-level data URLs are live The packageId pattern matches scope.identifier.revision There are no duplicate entity names An entity-level URL which is not set to information returns data Data table does not have more fields than metadata attributes Data table does not have fewer fields than metadata attributes Database table can be created from EML metadata Field delimiter in metadata is a single character Document is schema-valid after dereferencing enumeratedDomain codes are unique (not yet implemented) LTER Network Office
14
Data can be loaded into the database Length of entityName is not excessive A methods element is present Record delimiter is present in metadata Data examined and possible record delimiters returned Number of records in metadata matches number of rows loaded At least one keyword element is present Dataset title length is at 5 least words Dataset abstract element is a minimum of 20 words...others not yet implemented LTER Network Office
15
Display downloaded data Display first insert row coverage element is present temporalCoverage element is present geographicCoverage element is present taxonomicCoverage element is present...others not yet implemented LTER Network Office
17
Data producers can evaluate their data package prior to harvesting into PASTA Data packages are discovered via browsing and/or search tools Derived data may be generated when a data package insert or update event occurs Provenance metadata can be generated for derived data packages Data package use information is viewed by a contributor LTER Network Office
18
North Inlet Meteorological – Air Temperature Yearly aggregation of data Down-sample Hourly to Daily and Monthly LTER Network Office 1982 1983 1984 1983 1984 1992 … … 1. 2. 3. 11.
19
LTER Network Office PASTA NIN Workflow NIN Workflow Source Data
20
LTER Network Office PASTA NIN Workflow NIN Workflow Notify
21
LTER Network Office PASTA NIN Workflow NIN Workflow Request Data
22
LTER Network Office PASTA NIN Workflow NIN Workflow Source Data
23
LTER Network Office PASTA
24
LTER Network Office PASTA NIN Workflow NIN Workflow Derived Data
25
Subscribe to a Data Package event
26
LTER Network Office
27
Source Data Package Derived Data Package Workflow Description
28
Provenance Metadata
30
LTER Network Office
33
LTER Network Office
35
December 2012 Support DOI assignment to metadata and data objects Refine NIS Data Portal Complete metadata rendering Improve catalog browsing Hang out shingle Summer 2013 Standup DataONE member node
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.