Reportnet standards and next steps Søren Roug, Information and Data Services (IDS)

Presentation transcript:

Use of Standards
Historically, Reportnet has been targeted towards the web browser. It is a standard called REST.
You can upload any file to CDR.
Communication between sites is done with XML-RPC.
Transfer of metadata uses RDF (Semantic Web).
Reportnet does not use:
  Service Oriented Architecture (SOA)
  SOAP
  INSPIRE

Introduction of XML (a standard for file formats)
In 2004 Reportnet started to give preferential treatment to XML.
One single requirement: that the XML file has a schema identifier.
From this we can:
  Run QA scripts using the XQuery language
  Convert to other formats using XSL-T
  Edit the XML content using XForms for webforms
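The schema-identifier requirement can be illustrated with a minimal Python sketch. It reads the identifier from the root element and looks up which services apply to it. The schema URL, the `SERVICES` registry and the script names are hypothetical and only serve the illustration; Reportnet itself runs XQuery and XSL-T, not this code.

```python
import xml.etree.ElementTree as ET

# Namespace of the standard XML Schema instance attributes
XSI = "http://www.w3.org/2001/XMLSchema-instance"


def schema_identifier(xml_text):
    """Return the schema identifier declared on the root element, or None."""
    root = ET.fromstring(xml_text)
    # A document may name a single schema without a namespace ...
    no_ns = root.get(f"{{{XSI}}}noNamespaceSchemaLocation")
    if no_ns:
        return no_ns
    # ... or list "namespace location" pairs; take the first location.
    pairs = (root.get(f"{{{XSI}}}schemaLocation") or "").split()
    return pairs[1] if len(pairs) >= 2 else None


# Hypothetical registry: schema identifier -> services available for it
SERVICES = {
    "http://example.org/schemas/stations.xsd": {
        "qa": "stations-qa.xquery",
        "convert": "stations-to-html.xsl",
    },
}

# Hypothetical delivery; element names are assumptions
sample = """<stations xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xsi:noNamespaceSchemaLocation="http://example.org/schemas/stations.xsd">
  <station><name>St. Pölten</name></station>
</stations>"""

schema = schema_identifier(sample)
print(schema, SERVICES.get(schema))
```

A file without a schema identifier yields `None`, so it would simply be stored as an ordinary upload without QA or conversion.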

2008 focus
Integration of national repositories into Reportnet
Guidelines on:
  How to implement a Reportnet/SEIS node
  Use of QA service from national node
  Use of conversion service from national node
Registration of datasets:
  Via a manifest file
  Via manual registration at website

2009 focus (next steps)
How to register the datasets
How to search for the datasets
How to track updates to the datasets
How to bookmark found datasets
How to merge datasets
How to trust the dataset
How to trust the trust

Registering a SEIS dataset
Discovered via manifest files and manual registration

Adding metadata

Bookmarking and searching the dataset

Working with files vs. records
Now we know where the files are in the SEIS universe.
But we can do more: we can read the content of XML files.
Example of an XML snippet:
<stations xmlns:xsi= xsi:noNamespaceSchemaLocation=" St. Pölten Industrial urban...

Merging principles
Station structure as a table (austria.xml):

Identifier  local_code  name        ...
#32301      32301       St. Pölten  ...
#32302      32302       Linz        ...

Quadruple structure:

Subject  Predicate   Object         Source
#32301   type        River Station  austria.xml
#32301   local_code  32301          austria.xml
#32301   name        St. Pölten     austria.xml
#32302   type        River Station  austria.xml
#32302   local_code  32302          austria.xml
#32302   name        Linz           austria.xml
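The step from table rows to quadruples can be sketched in a few lines of Python. The function name `row_to_quads` and its parameters are illustrative assumptions; in the presentation the conversion is done with an XSL transformation.

```python
def row_to_quads(row, source, id_field="Identifier", row_type="River Station"):
    """Turn one table row into (subject, predicate, object, source) quadruples."""
    subject = row[id_field]
    # Every subject gets a type quadruple first
    quads = [(subject, "type", row_type, source)]
    for column, value in row.items():
        # Skip the identifier itself and empty cells
        if column != id_field and value not in (None, ""):
            quads.append((subject, column, value, source))
    return quads


# The two rows shown for austria.xml
rows = [
    {"Identifier": "#32301", "local_code": "32301", "name": "St. Pölten"},
    {"Identifier": "#32302", "local_code": "32302", "name": "Linz"},
]

quads = [q for row in rows for q in row_to_quads(row, "austria.xml")]
for quad in quads:
    print(quad)
```

Because every value carries its source, a later step can still decide which file to believe when two files disagree.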

Merging the datasets
Austria Stations.xml, Belgium Stations.xml, Germany Stations.xml
-> XSL Transformation to quadruples -> Aggregation Database

Subject  Predicate  Object      Source
#32301   name       St. Pölten  Au..xml
#30299   name       Gent        Be..xml
#42882   name       Köln        Ge..xml
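As a rough sketch of what an aggregation database might look like, the following Python example stores quadruples from several sources in one in-memory SQLite table. The table layout, the `load` helper and the file names `belgium.xml` and `germany.xml` are assumptions made for illustration; the actual Aggregation Database schema is not described in the slides.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute(
    "CREATE TABLE quads (subject TEXT, predicate TEXT, object TEXT, source TEXT)"
)


def load(source, triples):
    """Store the triples of one source file, replacing an earlier load of it."""
    db.execute("DELETE FROM quads WHERE source = ?", (source,))
    db.executemany(
        "INSERT INTO quads VALUES (?, ?, ?, ?)",
        [(s, p, o, source) for s, p, o in triples],
    )


# One quadruple per country, as in the slide (file names are hypothetical)
load("austria.xml", [("#32301", "name", "St. Pölten")])
load("belgium.xml", [("#30299", "name", "Gent")])
load("germany.xml", [("#42882", "name", "Köln")])

names = db.execute(
    "SELECT subject, object, source FROM quads "
    "WHERE predicate = 'name' ORDER BY subject"
).fetchall()
for row in names:
    print(row)
```

Keying the delete on the source means a file that is harvested again replaces its own old values without touching those of other countries.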

Merging the datasets (with later updates)
Austria Stations.xml, Austria update1.xml
-> XSL Transformation -> Aggregation Database

Subject  Predicate  Object      Source
#32301   name       St. Pölten  Au..xml
#32301   date       ...         Au..xml
#32301   name       Spratzern   Au..update1.xml
#32301   date       ...         Au..update1.xml

Searching
To find all river stations in Europe you search for subjects with the type="River Station".
The query will format the result as a table for you.
Obviously you get duplicates because #32301 has been updated.

Identifier  Local_code  Name        Date  Longitude
#32301      32301       St. Pölten  ...
#32301                  Spratzern   ...
#30299      ...         Gent        ...
#42882      ...         Köln        ...
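A search of this kind could be sketched in Python as follows: select the subjects of the wanted type, then build one table row per subject and source. The `search` function and the shortened source names are assumptions for illustration, not the query interface of the real system.

```python
from collections import defaultdict

# A few quadruples from the example; source names are shortened placeholders
QUADS = [
    ("#32301", "type", "River Station", "austria.xml"),
    ("#32301", "local_code", "32301", "austria.xml"),
    ("#32301", "name", "St. Pölten", "austria.xml"),
    ("#32301", "name", "Spratzern", "update1.xml"),
    ("#30299", "type", "River Station", "belgium.xml"),
    ("#30299", "name", "Gent", "belgium.xml"),
]


def search(quads, wanted_type):
    """Return one row per (subject, source) for subjects of the wanted type."""
    subjects = {s for s, p, o, _ in quads if p == "type" and o == wanted_type}
    rows = defaultdict(dict)  # (subject, source) -> {predicate: object}
    for s, p, o, src in quads:
        if s in subjects and p != "type":
            rows[(s, src)][p] = o
    return [
        {"Identifier": s, "source": src, **values}
        for (s, src), values in rows.items()
    ]


result = search(QUADS, "River Station")
for row in result:
    print(row)
```

Station #32301 appears twice, once per source, which is the duplicate the slide refers to.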

QA work
Let’s first colour the cells by their source.

Identifier  Local_code  Name        Date  Longitude
#32301      32301       St. Pölten  ...
#32301                  Spratzern   ...
#30299      ...         Gent        ...
#42882      ...         Köln        ...

QA work
Then we merge by letting the newer sources overwrite the older:

Identifier  Local_code  Name       Date  Longitude
#32301      32301       Spratzern  ...
#30299      ...         Gent       ...
#42882      ...         Köln       ...
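The overwrite rule can be sketched as below, assuming the age of each source is known and given as a list ordered from oldest to newest. The `merge` function and the source names are hypothetical.

```python
def merge(quads, source_order):
    """Merge quadruples per subject; later sources in source_order win."""
    rank = {source: i for i, source in enumerate(source_order)}
    # Ignore quadruples from sources that are not listed at all
    known = [q for q in quads if q[3] in rank]
    merged = {}
    # Apply the oldest source first so newer values overwrite older ones
    for subject, predicate, obj, _ in sorted(known, key=lambda q: rank[q[3]]):
        merged.setdefault(subject, {})[predicate] = obj
    return merged


QUADS = [
    ("#32301", "name", "Spratzern", "update1.xml"),
    ("#32301", "local_code", "32301", "austria.xml"),
    ("#32301", "name", "St. Pölten", "austria.xml"),
    ("#30299", "name", "Gent", "belgium.xml"),
]

merged = merge(QUADS, ["austria.xml", "belgium.xml", "update1.xml"])
print(merged)
```

Only the name is overwritten; the local code survives from the older file because the update did not mention it.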

QA work
Don’t trust one source? Turn it off before you merge.

Identifier  Local_code  Name        Date  Longitude
#32301      32301       St. Pölten  ...
#32301                  Spratzern   ...
#30299      ...         Gent        ...
#42882      ...         Köln        ...

QA work
Then we merge:

Identifier  Local_code  Name        Date  Longitude
#32301      32301       St. Pölten  ...
#30299      ...         Gent        ...
#42882      ...         Köln        ...
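Turning a source off amounts to filtering its quadruples out before the merge. A minimal sketch, assuming the quadruples are listed in load order (oldest first) and using hypothetical function and source names:

```python
def without_sources(quads, disabled):
    """Drop every quadruple that comes from a disabled source."""
    return [q for q in quads if q[3] not in disabled]


def merge(quads):
    """Later quadruples overwrite earlier ones for the same subject and predicate."""
    merged = {}
    for subject, predicate, obj, _ in quads:
        merged.setdefault(subject, {})[predicate] = obj
    return merged


# Quadruples in load order, oldest first
QUADS = [
    ("#32301", "local_code", "32301", "austria.xml"),
    ("#32301", "name", "St. Pölten", "austria.xml"),
    ("#30299", "name", "Gent", "belgium.xml"),
    ("#32301", "name", "Spratzern", "update1.xml"),
]

everything = merge(QUADS)
trusted_only = merge(without_sources(QUADS, {"update1.xml"}))
print(everything["#32301"]["name"], "->", trusted_only["#32301"]["name"])
```

With the update switched off, the station keeps the name from the original delivery.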

QA work
Gapfilling? Add your own source as a layer.
The layer is stored on QAW.

Identifier  Local_code  Name        Date  Longitude
#32301      32301       St. Pölten  ...
#32301                  Spratzern   ...
#30299      ...         Gent        ...
#42882      ...         Köln        ...
#           ...
Hermann’s gapfilling layer created

QA work
Then we merge:

Identifier  Local_code  Name       Date  Longitude
#32301      32301       Spratzern  ...
#30299      ...         Gent       ...
#42882      ...         Köln       ...

And we export to our working database for production...
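One way to read the gap-filling step is that the layer supplies values only where the merged record has none. The sketch below follows that assumption, which the slides do not confirm; the layer name, the function name and the longitude value are made up for illustration.

```python
def merge_with_gapfill(merged, layer_quads):
    """Fill missing values from a gap-filling layer without overwriting data."""
    # Copy so the merged result from the reported sources stays untouched
    filled = {subject: dict(values) for subject, values in merged.items()}
    for subject, predicate, obj, _ in layer_quads:
        # setdefault keeps an existing value and only fills a gap
        filled.setdefault(subject, {}).setdefault(predicate, obj)
    return filled


# Result of an earlier merge of the reported sources
merged = {"#32301": {"local_code": "32301", "name": "Spratzern"}}

# Hypothetical gap-filling layer; the longitude is an illustrative value
layer = [
    ("#32301", "longitude", "15.6", "gapfill-layer"),
    ("#32301", "name", "Sankt Pölten", "gapfill-layer"),
]

filled = merge_with_gapfill(merged, layer)

# Rows ready to be exported to a working database
rows = [{"Identifier": s, **values} for s, values in sorted(filled.items())]
print(rows)
```

If the layer should instead behave like any other source and overwrite older values, it can be appended as the newest source before an ordinary merge.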

Trusting the dataset and trusting trust
Datasets and values can be evaluated by looking at the source:
  Is the source URL from a reliable organisation/person?
  Is the methodology described?
  Are there reviews on QAW? Who wrote the reviews?
  Are there others who have used the data? Who are they?

Summary
These new tools are intended to support the use of the Reportnet deliveries:
  Aggregation/merging
  Manual QA and gap-filling
  Traceability to the sources
  Noticing when the source has been updated/deleted
  Review of the source for inclusion
Reviewing sources was not a problem before, because only authorised parties could upload to CDR.
With SEIS, anyone can now participate.