Nikos Manolis Agro-Know Technologies Tutorial on data aggregation and accessing datasets.

Slides:



Advertisements
Similar presentations
Large-scale (meta)Data Aggregators & Infrastructure Requirements the case of agriculture Nikos Manouselis Agro-Know Technologies & ARIADNE Foundation
Advertisements

Theo van Veen, Koninklijke Bibliotheek The European Library: opportunities for new services.
Digital Repositories – Linked Open Data – the possible Role of D4Science Workshop, December 2010, FAO use cases A tool to create Linked Data providers.
Applying preservation metadata to repositories For JISC KeepIt course on Digital Preservation Tools for Repository Managers Module 3, Primer on preservation.
Supported by EU projects 12/12/2013 Athens, Greece Open Data in Agriculture Hands-on with data infrastructures that can power your agricultural data products.
Breakout Session 5: Collaboration Between DPs and SPs Protocol simple, providing a service more difficult… …because of lack of networking among DPs and.
June 22-23, 2005 Technology Infusion Team Committee1 High Performance Parallel Lucene search (for an OAI federation) K. Maly, and M. Zubair Department.
Overview of key concepts and features
Discove r Humanities and Social Science Electronic Thesaurus - HASSET Faceted search HASSET is the subject thesaurus that the UK Data Service uses to index.
National Science Digital Library (NSDL) Core Infrastructure Metadata Repository (“union catalog”) Naomi Dushay Cornell University.
Application Standards for ‘Push’ Content and Streaming Media Hadi Partovi Microsoft Corporation.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Using the DPLA API as Community Reps Webinar August 19, 2014 A PLATFORM TO BUILD UPON Danielle Cunniff
ETD Repositories Using DSpace Software Andrew Penman The Robert Gordon University 27 th September 2004.
PerfSONAR Client Construction February 11 th 2010, APAN 29 – perfSONAR Workshop Jeff Boote, Assistant Director R&D.
4th project meeting 27-29/05/2013, Budapest, Hungary FP 7-INFRASTRUCTURES programme agINFRA agINFRA A data infrastructure for agriculture.
AGRIS Multi-Host Search System: Using Dublin Core to homogenise distributed databases Frehiwot Fisseha FAO/WAICENT AGRIS/CARIS and Documentation Unit.
SWORD Stories - Easy Deposit Cutting Through Repositories’ Red Tape Sarah Currier Consultancy | E-Learning * Resource Sharing * Web 2.0 * Metadata * Repositories.
LBTO IssueTrak User’s Manual Norm Cushing version 1.3 August 8th, 2007.
Metadata Harvesting The Hague, 13 & 14 January 2009 Julie Verleyen Scientific Coordinator, Europeana Office EuropeanaLocal Knowledge Sharing Workshop.
CS1100: Access Reports A (Very) Short Tutorial on Microsoft Access Report Construction Created By Martin Schedlbauer With contributions from Matthew Ekstrand-Abueg.
Avano, an OAI harvester for marine and aquatic sciences Fred Merceur What could be improved in OAI-PMH protocol and in repositories implementation?
CollectionSpace Service REST-based APIs June 2009 Face-to-face Aron Roberts U.C. Berkeley IST/Data Services.
Materials Science Registry Will propose RDA Materials Science WG Define minimum/modest metadata extensions to Dublin Core to enable resource discovery.
Enhancing the learning content through the aggregation of social data. Frans Van Assche University of Leuven President of the ARIADNE Foundation.
Joint agINFRA & SCI-BUS workshop, 30/05/2013, Budapest, Hungary FP 7-INFRASTRUCTURES programme agINFRA Joint agINFRA & SCI-BUS workshop agINFRA.
1 OAI-PMH harvester for agricultural knowledge gathering (Development, testing and implementation) Francesco Castellani and Stefka Kaloyanova 4 February.
07/11/2002Thomas Baron - JACoW Workshop1 CERN Library Requirements T. Baron CERN ETT-DH-CDS.
The Resource Discovery Network and OAI Andy Powell UKOLN, University of Bath UKOLN is funded by Resource: The Council.
CLARIN for Linguists Portal & Searching for Resources Jan Odijk LOT Summerschool Nijmegen,
Supported by EU projects 12/12/2013 Athens, Greece Open Data in Agriculture Hands-on with data infrastructures that can power your agricultural data products.
OAI-PMH: Open Archives Initiative Protocol for Metadata Harvesting T.B. Rajashekar National Centre for Science Information (NCSI) Indian Institute of Science,
WDC-MARE – World Data Center for Marine Environmental Sciences Data portal based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler,
AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas MTA SZTAKI.
Uwe SchindlerGES 2007 – May 2-4, 2007 Data Information Service based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler 1, Benny Bräuer.
DM_PPT_NP_v01 SESIP_0715_JR HDF Server HDF for the Web John Readey The HDF Group Champaign Illinois USA.
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Phil Barker, March © Heriot-Watt University. You may reproduce all or any part.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Caltech CODA CODA: Collection of Digital Archives Caltech Scholarly Communication.
L JSTOR Tools for Linguists 22nd June 2009 Michael Krot Clare Llewellyn Matt O’Donnell.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
SPRINGER ONLINE
CNI, 4th April 2006 Slide 1 Key Standards Update: SRU (“Technical” Details) Dr. Robert Sanderson Dept. of Computer Science University of Liverpool
Supported by EU projects 12/12/2013 Athens, Greece Open Data in Agriculture Hands-on with data infrastructures that can power your agricultural data products.
UCL DEPARTMENT OF SPACE AND CLIMATE PHYSICS MULLARD SPACE SCIENCE LABORATORY Taverna Plugin VAMDC and HELIO (part of the ‘taverna-astronomy’ edition) Kevin.
Session: Ontology/Metadata Applications 8 th AOS Workshop - Rome, Sept January 14, Agricultural Organizations Registry  Information.
ΕΚΤ Access to Knowledge ΕΚΤ Access to Knowledge CERIF API: Access and reuse research information in CRIS Dimitris Karaiskos Vasilis Bonis, Nikos Pougounias.
"Data sources index" a web application to list projects in Hadoop Luca Menichetti.
Standards OAI-Protocol Metadata: DC - Agris - MODS Marc Goovaerts Hasselt University Library ODIN-PI TRAINING OSTENDE, May 2008.
2/22/2016J Ammerman1 Open Archives Initiative What is it? What’s it good for?
What is ECHO? ECHO Open Search ECHO Facts NASA’s Earth Observing System ClearingHOuse (ECHO) acts as the core metadata.
Metadata-based Discovery: Experience in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK A centre of.
Introduction to the OAI Protocol for Metadata Harvesting Version 2.0 Hussein Suleman Virginia Tech DLRL 25 March 2002.
Filling institutional repositories: considering copyright issues Susan Veldsman eIFL Content Manager
Storing digital assets on Grid/EGI FedCloud with gLibrary Giuseppe La Rocca, INFN DARIAH ERIC.
The NSDL, OAI and Your Metadata Core Infrastructure Metadata Repository (“union catalog”) Naomi Dushay Cornell University.
4th project meeting 27-29/05/2013, Budapest, Hungary FP 7-INFRASTRUCTURES programme agINFRA agINFRA A data infrastructure for agriculture.
International Planetary Data Alliance Registry Project Update September 16, 2011.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
RDA Europe: Views about PID Systems
Repository Software - Standards
Accomplishments RSM v0.7 First draft XML Schema completed: VOResource.xsd NVO: Working prototype resource using VOResource as format for metadata exchange.
OpenSearch: the data search API for everyone
Data Management: Documentation & Metadata
Search Relevancy in GEO Data Access Broker
Web archive data and researchers’ needs: how might we meet them?
Agro Hackathon Hack 5: Agro Portal and VEST Registry
Programmatic interaction with the Invenio-based NADRE Repository
Programmatic interaction with the Invenio-based NADRE Repository
Presentation transcript:

Nikos Manolis Agro-Know Technologies Tutorial on data aggregation and accessing datasets

Slide 2 of 63 There is a lot of data

Slide 3 of 63 Need for data aggregation and harmonization

Slide 4 of 63 Objectives This presentation aims to provide information on:  How to use a service for aggregating datasets  How to get already processed datasets  How to search processed datasets with a search API Educational – GLN API (21008 res) Bibliographic – ABN API ( res)

Slide 5 of 63 The agDataHarvester service Implements the OAI-PMH protocol to harvest metadata records from open data providers – REST-based API – Harvested dataset available through HTTP

Slide 6 of 63 agDataHarvester parameters { "document_type": "harvesting_target", "harvesting_target": { " name ":"Repository name", " description ":”Short Repository Description", " url ":"OAI-PMH target URL", " type ":"metadata format prefix", " frequency ":hours }

Slide 7 of 63 param.json { "document_type": "harvesting_target", "harvesting_target": { "name":"Indian Academy of Science", "description":"Indian Academy of Science", "url":" "type":"mets", "frequency":24 } curl -X POST ac.rs/agcouchdb curl -X POST param.json { "ok": true, "id": " 5c56a3fa18fa21d2a85fd63cc9eb78ac ", "rev": "1- 19ef df8f1695a32b53ecb963a" }

Slide 8 of 63 Get details on the dataset 5c56a3fa18fa21d2a85fd63cc9eb78ac

Slide 9 of 63 Get details on the dataset {" id ": " b52d79e4797e210c06e6a0aee ", "key": " b52d79e4797e210c06e6a0aee", "value": { "_id": " b52d79e4797e210c06e6a0aee", "_rev": "1-d55d7bc90d26db64dae328c9328e4e4a", "document_type": "harvesting_target", "harvesting_target": { "name": “ WorldBank ", "description": "The World Bank - Open Knowledge Repository", "url": "" "type": “mets", "frequency": 24 }, "document_publisher": { "address": " ", "author": "demo001", "utc_datetime": "Wed Dec 11 11:58: ", "utc_timestamp": }

Slide 10 of 63 The agWorkflow service dataset.process=agworkflow&dataset.type=oai_lom&dataset.accuracy=true I want all datasets with educational resources processed by the agINFRA powered aggregation workflow ! dataset.process=agworkflow&dataset.type=oai_agris&dataset.accuracy=true I want all datasets with bibliographic resources processed by the agINFRA powered aggregation workflow !

Slide 11 of 63 Is there a way to search on available datasets ?

Slide 12 of 63 Search API REST-based queries over harmonized information (result of metadata processing) Two data models supported – akif: describing educational resources for agriculture, – agrif: describing bibliographic resources for agriculture (mainly from FAO’s data),

Slide 13 of 63 Search options Simple search Searching within specific fields api/v1/akif/?languageBlocks.en.description=tomato Temporal Fetching specific items

Slide 14 of 63 Managing results Sorting results e.g ?q=*&sort_by=creationDate&sort_order=desc Facets e.g ?facets=set&facet_size=3 Pagination e.g ?q=sea&page_size=25&page=3 Full Documentation : :8080/search-api/

Nikos Manolis Agro-Know Technologies

Slide 16 of 63 … … demo001…demo005 // aginfra

Slide 17 of 63 View all harvested datasets