TELEIOS Virtual Observatory Infrastructure for Earth Observation Data

Slides:



Advertisements
Similar presentations
IST SEWASIE 16 May 2002 Sonia Bergamaschi Università di Modena e Reggio Emilia.
Advertisements

Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Remote Sensing at the service of Natural Disasters Warning: the Contribution of the Supersites Concept Professor Stuart Marsh Co-Chair GEO Institutions.
GEOSS Common Infrastructure (GCI): status and evolution EC Side Event - GEO Plenary IX Foz do Iguacu, November 2012 Mirko Albani Earth Observation.
DILIGENT Digital libraries powered by the Grid Peter Fankhauser
TELEIOS 2nd User Community Workshop Introduction and Agenda Dr. Eva Klien Spatial Information Management Fraunhofer IGD Tel – 412
Multidisciplinary interoperability To build Operating Capacity Initial (IOC): extend current GEO IP3/AIP-2 interoperability systems in Forest, Biodiversity,
Chapter 2. Slide 1 CULTURAL SUBJECT GATEWAYS CULTURAL SUBJECT GATEWAYS Subject Gateways  Started as links of lists  Continued as Web directories  Culminated.
Architecture and Data Management Strategy (Action Plan) Ivan 1 DeLoatch, USGS, ADC Co-chair Alessandro Annoni, EC, ADC Co-chair Jay Pearlman, IEEE, ADC.
5th International Terra Cognita Workshop In Conjunction with the 11th International Semantic Web Conference Boston, USA, November 12, 2012 Querying Linked.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
Architecture for Pattern- Base Management Systems Manolis TerrovitisPanos Vassiliadis National Technical Univ. of Athens, Dept. of Electrical and Computer.
Sheldon Brown, UCSD, Site Director Milton Halem, UMBC Director Yelena Yesha, UMBC Site Director Tom Conte, Georgia Tech Site Director Fundamental Research.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
MUSCLE WP9 E-Team Integration of structural and semantic models for multimedia metadata management Aims: (Semi-)automatic MM metadata specification process.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
FP OntoGrid: Paving the way for Knowledgeable Grid Services and Systems WP8: Use case 1: Quality Analysis for Satellite Missions.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
GEM/IRDR Social Vulnerability and Resilience Information System and Metadata Portal IRDR Scientific Board Meeting Chengdu 03/11/2012.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
ASI-Eumetsat Meeting Matera, 4-5 Feb CNM Context Matera, February 4-5, 20092ASI-Eumetsat Meeting.
Session 1 - The TELEIOS infrastructure for Real Time Fire Monitoring 2nd User Community Workshop Darmstadt, May 2012 Presenter: Ugo Di Giammatteo.
1 Advanced Semantic Technologies Prof. Deborah McGuinness and Dr. Patrice Seyed CSCI CSCI ITWS ITWS TA: Justin.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Data models, Query Languages, Implemented Systems and Applications of Linked Geospatial Data Dept. of Informatics and Telecommunications National and Kapodistrian.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Conclusions Presenter: Manolis Koubarakis Extended Semantic Web Conference 2012.
Virtual Information and Knowledge Environments Workshop on Knowledge Technologies within the 6th Framework Programme -- Luxembourg, May 2002 Dr.-Ing.
1 DMS-DQS-SUPSC03-PRE-12-E © DEIMOS Space S.L., 2007 A Semantic Data Grid for Satellite Mission Quality Analysis Reuben Wright Deimos Space.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
1 The LEO Project Linked Open Earth Observation Data for Precision Farming Presenter: Manolis Koubarakis (UoA ) 1.
GCI Architecture GEOSS Information System Meeting 20 September 2013, ESA/ESRIN (Frascati, Italy) M.Albani (ESA), D.Nebert (USGS/FGDC), S.Nativi (CNR)
DataGrid France 12 Feb – WP9 – n° 1 WP9 Earth Observation Applications.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
1 © 2007 Humboldt Consortium Fraunhoferstraße Darmstadt HUMBOLDT Surveys: The Handbook of Standards and the User Classification.
Database Principles: Fundamentals of Design, Implementation, and Management Chapter 1 The Database Approach.
CrossCutting topic: Data Quality and European Network of EO Networks
1 This slide indicated the continuous cycle of creating raw data or derived data based on collections of existing data. Identify components that could.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Dr.S.Sridhar,Ph.D., RACI(Paris),RZFM(Germany),RMR(USA),RIEEEProc.
ESA-FAO GEOPortal STATUS & PLANS
NextGEOSS data hub incl. alpha release
INTRODUCTION TO GENERATING SERVICES
Discovering and accessing data from a distributed network of data centres S. Mazzeo (ESA)
Environmental Intelligence Platform – Monitoring Nutrients Pollution with Earth Observation Data for Sustainable Agriculture and Clean Waters Blue.
Big Linked Geospatial Data and its Application to Earth Observation
Data and Applications Security Developments and Directions
improve the efficiency, collaborative potential, and
Presentation on Copernicus Dissemination
Presenter Organisation(s)
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Building Virtual Earth Observatories Using Semantic Web Technologies
SMART GROUND platform overview
Presenter Organisation(s)
Tools for Linked Geospatial Data Exploiting Open Data
DATABASE SYSTEM UNIT I.
Design central EMODnet portal Objectives and Technical description Initial draft prepared by the Flanders Marine Institute.
Database Systems Instructor Name: Lecture-3.
Session 2: Metadata and Catalogues
SEO Report to WGISS Brian Killough CEOS Systems Engineering Office (SEO) WGISS-46 Meeting October 22, 2018.
Open Science: the crucial importance of metadata
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
Business Intelligence
Chapter 2 Database Environment Pearson Education © 2014.
TOOLS & Projects overview
Presentation transcript:

TELEIOS Virtual Observatory Infrastructure for Earth Observation Data 2nd User Community Workshop Darmstadt, 10 May 2012 Presenter: Manolis Koubarakis (NKUA) 1 Suvayu Ali, S.F.U. Suvayu Ali, S.F.U. 1

Outline Why TELEIOS? State of the art in Earth Observation data centers Going beyond the state of the art: developing a Virtual Earth Observatory using TELEIOS technologies Technical work highlights for M1-M20 Impact 2 Suvayu Ali, S.F.U. Suvayu Ali, S.F.U. 2

Motivation 3 Suvayu Ali, S.F.U. Suvayu Ali, S.F.U. 3

Motivation (cont’d) Big Data! ~20 PB Suvayu Ali, S.F.U. 4 Suvayu Ali, S.F.U. Suvayu Ali, S.F.U. 4

Motivation (cont’d) Suvayu Ali, S.F.U. Suvayu Ali, S.F.U. 5 Estimated directly affected inhabitants Japan, Honshu 2011 Damage Assessment Earthquake Haiti, 2010 5 Suvayu Ali, S.F.U. Suvayu Ali, S.F.U. 5

TELEIOS: Virtual Observatory Infrastructure for Earth Observation Data A STREP project funded under the 5th call of FP7/ICT under the Strategic Objective ICT 2009.4.3 “Intelligent Information Management”. Emphasis of this objective was on “scalability” and “systematic empirical testing” of systems for “very large amounts of data” (i.e., big data). 6 Suvayu Ali, S.F.U.

Consortium National and Kapodistrian University of Athens Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e. V. Deutsches Zentrum für Luft- und Raumfahrt e.V. Centrum Wiskunde & Informatica National Observatory of Athens Advanced Computer Systems With support from 7 Suvayu Ali, S.F.U. Suvayu Ali, S.F.U. 7

State of the Art in EO Data Centers Users PDGS Catalogue Raw Data Processing Chains Archive 8 Suvayu Ali, S.F.U. Suvayu Ali, S.F.U. Suvayu Ali, S.F.U. Suvayu Ali, S.F.U. 8 8 8

Can I pose the following query using EOWEB? Example Can I pose the following query using EOWEB? Find images taken by the MSG2 satellite on August 25, 2007 which contain fire hotspots in areas which have been classified as forests according to Corine Land Cover, and are located within 2km from an archaeological site in the Peloponnese. 9 Suvayu Ali, S.F.U.

Example (cont’d) 10 Suvayu Ali, S.F.U.

Example (cont’d) Well, only partially. Find images taken by the MSG2 satellite on August 25, 2007 which contain fire hotspots in areas which have been classified as forests according to Corine Land Cover, and are located within 2km from an archaeological site in the Peloponnese. 11 Suvayu Ali, S.F.U.

However, EO data centers today do not allow: Example (cont’d) But why? All this information is available in the satellite images and other auxiliary data sources of EO data centers or on the Web. However, EO data centers today do not allow: the mining of satellite image content and its integration with other relevant data sources so the previous query can be answered. 12 Suvayu Ali, S.F.U.

The TELEIOS Earth Observatory: Concept View Web Portals Rapid Mapping Linked Geospatial Data Semantic Annotations Ontologies Knowledge Discovery and Data Mining KNOWL- EDGE GIS Data Derived Products Metadata Features Scientific Database and Semantic Web Technologies Raw Data Ingestion Processing Content Extraction Cataloguing DATA Archiving Suvayu Ali, S.F.U.

User requirements capture Highlights of M1-M20 Community forming User requirements capture Theory (data models, query languages, knowledge discovery algorithms, software architecture) Development of individual software components First version of the TELEIOS infrastructure 14 Suvayu Ali, S.F.U.

Community Forming Organized the first TELEIOS user community workshop on October 13, 2010 in Frascati (ESA/ESRIN). Attended by: Users Data providers Scientists Technology providers Disseminated TELEIOS work program 15 Suvayu Ali, S.F.U.

Capturing User Requirements We collected detailed requirements for the two TELEIOS use cases: DLR Use Case: A Virtual Observatory for TerraSAR-X data NOA Use Case: Real-time fire monitoring based on continuous acquisitions of EO images and geospatial data. 16 Suvayu Ali, S.F.U.

Community Forming (cont’d) We are today in the 2nd TELEIOS user community workshop. Being attended by 25 participants (12 outside TELEIOS) Goals: Present our achievements for M1-M20. Get your feedback! 17 Suvayu Ali, S.F.U.

Technical Contributions EO data as multidimensional arrays: the SciQL query language. Data vaults. Implementation in the column store MonetDB. 18 Suvayu Ali, S.F.U.

What is SciQL? SciQL is an SQL-based query language for scientific applications with arrays as first-class citizens. SciQL advantages: Express low level image processing and image content analysis in a high-level declarative language. Uniform access to image data (arrays) and metadata (tables) for knowledge discovery. 19 Suvayu Ali, S.F.U.

Why SciQL? Fire Classification in C void firesClassify(double *t039, double *t108, int rows, int columns, int *fire) { int win, win_2, r, c, i; double mean039, std039, mean108, std108; double thr[] = {0.0,310.0,2.5,8.0,2.0,310.0,4.0,10.0,2.0}; win=3; win_2=win>>1; for(i=0;i<rows*columns;i++) fire[i]=0; for(r=win_2;r<rows-win_2;r++) { for(c=win_2;c<columns-win_2;c++) { temperatureMeanAndStd(t039, columns, win, r, c, &mean039, &std039); temperatureMeanAndStd(t108, columns, win, r, c, &mean108, &std108); /* Fire Test */ if (t039[r*columns+c]>thr[5] && std039>thr[6] && t039[r*columns+c]-t108[r*columns+c]>thr[7] && std108<thr[8]) fire[r*columns+c]=2; /* Potential Fire Test */ if (fire[r*columns+c]==0 && t039[r*columns+c]>thr[1] && std039>thr[2] && t039[r*columns+c]-t108[r*columns+c]>thr[3] && std108<thr[4]) fire[r*columns+c]=1; } Fire Classification in C

Why SciQL? (cont’d) Fire Classification in C void temperatureMeanAndStd(double *T, int columns, int windowSize, int r, int c, double *mean, double *std) { double s, ss,d; int i,j; int win_2, win2; win_2=windowSize>>1; win2=windowSize*windowSize; s=ss=0.0; for (i=0;i<win;i++) { for (j=0;j<win;j++) { d=T[(r-win_2+i)*columns+(c-win_2+j)]; s+=d; ss+=d*d; } *mean=s/win2; *std=sqrt(ss/win2-(*mean)*(*mean)); Fire Classification in C

Why SciQL? (cont’d) CREATE ARRAY hrit_T039_image_array (x INTEGER DIMENSION, y INTEGER DIMENSION, v FLOAT); CREATE ARRAY hrit_T108_image_array 22 Suvayu Ali, S.F.U.

Why SciQL? (cont’d) Suvayu Ali, S.F.U. 23 SELECT [x], [y], CASE WHEN v039 > 310 AND v039 - v108 > 10 AND v039_std_dev > 4 AND v108_std_dev < 2 THEN 2 WHEN v039 > 310 AND v039 - v108 > 8 AND v039_std_dev > 2.5 AND v108_std_dev < 2 THEN 1 ELSE 0 END AS confidence FROM ( SELECT [x], [y], v039, v108, SQRT( v039_sqr_mean - v039_mean * v039_mean ) AS v039_std_dev, SQRT( v108_sqr_mean - v108_mean * v108_mean ) AS v108_std_dev FROM ( SELECT [x], [y], v039, v108, AVG( v039 ) AS v039_mean, AVG( v039 * v039 ) AS v039_sqr_mean, AVG( v108 ) AS v018_mean, AVG( v108 * v108 ) AS v108_sqr_mean FROM ( SELECT [T039.x], [T039.y], T039.v AS v039, T108.v AS v108 FROM hrit_T039_image_array AS T039 JOIN hrit_T108_image_array AS T108 ON T039.x = T108.x AND T039.y = T108.y ) AS image_array GROUP BY image_array[x-1:x+2][y-1:y+2] ) AS tmp1; ) AS tmp2 23 Suvayu Ali, S.F.U.

Data vault advantages: Make the DBMS “aware” of external file formats. What is a data vault? The data vault framework addresses the problem of ingesting scientific data of various formats into tables or arrays. Data vault advantages: Make the DBMS “aware” of external file formats. Files are loaded into tables/arrays only when needed. 24 Suvayu Ali, S.F.U.

Technical Contributions (cont’d) A framework for knowledge discovery from EO data, metadata and relevant geospatial data sets. 25 Suvayu Ali, S.F.U.

Search Engines for TerraSAR-X data Implemented TerraSAR-X image search engines based on the developed knowledge discovery framework: Based on semantic annotations representing the classes that characterize the image content, and semantics-based queries being used to access this characterizations. Content-based image retrieval using innovative image similarity measures 26 Suvayu Ali, S.F.U.

Technical Contributions (cont’d) Semantic annotations capturing the knowledge discovered: the model stRDF, the query language stSPARQL and their implementation in the system Strabon. 27 Suvayu Ali, S.F.U.

What are stRDF and stSPARQL? stRDF is an extension of the W3C standard RDF for the representation of geospatial data that may change over time. stSPARQL is the query language for stRDF. 28 Suvayu Ali, S.F.U.

Why stSPARQL? (cont’d) Suvayu Ali, S.F.U. SELECT ?IM WHERE { ?H rdf:type noa:Hotspot . ?H strdf:hasGeometry ?HGEO . ?H noa:isDerivedFromFile ?SHP . ?SHP rdf:type noa:Shapefile . ?SHP noa:isDerivedFromFile ?RD . ?RD rdf:type noa:RawData . ?RD noa:hasAcquisitionDateTime ?T . ?RD noa:hasFilename ?IM . ?RD noa:isDerivedFromSatellite "METEOSAT9" . ?R rdf:type clc:Region . ?R clc:hasLandUse ?F . ?F rdfs:subClassOf clc:Forests . ?R strdf:hasGeometry ?RGEO . dbpedia:Peloponnese strdf:hasGeometry ?PGEO . ?SITE rdf:type dbpedia:Archaeological_sites . ?SITE strdf:hasGeometry ?SGEO . FILTER( strdf:anyInteract(?RGEO,?HGEO) && strdf:distance(?RGEO,?SGEO) < 2 && strdf:contains(?PGEO,?SGEO) && "2007-08-25T00:00:00" <= str(?T) && str(?T) <= "2007-08-25T23:59:59") } 29 Suvayu Ali, S.F.U.

Helping users to formulate stSPARQL queries: a visual query builder. Technical Contributions (cont’d) Helping users to formulate stSPARQL queries: a visual query builder. 30 Suvayu Ali, S.F.U.

The TELEIOS Architecture and Infrastructure The TELEIOS software architecture Integrated system for the NOA use case featuring the NOA fire monitoring service implemented using MonetDB and Strabon. Individual components for DLR use case including the image search engines. 31 Suvayu Ali, S.F.U.

Impact Important societal and industrial impact ICT, Space, Environment and Security research European programs in Earth Observation GMES (EO data management, Emergency Response Core Service) ESA (KIM, KEO, KLAUS, EOLib etc.) Scientific Data Management Geospatial Semantic Web and Linked Geospatial Data Open Source Intelligence 32 Suvayu Ali, S.F.U.

Welcome again and thank you for your attention! Questions? 33 Suvayu Ali, S.F.U.