Machine learning integration in Earth observation projects at CRIM

Slides:



Advertisements
Similar presentations
Spatial Ontology Community of Practice Workshop, USGS, Dec.2, Using Knowledge to Facilitate Better Data Discovery, Access, and Utilization for CloudGIS.
Advertisements

Designing Services for Grid-based Knowledge Discovery A. Congiusta, A. Pugliese, Domenico Talia, P. Trunfio DEIS University of Calabria ITALY
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
ARGUGRID Use Case using Instrumentation Mary Grammatikou National Technical University of Athens OGF 2009, Catania.
Metadata Development in the Earth System Curator Spanning the Gap Between Models and Datasets Rocky Dunlap, Georgia Tech.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Advanced Data Mining and Integration Research for Europe ADMIRE – Framework 7 ICT ADMIRE Overview European Commission 7 th.
WGISS CNES SIT-30 Agenda Item 10 CEOS Action / Work Plan Reference 30 th CEOS SIT Meeting CNES Headquarters, Paris, France 31 st March – 1 st April 2015.
A Global Agriculture Drought Monitoring and Forecasting System (GADMFS) Meixia Deng and Liping Di.
Last Words COSC Big Data (frameworks and environments to analyze big datasets) has become a hot topic; it is a mixture of data analysis, data mining,
1 OPeNDAP/ECHO Demo Integrating and Chaining services September, 2006 CEOS WGISS 22 Annapolis, MD.
Per Møldrup-Dalum State and University Library SCAPE Information Day State and University Library, Denmark, SCAPE Scalable Preservation Environments.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
ACGT: Open Grid Services for Improving Medical Knowledge Discovery Stelios G. Sfakianakis, FORTH.
1 WS-GIS: Towards a SOA-Based SDI Federation Fábio Luiz Leite Júnior Information System Laboratory University of Campina Grande
2. WP9 – Earth Observation Applications ESA DataGrid Review Frascati, 10 June Welcome and introduction (15m) 2.WP9 – Earth Observation Applications.
The Evolution of Earth Observation in Canada - A Perspective Luc Brûlé Director General, Space Utilization Canadian Space Agency
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
Semantics in Web Service Composition for Risk Management Michael Lutz European Commission – DG Joint Research Centre Ispra, Italy EcoTerm IV, Vienna,
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
Trustworthy Semantic Webs Building Geospatial Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas October 2006 Presented at OGC Meeting,
Design and Use of Earth Observation Image Content Tools Mihai Datcu(1, 2), Daniele Cerra(1), Houda Chaabouni-Chouayakh(1), Amaia de Miguel(1), Daniela.
NextGEOSS data hub incl. alpha release
USGS EROS LCMAP System Status Briefing for CEOS
AmeriGEOSS Regional Initiative A framework that seeks to promote collaboration and coordination among the GEO Members of the American Continent Angelica.
Discovering and accessing data from a distributed network of data centres S. Mazzeo (ESA)
Tools and Services Workshop
Joslynn Lee – Data Science Educator
LSI-Related Activities
Data Interoperability Summary
INTAROS WP5 Data integration and management
Implementing through the GCI
Data Ingestion in ENES and collaboration with RDA
Workplan for Updating the As-built Architecture of the 2007 GEOSS Architecture Implementation Pilot Session 7B, 6 June 2007 GEOSS Architecture Implementation.
Flanders Marine Institute (VLIZ)
Major ILS disciplines What does iSchools like SILS study?
The e-sensing architecture for big Earth observation data analytics
knowledge organization for a food secure world
Space Data Services Session 2: Space Data Country Outreach and Delivery Agenda Item #3 Brian Killough CEOS Systems Engineering Office (SEO) February 25,
YourDataStories: Transparency and Corruption Fighting through Data Interlinking and Visual Exploration Georgios Petasis1, Anna Triantafillou2, Eric Karstens3.
Machine Learning Basics
Numbers, places, decisions
Workflows with ENVI and Esri Agriculture workflows for ICARDA
OGC Testbed 13 Outcomes and Information on the OGC Testbed 14
Making Data Matter: Earth Science Information Partners
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
Core Platform The base of EmpFinesse™ Suite.
What's New in eCognition 9
2. An overview of SDMX (What is SDMX? Part I)
Common Solutions to Common Problems
Session 2: Metadata and Catalogues
Toolbox Benchmarking data session BDVe Meetup, Sofia May 15, 2018
FDA-08 FDA Whitepaper Update
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
Metadata Development in the Earth System Curator
Chaitali Gupta, Madhusudhan Govindaraju
Report on Canadian Activities
UPTIME & SEMANTIC WEB STANDARDS
Brian Killough (NASA, SEO), Mirko Albani (ESA, WGISS Chair)
Machine Learning for Space Systems: Are We Ready?
how users and data producers interact on WIS
The Student’s Guide to Apache Spark
A. Della Vecchia, D. Guerrucci (ESA)
What's New in eCognition 9
What's New in eCognition 9
QoS Metadata Status 106th OGC Technical Committee Orléans, France
ESA EO Thesauri Andrea Della Vecchia (Randstad), Yves Coene (Spacebel)
Federated Earth Observation (FedEO)
Cultivating Semantics for Data in Agriculture and Nutrition
Presentation transcript:

Machine learning integration in Earth observation projects at CRIM Geosymposium 2019 Jean-Francois Rajotte July 15th 2019

About This talk An overview of selected ongoing or upcoming EO-ML projects at CRIM Most results shown are preliminary and their purpose here are to give application examples Me Data science researcher at CRIM EO is a subset of my work Soon starting a new position at the Data Science Institute of UBC Will still stay an associated researcher at CRIM

Overview CRIM GeoImageNet OGC Machine Learning testbed MUSE DACCS

Computer Research Institute of Montreal CRIM is a not-for-profit applied research center In operation for more than 30 years 56 employees 80 to 100 projects, ~50 publications per year Can also be flipped With financial support from:

GeoImageNet Motivation ImageNet 14 million images 20k categories Taxonomie based on WordNet hierarchy Enabled great improvements in deep learning for computer vision Goals Create an open platform allowing collaborative annotation of geospatial data Create a large annotated dataset Platform to share data, models and evaluation services In operation late 2019 Production of taxonomies for EO optical data for Land cover classes and objects. Offers an API to access and run trained models. Taxonomy : classification

GeoImageNet Annotation of data by specialists.

OGC Machine Learning Testbed Goal: To present a holistic approach on how to support and integrate emerging AI tools using OGC Web Services ML training execution Knowledge base Models Data Features Metadata Semantic enablement search Sponsors Determine the requirements of each services then put them together in a proof of concept http://docs.opengeospatial.org/per/18-038r2.html#_main_findings WPS : web processing Service Interoperabilité Separate deliverable : specification Demo put everything together

OGC Machine Learning Testbed Proof of concept Source Training Output Scenario : disaster like flood Model = decision tree http://docs.opengeospatial.org/per/18-038r2.html#poc http://docs.opengeospatial.org/per/18-038r2.html#ExampleClause http://docs.opengeospatial.org/per/18-038r2.html#_semantic_interoperability

OGC Machine Learning Testbed Semantic enablement of ML Extensible, unified ontology for structured observations as a semantic enrichment service Query example : automobile Semantic interoperability Controlled vocabulary manager: Provides a common vocabulary to all components within the architecture. Future work : Query interpreter (NLP) Include experiments on automatic generation of workflows or selection of catalogues workflows taht are closest to a user query http://docs.opengeospatial.org/per/18-038r2.html#_d166_semantic_enablement_of_ml http://docs.opengeospatial.org/per/18-038r2.html#_semantic_interoperability

MUSE Goals Provide institutional users a powerful chain able to process big multi-sensor data in near real-time Integrate RADARSAT together with other EO data and enhance processing technologies Axes Optimized system architecture based on open source Optimize processing time Processing and production of data Use case 2017 flood in Quebec, Canada This project just started https://drive.google.com/file/d/1UnvIlToYqazK7AiRZBIxYe5y5VxWarqm/view?usp=sharing

MUSE Ingestion should be done with the main use in mind : projection, partition...

MUSE Open Data Cube An integrated gridded data analysis environment for analysis ready earth observation data Multi-sources Exploratory analysis Large-scale workflow Using Jupyter notebooks

MUSE Sainte-Marthe sur le lac, flood of spring 2019 Sentinel-2 (2018-05-11) Sentinel-2 (2019-05-06)

MUSE Sainte-Marthe sur le lac, flood of spring 2019 NDPI Computation Water detection Normalized Difference Pound Index NDPI = (B3 - B11) / (B3 + B11) B3 (S2 Green Band) (10 m) B11 (S2 SWIR Band) (20 m)

Polygonize NDPI changes mask Image Sentinel-2 (2018-05-11) NRG

MUSE Unsupervised clustering related to land use Fraction of land use within unsupervised cluster Land Use % of land use Sentinel 1 Not expected to have a one-to-one relation, but ideally a combination of unsupervised clusters could correspond to a land use Clusters sentinel 2 (not shown) Preliminary exploration with 100m resolution

Muse → more complex API Big data Losing infrastructure abstraction: Distributed computing Big data Losing infrastructure abstraction: Need to know your infrastructure Decide the data partitioning based on the data, the infra and the usage More optimization trick with assuming the user’s intent (ex: projection) → more complex API

DACCS Data Analytics for Canadian Climate Services A workflow-based science Gateway (virtual laboratory). Adds new climate services and applications to ESGF: Sea ice from observations and model simulations C02 and methane concentration measurement Climate extremes and cyclone tracking Coastal vulnerability analysis Deep Learning-based Land Cover Mapping EO Datacube Kickoff in September 2019 A comprehensive climate data analysis tool https://docs.google.com/presentation/d/1Gk3wcG4BcwdNFsa-CM1hXcdqcVMoaFs409gax08wvvI/edit?usp=sharing Funded by:

DACCS climatedata.ca Climate data portal Public portal for DACCS Collaborative development of Canadian Center for Climate Service lead by CRIM Launched June 2019 climatedata.ca Mostly Temp and precipitation for now Regional climate services consortia are partners of Canadian Center for Climate Services (CCCS), and new consortia are being fostered. Collaborative development of a Canadian climate data portal led by CRIM. Launch June 2019. CCCS welcomes collaboration with countries sharing needs for user-driven climate services for population, indigenous or remote communities.

DACCS Natural Language Understanding tools Metadata generation Convert and encode resources available into queryable form of metadata Ressources Workflow Dataset Ontology ML-based generator

DACCS Natural Language Understanding tools User-oriented query alignment Enrich and transcode the user query with knowledge resources to guide the query process on the encoded metadata Ressources Query : natural language Ontologies Algorithms (to interpret the query)

Conclusion This has been an overview of few EO projects at CRIM. We are still thinking about ways to adresse the sharing of data, features and concepts betweens components of our workflow. Data semantics is definitely a key element here. We need to know what are the best practices to help get in the right direction.

End

Thresholding

Simple unsupervised land classification Dimension reduction, ex: PCA Explained variance Visualization Classification