Databases and Global Environmental Change: Information Technology for Sustainable Development Gilberto Câmara INPE, Instituto Nacional de Pesquisas Espaciais.

Slides:



Advertisements
Similar presentations
Future Directions and Initiatives in the Use of Remote Sensing for Water Quality.
Advertisements

Group on Earth bservations Discussion Paper on a Framework Dr. Ghassem Asrar August 1, 2003.
1 GlobModel The GlobModel study, initial findings and objectives of the day Zofia Stott 13 September 2007.
Reducing Deforestation in Amazonia: The rôle of information and communication technologies Gilberto Câmara National Institute for Space Research (INPE)
Space Weather in CMA Xiaonong Shen Deputy Administrator China Meteorological Administration 17 May 2011 WMO Cg-XVI Side Event Global Preparedness for Space.
We now have a Geo-Linux. What’s next? Gilberto Câmara National Institute for Space Research (INPE), Brazil Institute for Geoinformatics, University of.
Anticipating Extreme Hydrologic Events …how real-time data empowers communities and individuals to survive and recover from disasters AMS Corporate Forum.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Are we ready for REDD? Multidimensional policies for reducing Amazon deforestation: Gilberto Câmara Director, National Institute for Space Research.
2012: Hurricane Sandy 125 dead, 60+ billion dollars damage.
Biofuel production in Brazil: challenges for land use policy Gilberto Câmara Dialogo Brasil-Alemanha de Ciencia e Inovação Licence: Creative Commons ̶̶̶̶
Cracow Grid Workshop November 5-6 Support System of Virtual Organization for Flood Forecasting L. Hluchy, J. Astalos, V.D. Tran, M. Dobrucky and G.T. Nguyen.
Weather Forecasting – The Traditional Approach Pine cones open and close according to air humidity. An open pine cone means dry weather. Ash leaf before.
Building a Framework for Data Preservation of Large-Scale Astronomical Data ADASS London, UK September 23-26, 2007 Jeffrey Kantor (LSST Corporation), Ray.
B1 -Biogeochemical ANL - Townhall V. Rao Kotamarthi.
Working at a global scale: challenges for a worldwide tropical forest monitoring system Gilberto Câmara General Director National Institute for Space Research.
Grid resources for NWP models at national level in Korea Korean Meteorological Administration Super Computer Center Korea Meteorological Administration.
N EW TRENDS IN G EOINFORMATICS IN A CHANGING WORLD Gilberto Câmara National Institute for Space Research, Brazil.
Division of Satellites and Environmental Systems Applications of GOES-SA (South America)
March 2004 At A Glance ITOS is a highly configurable low-cost control and monitoring system. Benefits Extreme low cost Database driven - ITOS software.
Dr. Sarawut NINSAWAT GEO Grid Research Group/ITRI/AIST GEO Grid Research Group/ITRI/AIST Development of OGC Framework for Estimating Near Real-time Air.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
I’ve found the data; it’s free and open access. Now what? Gilberto Câmara National Institute for Space Research (INPE) Brazil.
INPE´s contribution to REDD Capacity Building: data, applications, and software Gilberto Câmara Director General National Institute for Space Research.
DISTRIBUTED DATA FLOW WEB-SERVICES FOR ACCESSING AND PROCESSING OF BIG DATA SETS IN EARTH SCIENCES A.A. Poyda 1, M.N. Zhizhin 1, D.P. Medvedev 2, D.Y.
PolarGrid Geoffrey Fox (PI) Indiana University Associate Dean for Graduate Studies and Research, School of Informatics and Computing, Indiana University.
Big Data in Science (Lessons from astrophysics) Michael Drinkwater, UQ & CAASTRO 1.Preface Contributions by Jim Grey Astronomy data flow 2.Past Glories.
ENEON first workshop Observing Europe: Networking the Earth Observation Networks in Europe September, Paris Summary on data availability, sharing,
1 T.C. TURKISH STATE METEOROLOGİCAL SERVICE DEPARTMENT OF RESEARCH AND INFORMATION TECHNOLOGIES METEOROLOGICAL DATA MANAGEMENT Mustafa Sert October 2011.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
1 Addressing Critical Skills Shortages at the NWS Environmental Modeling Center S. Lord and EMC Staff OFCM Workshop 23 April 2009.
© Crown copyright 2011 Met Office WOW - Weather Observations Website Crowd-sourced weather obs for real OGC TC 79 Brussels, Chris Little & Aidan.
Beyond OGC Standards: The New Challenges for Open Source GIS Gilberto Câmara Director General, National Institute for Space Research (INPE) Brazil OGRS.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
The Namibia Flood Dashboard Satellite Acquisition and Data Availability through the Namibia Flood Dashboard Matt Handy NASA Goddard Space Flight Center.
Data-intensive Geoinformatics: using big geospatial data to address global change questions Gilberto Câmara GIScience 2012 Workshop on Big Data Licence:
Page 1 Pacific THORPEX Predictability, 6-7 June 2005© Crown copyright 2005 The THORPEX Interactive Grand Global Ensemble David Richardson Met Office, Exeter.
Impact of Pacific Climate Variability on Ocean Circulation, Marine Ecosystems & Living Resources Francisco Chavez MBARI Lead PI Dick Barber, Duke University.
Astro / Geo / Eco - Sciences Illustrative examples of success stories: Sloan digital sky survey: data portal for astronomy data, 1M+ users and nearly 1B.
Symposium on multi-hazard early warning systems for integrated disaster risk management A JCOMM perspective Enhanced early warning for better coastal or.
From Virtual Globes to Open Globes Gilberto Câmara (INPE, Brazil)
Transparency builds governance Gilberto Câmara National Institute for Space Research (INPE) Brazil GEO Data Sharing WG,
Geosciences - Observations (Bob Wilhelmson) The geosciences in NSF’s world consists of atmospheric science, ocean science, and earth science Many of the.
Databases and Global Environmental Change Gilberto Câmara Diretor, INPE.
The KB e-Depot long-term preservation of scientific publications in practice Marcel Ras, National library of The Netherlands.
Land Use and human- enviroment interactions in Amazonia Gilberto Câmara National Institute for Space Research (INPE) FAPESP 50 Years Symposium, 2011.
GEOSS: the view from the South Gilberto Câmara Director, National Institute for Space Research Brazil.
Free Earth Observation Data on a Global Scale Gilberto Câmara General Director National Institute for Space Research Brazil.
How light can the Digital Earth be? Gilberto Câmara National Institute for Space Research (INPE) Brazil Eye on Earth Summit,
WGISS and GEO Activities Kathy Fontaine NASA March 13, 2007 eGY Boulder, CO.
Designing a Global Interoperable Information Network Gilberto Câmara National Institute for Space Research, Brazil Eye on Earth Summit, Abu Dhabi, 2011.
1 Earth Science Technology Office The Earth Science (ES) Vision: An intelligent Web of Sensors IGARSS 2002 Paper 02_06_08:20 Eduardo Torres-Martinez –
EScience: Techniques and Technologies for 21st Century Discovery Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering Computer Science.
Vision of an Integrated Global Observing System Gregory W. Withee Assistant Administrator for Satellite and Information Services National Oceanic and Atmospheric.
Monitoring Tropical Forests and Agriculture: the Roadmap for a Global Land Observatory Gilberto Câmara National Institute for Space Research (INPE), Brazil.
Project number: ENVRI and the Grid Wouter Los 20/02/20161.
Support to scientific research on seasonal-to-decadal climate and air quality modelling Pierre-Antoine Bretonnière Francesco Benincasa IC3-BSC - Spain.
The Global Scene Wouter Los University of Amsterdam The Netherlands.
Challenges for land use policy in Brazil Gilberto Câmara Dialogo Brasil-Alemanha de Ciencia e Inovação Licence: Creative Commons ̶̶̶̶ By Attribution ̶̶̶̶
Supporting the “Solving Business Problems with Environmental Data” Competition 24 th October 2013 Vlad Stoiljkovic.
British Antarctic Survey Polar Science For Planet Earth (PSPE) Images can be downloaded here from the BAS image collection here:
The Helmholtz Association Project „Large Scale Data Management and Analysis“ (LSDMA) Kilian Schwarz, GSI; Christopher Jung, KIT.
HELIO: Discovery and Analysis of Data in Heliophysics Robert Bentley, John Brooke, André Csillaghy, Donal Fellows, Anja Le Blanc, Mauro Messerotti, David.
“Building public good instituions in emerging nations” Gilberto Câmara Director, National Institute for Space Research Brazil
Modelling Theory Part I: Basics
Daniel Vila, Luiz A. Toledo Machado
Gilberto Câmara National Institute for Space Research (INPE) Brazil
Reducing Deforestation in Amazonia: how transparency builds governance
The Global Observing System for Climate Carolin Richter, Director
eGY Planning Meeting Boulder, February 2005
Presentation transcript:

Databases and Global Environmental Change: Information Technology for Sustainable Development Gilberto Câmara INPE, Instituto Nacional de Pesquisas Espaciais Brazilian Academy of Sciences, Annual Meeting, May 2012

source: IGBP How is the Earth’s environment changing, and what are the consequences for human civilization? The fundamental question of our time

Global Change Where are changes taking place? How much change is happening? Who is being impacted by the change?

Limits for Models source: John Barrow (after David Ruelle) Complexity of the phenomenon Uncertainty on basic equations Solar System Dynamics Meteorology Chemical Reactions Hydrological Models Particle Physics Quantum Gravity Living Systems Global Change Social and Economic Systems

Limits for Models source: John Barrow (after David Ruelle) Complexity of the phenomenon Uncertainty on basic equations Solar System Dynamics Meteorology Chemical Reactions Hydrological Models Particle Physics Quantum Gravity Living Systems Global Change Social and Economic Systems e-science

Collaborative e-science Territory (Geography) Money (Economy) Culture (Antropology) Modelling (IT) Connect expertise from different fields Make the different conceptions explicit

Até 10% % 20 – 30% 30 – 40% 40 – 50% 50 – 60% 60 – 70% 70 – 80% 80 – 90% 90 – 100% Amazonia ( km2 = size of Europe) Deforestation in Amazonia

Data (we need a lot of it) Deforestation in Brazilian Amazonia ( ) dropped from 27,000 km 2 to 6,200 km 2

Daily warnings of newly deforested large areas Real-time Deforestation Monitoring

Tb of data lines of code 150 man/years of software dev 200 man/years of interpreters How much it takes to survey Amazonia?

TerraAmazon – open source software for large-scale land change monitoring Spatial database (PostgreSQL with vectors and images) : 5 million polygons, 500 GB images

Terrestrial Airborne Near- Space LEO/MEO Commercial Satellites and Manned Spacecraft Far- Space L1/HEO/GEO TDRSS & Commercial Satellites Deployable Permanent Forecasts & Predictions Aircraft/Balloon Event Tracking and Campaigns User Community Vantage Points Capabilities Welcome to the Age of Data-intensive Science!

Weather and climate source: WMO 11,000 land stations (3000 automated) 900 radiosondes, 3000 aircraft 6000 ships, 1300 buoys 5 polar, 6 geostationary satellites

ARGOS Data Collection System (16000 plats) 650,000 messages processed daily

Argo bouy network

Data chain in Earth System Science fonte: NASA

Data-intensive Science = principles and applications of information technology for handling very large data sets

IT concepts are essential to global change researchers (but most of them don’t know it) Global change challenges will motivate new research in IT (but most of us are not looking there) Conjectures

Which data is out there? How to organize big data? How to get the data I need? Challenges for data-intensive science How to model big data? How to access and use big data?

Stage 1 – A scientist’s personal database Local database User interface Database creationAnalysisDatabase access

Stage 1 – A scientist’s personal database Local database User interface Database creationAnalysisDatabase access The good: data is close to you (or so you think) The bad: no long-term data preservation no data sharing

Stage 2 – A scientific lab database Corporate database User interface Database creation AnalysisDatabase access

Stage 2 – A scientific lab database Corporate database User interface Database creation AnalysisDatabase access The good: long-term data preservation data sharing inside the lab reusable corporate software The bad: substantial costs on data admin little outside data sharing

ECMWF Metview – MOPTC June Metview

ECMWF Metview – MOPTC June Field plotting

Stage 3 – A scientific lab database in the cloud Corporate database User interface Database creation AnalysisDatabase access

Stage 3 – A scientific lab database in the cloud Corporate database User interface Database creation AnalysisDatabase access The good: long-term data preservation shared costs on data admin The bad: rewrite software for cloud processing outside data sharing still not solved

Risk Analysis Analysis

On-line data feed ModelsSatellite/RadarDCP Rain total Fixed time and irregular – alert Point data One file per DCP Grid 4km Total rain 1h Total rain 24h Current (mm/h) Binary file ETA 40, 20, 5 Km Ensemble 40 Km Total rain 72h 72 files ASCII grid file

TerraMA 2 - Natural Disasters Monitoring and Alert System

Stage 4 – Multidatabase access Data source Data source Data source Modelling Data discoveryData accessAnalysis Remote Analysis

Stage 4 – Multidatabase access Data source Data source Data source Modelling Data discoveryData accessAnalysis Remote Analysis The good: long-term data preservation shared costs on data admin access to large external database The bad: rewrite software for cloud processing finding data is a major problem

Data Access Hitting a Wall Current science practice based on data download How do you download a petabyte?

Data Access Hitting a Wall Current science practice based on data download How do you download a petabyte? You don’t! Move the software to the archive

Scientific Data Management in the Coming Decade (Jim Gray, 2005) Next-generation science instruments and simulations will produce peta-scale datasets. Such peta-scale datasets will be housed by science centers that provide substantial storage and processing for scientists who access the data via smart notebooks. The procedural stream-of- bytes-file-centric approach to data analysis is both too cumbersome and too serial for such large datasets. Database systems will be judged by their support of common metadata standards and by their ability to manage and access peta-scale datasets.

36 Virtual Observatory If data is online, internet is the world ’ s best telescope Scientific Data Management in the Coming Decade (Jim Gray)

Where is scientific database going?

From tables to arrays nomeCPF cargo SQL language selection, projection, join, relation (table) SELECT * FROM images WHERE date=“today ” relational algebra SELECT Mean (A.B) FROM Array A AQL language Spatial queries, Math operations Scientific data Array Algebra

Communicating concepts is hard Image source: WMO vulnerability? climate change? poverty?

degradation We’re bad at representing meaning deforestation? degradation? disturbance? Communicating concepts is hard

When did the Aral Sea reach the tipping point? Communicating change is very hard

Describing events and processes is very hard When did the flood occur?

Earth System Science data management poses a major challenge for the database community We need new techniques, architectures and data handling techniques to deal with scientific data Conclusions