Scaling Bathymetry: Data handling for large volumes

Scaling Bathymetry: Data handling for large volumes
Mark Masry, CARIS R&D
Fredericton – Canada • Heeswijk – The Netherlands • Ellicott City – United States

Trends
- Sensors are getting better
- Processors are more powerful
- Hard drives are getting bigger

Hard Drive Trends

Tech Summary
- Current storage formats for grids and points are good, but:
  - Single resolution
  - Lots of data required in memory
- Goals:
  - Rebuild data storage mechanisms from the ground up
  - Use multiple resolutions
  - Structure data so that not all of it is required in memory

New Technology Stack
[Diagram: where the CSAR framework fits in the technology stack. Data flows between Applications, Grid and Point Cloud, the CSAR Framework, and the Storage Device.]

CSAR Framework
- A framework for managing chunks of data
- Data chunks have flexible structure
- Storage device independent
- The basis for new Grids and Point Clouds
- A platform for data management in the coming years

CSAR Block Diagram
[Diagram: data flows between the Set Layer, Cache Layer, Storage Layer, and Storage Device.]

CSAR Set Layer
- Primary point of interaction with the CSAR framework
- Associates data chunks with unique keys
- Keys can be anything
- Used to index a collection of chunks
- Retrieves chunks of data from the Cache layer or from the Storage layer
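The lookup path described on this slide can be sketched roughly as follows. This is a minimal illustration, not the CSAR API: the class name `ChunkSet`, its methods, and the use of plain dictionaries for the cache and storage layers are all assumptions made for the example.

```python
from typing import Dict, Hashable, Optional


class ChunkSet:
    """Hypothetical set layer: maps arbitrary hashable keys to data chunks."""

    def __init__(self, storage: Dict[Hashable, bytes]) -> None:
        self._storage = storage                   # stand-in for the storage layer
        self._cache: Dict[Hashable, bytes] = {}   # stand-in for the cache layer
        self.storage_reads = 0                    # counts trips to storage

    def get(self, key: Hashable) -> Optional[bytes]:
        # Serve the chunk from the cache when it is already in memory.
        if key in self._cache:
            return self._cache[key]
        # Otherwise fall through to storage and remember the result.
        chunk = self._storage.get(key)
        if chunk is not None:
            self.storage_reads += 1
            self._cache[key] = chunk
        return chunk


# Keys "can be anything": here a tuple and a string index the same collection.
storage = {("tile", 0, 0): b"depths-a", "metadata": b"WGS84"}
chunks = ChunkSet(storage)
first = chunks.get(("tile", 0, 0))    # read from storage, then cached
second = chunks.get(("tile", 0, 0))   # served from the cache
```

The second lookup does not touch storage, which is the behaviour the slide attributes to the set layer working with the cache.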

CSAR Caching Layer
- Stores chunks in memory in a common pool
- Cache size can be modified dynamically
- Layer can have a single cache or split data up into multiple independent caches
- Chunks swapped in and out of cache from storage on request
- Lazy writeback
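A resizable cache with lazy write-back might look something like the sketch below. The slide does not say which eviction policy CSAR uses; least-recently-used eviction is assumed here purely for illustration, and `ChunkCache` and its methods are hypothetical names.

```python
from collections import OrderedDict


class ChunkCache:
    """Hypothetical chunk cache: LRU eviction (an assumption) with lazy write-back."""

    def __init__(self, capacity, storage):
        self.capacity = capacity
        self.storage = storage          # dict standing in for the storage layer
        self._chunks = OrderedDict()    # key -> chunk, least recently used first
        self._dirty = set()             # keys modified since they were loaded

    def read(self, key):
        if key in self._chunks:
            self._chunks.move_to_end(key)      # mark as recently used
            return self._chunks[key]
        chunk = self.storage[key]              # swap in from storage on request
        self._chunks[key] = chunk
        self._evict()
        return chunk

    def write(self, key, chunk):
        # Only memory is updated here; storage is written when the chunk leaves.
        self._chunks[key] = chunk
        self._chunks.move_to_end(key)
        self._dirty.add(key)
        self._evict()

    def resize(self, capacity):
        # The cache size can change at run time; shrinking evicts immediately.
        self.capacity = capacity
        self._evict()

    def _evict(self):
        while len(self._chunks) > self.capacity:
            key, chunk = self._chunks.popitem(last=False)
            if key in self._dirty:             # lazy write-back on eviction
                self.storage[key] = chunk
                self._dirty.discard(key)


storage = {"a": b"1", "b": b"2"}
cache = ChunkCache(capacity=2, storage=storage)
cache.write("c", b"3")
written_immediately = "c" in storage       # the write has not reached storage yet
cache.read("a")
cache.read("b")                            # evicts "c", which is written back now
written_after_eviction = storage.get("c")
cache.resize(1)                            # shrinking evicts the oldest clean chunk
```

Deferring the write until eviction keeps repeated edits to the same chunk in memory, at the cost of needing an explicit flush before shutdown in a real system.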

CSAR Storage Layer
- Communicates with the Storage Device
- Could be a proprietary file, a database, or a network
- Translates chunks of data from storage into internal format
- Writes and reads can happen without blocking processing
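One way to picture a device-independent storage layer with non-blocking I/O is the sketch below. Everything in it is an assumption for illustration: the `StorageBackend` interface, the in-memory backend, JSON as the on-device format, and a thread pool as the mechanism for keeping reads and writes off the processing thread.

```python
import json
from abc import ABC, abstractmethod
from concurrent.futures import Future, ThreadPoolExecutor


class StorageBackend(ABC):
    """Hypothetical device interface: a file, a database, or a network service."""

    @abstractmethod
    def read_raw(self, key: str) -> bytes: ...

    @abstractmethod
    def write_raw(self, key: str, raw: bytes) -> None: ...


class InMemoryBackend(StorageBackend):
    """Trivial backend used only to make the example runnable."""

    def __init__(self) -> None:
        self._blobs = {}

    def read_raw(self, key: str) -> bytes:
        return self._blobs[key]

    def write_raw(self, key: str, raw: bytes) -> None:
        self._blobs[key] = raw


class StorageLayer:
    """Translates chunks to and from the device format on worker threads."""

    def __init__(self, backend: StorageBackend, workers: int = 2) -> None:
        self._backend = backend
        self._pool = ThreadPoolExecutor(max_workers=workers)

    def write(self, key: str, chunk: dict) -> Future:
        raw = json.dumps(chunk).encode("utf-8")   # internal format -> device format
        return self._pool.submit(self._backend.write_raw, key, raw)

    def read(self, key: str) -> Future:
        return self._pool.submit(self._load, key)

    def _load(self, key: str) -> dict:
        raw = self._backend.read_raw(key)
        return json.loads(raw.decode("utf-8"))    # device format -> internal format

    def close(self) -> None:
        self._pool.shutdown(wait=True)


layer = StorageLayer(InMemoryBackend())
layer.write("tile-0", {"depths": [12.5, 13.0]}).result()  # wait only when needed
loaded = layer.read("tile-0").result()
layer.close()
```

Because callers receive a `Future`, processing can continue and only block at the point where the chunk is actually needed. Swapping the backend class changes the device without touching the layers above.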

Storage: CSAR File
- First implementation of a storage backend
- Designed to store large chunks of data
- Can have multiple Grids or Clouds in a single file
- Based on a lightweight open source database
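The slide does not name the database behind the CSAR file or describe its layout. As a rough sketch of the general idea, the example below uses SQLite (through Python's standard `sqlite3` module) as a stand-in, with a hypothetical `chunks` table that keys each binary chunk by dataset name and chunk key so that several grids or clouds can share one file.

```python
import sqlite3


def open_chunk_file(path):
    """Open (or create) a single-file chunk store. The schema is illustrative only."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS chunks ("
        " dataset TEXT NOT NULL,"
        " chunk_key TEXT NOT NULL,"
        " data BLOB NOT NULL,"
        " PRIMARY KEY (dataset, chunk_key))"
    )
    return conn


def put_chunk(conn, dataset, chunk_key, data):
    # The connection context manager commits on success and rolls back on error.
    with conn:
        conn.execute(
            "INSERT OR REPLACE INTO chunks (dataset, chunk_key, data)"
            " VALUES (?, ?, ?)",
            (dataset, chunk_key, data),
        )


def get_chunk(conn, dataset, chunk_key):
    row = conn.execute(
        "SELECT data FROM chunks WHERE dataset = ? AND chunk_key = ?",
        (dataset, chunk_key),
    ).fetchone()
    return None if row is None else bytes(row[0])


# ":memory:" keeps the example self-contained; a real file path works the same way.
conn = open_chunk_file(":memory:")
put_chunk(conn, "survey_grid", "0/0", b"\x00\x01")   # a grid chunk
put_chunk(conn, "survey_cloud", "0/0", b"\xff")      # a cloud chunk, same file
grid_chunk = get_chunk(conn, "survey_grid", "0/0")
cloud_chunk = get_chunk(conn, "survey_cloud", "0/0")
```

The composite primary key lets two datasets reuse the same chunk key without colliding.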

Storage: Oracle Spatial
- Write a new Storage Layer implementation
- Supports high volume Grid and Cloud types
- Use native Oracle Spatial data representation
- Translate data chunks to and from Oracle Spatial representation
- Store in cache layer, then write them back

Data Structures
- Grid and Cloud
- High volume
- Built using CSAR
- Multi-resolution
- Multi-band with many data types

Georeferenced Cloud
- Storage for high volume (X,Y,Z) points
- Tested 300,000,000 points with multiple attribute bands and flags on each point
- Has both high level and low level structure for points
- Can view and interact with entire cloud in 2D and 3D
- Intend to edit directly on the cloud without extracting subsections

Georeferenced Grid
- Stores high volume gridded data
- More than 40 billion grid nodes with multiple attribute bands
- Programmable updaters to create multiple levels of resolution
- Inter-band dependencies are handled
- Also connects to GDAL
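The idea of programmable updaters producing multiple resolution levels can be illustrated with a small resolution pyramid. This is a sketch under stated assumptions, not the CSAR grid implementation: the 2x2 block reduction, the function names `coarsen` and `build_pyramid`, and the choice of `mean` and `min` as updaters are all hypothetical. Using `min` as a "shoalest depth" rule assumes depths are stored as positive-down values.

```python
from statistics import mean
from typing import Callable, List, Sequence

Grid = List[List[float]]
Updater = Callable[[Sequence[float]], float]


def coarsen(grid: Grid, updater: Updater) -> Grid:
    """Build the next coarser level by reducing each 2x2 block with `updater`."""
    rows, cols = len(grid), len(grid[0])
    out = []
    for r in range(0, rows, 2):
        out_row = []
        for c in range(0, cols, 2):
            # Edge blocks may be smaller than 2x2 when a dimension is odd.
            block = [grid[rr][cc]
                     for rr in range(r, min(r + 2, rows))
                     for cc in range(c, min(c + 2, cols))]
            out_row.append(updater(block))
        out.append(out_row)
    return out


def build_pyramid(grid: Grid, updater: Updater) -> List[Grid]:
    """Return all levels, from full resolution down to a single node."""
    levels = [grid]
    while len(levels[-1]) > 1 or len(levels[-1][0]) > 1:
        levels.append(coarsen(levels[-1], updater))
    return levels


depths = [[10.0, 12.0, 20.0, 22.0],
          [14.0, 16.0, 24.0, 26.0],
          [30.0, 32.0, 40.0, 42.0],
          [34.0, 36.0, 44.0, 46.0]]

mean_levels = build_pyramid(depths, mean)   # average depth per block
shoal_levels = build_pyramid(depths, min)   # shallowest depth per block
```

Passing the updater as a function is what makes the coarser levels "programmable": a different rule per attribute band only requires a different callable.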

Visualization
- 2D/3D support for new Grid and Cloud
- Fast loads and zooms
- Smooth even for large data sets
- Rebuilt 3D visualization engine
- Dynamic lighting and colour mapping
- System brings in data in the background while moving

Remote Visualization
- Raster and Clouds structured for remote visualization
- Visualization over web using Spatial Fusion
- Visualization over network using applications
- Remote visualization from Bathy DataBASE
- Fast load times facilitated by data structures

3D Point Cloud Viz
[Three image-only slides.]

3D Raster Viz
[Three image-only slides.]

Conclusions
- CSAR provides a new platform for all our applications for the coming years
- Organizes, loads and caches data partitioned into chunks
- New data structures for gridded and point data designed for high volumes
- New visualization engine