Big Data: Arrays :: RDA 4th Plenary :: © 2014 P. Baumann RDA 4th Plenary 2014-sep-22, Amsterdam, The Netherlands Peter Baumann Jacobs University | rasdaman.

Slides:



Advertisements
Similar presentations
GALEON :: OGC Huntsville, 2006-mar-08 IUB GALEON Status Update Huntsville, 2006-mar-08 Peter Baumann, Ivan Delchev International University Bremen, rasdaman.
Advertisements

ASIAES Project Overview Satellite Image Network for Natural Hazard Management in ASEAN+3 region Pakorn Apaphant Geo-Informatics and Space Technology Development.
The EarthServer initiative: towards Agile Big Data Services
Development of Java plug-in for Geoserver to read Oracle GeoRaster Data Authors Baskar Dhanapal CoreLogic Bangalore, India Perumal Chinnuswamy CoreLogic.
Provenance in Open Distributed Information Systems Syed Imran Jami PhD Candidate FAST-NU.
Jennifer Widom NoSQL Systems Overview (as of November 2011 )
Relational Database Alternatives NoSQL. Choosing A Data Model Relational database underpin legacy applications and meet business needs However, companies.
1 SSO 1st Nov, 2011 Applying WCO Ontology to Geospatial Web Coverage Services Xia Wang and Peter Baumann Jacobs University.
Zero-programming Sensor Network Deployment 學生:張中禹 指導教授:溫志煜老師 日期: 5/7.
CSE 190: Internet E-Commerce Lecture 10: Data Tier.
A Data Resolver Architecture for Discovering Pervasive Data Sources Matthew Denny Database Group U.C. Berkeley.
Kapil Bakshi Distinguished Architect
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
D4M – Signal Processing On Databases
XML, distributed databases, and OLAP/warehousing The semantic web and a lot more.
Windows Azure SQL Database and Storage Name Title Organization.
CHAPTER 8: MANAGING DATA RESOURCES. File Organization Terms Field: group of characters that represent something Record: group of related fields File:
Overview of Standard Query Language (SQL) Saeideh Joodaki Instructor: Dr.Yingshu Li.
SEEK EcoGrid l Integrate diverse data networks from ecology, biodiversity, and environmental sciences l Metacat, DiGIR, SRB, Xanthoria,... l EML is the.
Windows Azure Conference 2014 Designing Applications for Scalability.
Big Table - Slides by Jatin. Goals wide applicability Scalability high performance and high availability.
When bet365 met Riak and discovered a true, “always on” database.
Recursive Functions Creating Hierarchical Reports Date: 9/30/2008 Dan McCreary President Dan McCreary & Associates (952) M.
TA. Min-Joong Lee x7837)
Potential standardization items for the cloud computing in SC32 1 WG2 N1665 ISO/IEC JTC 1/SC 32 Plenary Meeting, Berlin, Germany, June 2012 Sungjoon Lim,
MongoDB is a database management system designed for web applications and internet infrastructure. The data model and persistence strategies are built.
[ Part III of The XML seminar ] Presenter: Xiaogeng Zhao A Introduction of XQL.
GIS Data Structures How do we represent the world in a GIS database?
Dynamo: Amazon’s Highly Available Key-value Store DAAS – Database as a service.
NoSQL Systems Motivation. NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis.
Chapter 13.3: Databases Invitation to Computer Science, Java Version, Second Edition.
AIP Deep Dive :: Coverage Cubes :: © 2015 rasdaman - AIP-8 Results - GEO XII Plenary, Mexico City, Alexandru Mircea Dumitru Jacobs University.
Scalable data access with Impala Zbigniew Baranowski Maciej Grzybek Daniel Lanza Garcia Kacper Surdy.
Jennifer Widom Relational Databases The Relational Model.
NoSQL: Graph Databases. Databases Why NoSQL Databases?
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT Working.
NoSQL databases A brief introduction NoSQL databases1.
MSF and MAGE: e-Science Middleware for BT Applications Sep 21, 2006 Jaeyoung Choi Soongsil University, Seoul Korea
Science SQL :: EGI-GEANT :: © 2014 P. Baumann EGI-GEANT Symposium 2014-sep-25, CWI, Amsterdam, The Netherlands Peter Baumann Jacobs University | rasdaman.
EScience Needs :: Brussels :: P. Baumann eScience Needs in the Big Data Era e-IRG Workshop Athens, Greece, 9-10 June 2014 Peter Baumann, Dimitar Misev.
CS422 Principles of Database Systems Introduction to NoSQL Chengyu Sun California State University, Los Angeles.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
Copyright, Open Geospatial Consortium Making Location Count Peer-to-Peer File Sharing An Answer to the SDI blues North Carolina GIS Conference February,
CPSC8985 FA 2015 Team C3 DATA MIGRATION FROM RDBMS TO HADOOP By Naga Sruthi Tiyyagura Monika RallabandiRadhakrishna Nalluri.
Dive into NoSQL with Azure Niels Naglé Hylke Peek.
Microsoft Ignite /28/2017 6:07 PM
Large Scale Semantic Data Integration and Analytics through Cloud: A Case Study in Bioinformatics Tat Thang Parallel and Distributed Computing Centre,
George Percivall Delft, The Netherlands 22 March 2017
Neo4j: GRAPH DATABASE 27 March, 2017
RDA WG on Dynamic Data Citation
Database Systems: Design, Implementation, and Management Tenth Edition
UNIDART Uniform Data Request Interface
RDA Plenary 5 Big Data (Analytics) IG Session
Cloud Computing for Science
Central Satellite Data Repository Supporting Research and Development
Preserving Geo-Scientific Data Assets Through Service Interoperability
CS122B: Projects in Databases and Web Applications Winter 2017
Modern Data Management
Worldbank Conference on Land and Poverty, 2017-mar-23
NoSQL Systems Overview (as of November 2011).
Validating the Rasdaman Capability for Handling Big Raster Data
Topics Covered in COSC 6340 Data models (ER, Relational, XML (short))
Relational Databases The Relational Model.
Relational Databases The Relational Model.
Intro to NoSQL Databases
Topics Covered in COSC 6340 Data models (ER, Relational, XML)
Databases.
Intro to NoSQL Databases
Intro to NoSQL Databases
Which INSPIRE download service to implement?
Presentation transcript:

Big Data: Arrays :: RDA 4th Plenary :: © 2014 P. Baumann RDA 4th Plenary 2014-sep-22, Amsterdam, The Netherlands Peter Baumann Jacobs University | rasdaman GmbH Big Data Analytics WG: Use Case Array Databases

Big Data: Arrays :: RDA 4th Plenary :: © 2014 P. Baumann Structural Variety in Big Data  Stock trading: 1-D sequences (i.e., arrays)  Social networks: large, homogeneous graphs  Ontologies: small, heterogeneous graphs  Climate modelling: 4D/5D arrays  Satellite imagery: 2D/3D arrays (+irregularity)  Genome: long string arrays  Particle physics: sets of events  Bio taxonomies: hierarchies (such as XML)  Documents: key/value stores = sets of unique identifiers + whatever  etc.

Big Data: Arrays :: RDA 4th Plenary :: © 2014 P. Baumann  „raster data manager“: SQL + n-D raster objects  Scalable parallel “tile streaming” architecture  In operational use -OGC Web Coverage Service Core Reference Implementation rasdaman: Agile Array Analytics select img.green[x0:x1,y0:y1] > 130 from LandsatArchive as img where avg_cells( img.nir ) < 17

Big Data: Arrays :: RDA 4th Plenary :: © 2014 P. Baumann Array Databases: Practice Proven with rasdaman  from simple data access to agile analytics -strictly based on open OGC Big Geo Data standards   130+ TB databases, 2D, 3D x/y/z & x/y/t, 4D x/y/z/t timeseries  single query distributed to 1,000+ cloud nodes

Big Data: Arrays :: RDA 4th Plenary :: © 2014 P. Baumann  ISO 9075 Part 15: SQL/MDA -resolved by ISO SQL WG in June 2014  n-D arrays as attributes  declarative array operations select id, encode(scene.band1-scene.band2)/(scene.nband1+scene.band2)), „image/tiff“ ) from LandsatScenes where acquired between „ “ and „ “ and avg( scene.band3-scene.band4)/(scene.band3+scene.band4)) > 0 Recent Progress: ISO Array SQL create table LandsatScenes( id: integer not null, acquired: date, scene: row( band1: integer,..., band7: integer ) array [ 0:4999,0:4999] )