Bridging SQL and NoSQL Rupal Dhillon U00283533.

Slides:



Advertisements
Similar presentations
The State of the Art in Distributed Query Processing by Donald Kossmann Presented by Chris Gianfrancesco.
Advertisements

HadoopDB Inneke Ponet.  Introduction  Technologies for data analysis  HadoopDB  Desired properties  Layers of HadoopDB  HadoopDB Components.
Jennifer Widom NoSQL Systems Overview (as of November 2011 )
HadoopDB An Architectural Hybrid of Map Reduce and DBMS Technologies for Analytical Workloads Presented By: Wen Zhang and Shawn Holbrook.
Introduction to Backend James Kahng. Install Node.js.
Retrieving Documents with Geographic References Using a Spatial Index Structure Based on Ontologies Database Laboratory University of A Coruña A Coruña,
1 ©2007, University of Pisa, Dip. Ingegneria dell’Informazione – Andrea Bacioccola Survey on Database Architectures A. Bacioccola.
Stimulating reuse with an automated active code search tool Júlio Lins – André Santos (Advisor) –
CS 405G: Introduction to Database Systems 24 NoSQL Reuse some slides of Jennifer Widom Chen Qian University of Kentucky.
Sfdc_ppt_corp_template_01_01_2012.ppt Unlocking Proprietary Data with PostgreSQL Foreign Data Wrappers Pat Patterson Principal Developer Evangelist
How WebMD Maintains Operational Flexibility with NoSQL Rajeev Borborah, Sr. Director, Engineering Matt Wilson – Director, Production Engineering – Consumer.
First Indico Workshop Database Technology Pedro Ferreira May 2013 CERN.
AN INTRODUCTION TO NOSQL DATABASES Karol Rástočný, Eduard Kuric.
What makes Facebook do what it does? By Gavin Mais.
SQL vs NOSQL Discussion
By: Blake Peters.  OODB- Object Oriented Database  An OODB is a database management system in which information is represented in the form of objects.
Data Virtualization & Information As A Service (IaaS) By Anil Allewar Senior Solutions Architect - Synerzip 1.
Persistence Store Project Proposal.
HadoopDB project An Architetural hybrid of MapReduce and DBMS Technologies for Analytical Workloads Anssi Salohalla.
Modern Databases NoSQL and NewSQL Willem Visser RW334.
Amazon Web Services BY, RAJESH KANDEPU. Introduction  Amazon Web Services is a collection of remote computing services that together make up a cloud.
Changwon Nati Univ. ISIE 2001 CSCI5708 NoSQL looks to become the database of the Internet By Lawrence Latif Wed Dec Nhu Nguyen and Phai Hoang CSCI.
Spatial Tajo Supporting Spatial Queries on Apache Tajo Slideshare Shorten URL : goo.gl/j0VLXpgoo.gl/j0VLXp.
Position Paper How should the business logic be developed? As middle-tier application? or as a DBMS stored procedures? Akash Sharma Hiren Nagar.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Indexes and Views Unit 7.
Jennifer Widom NoSQL Systems Motivation. Jennifer Widom NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not.
NOSQL Implementation and examples Maciej Matuszewski.
Dec. 13, 2002 WISE2002 Processing XML View Queries Including User-defined Foreign Functions on Relational Databases Yoshiharu Ishikawa Jun Kawada Hiroyuki.
Mastering Redis Data Masters.
NoSQL Systems Motivation. NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis.
NoSQL: Graph Databases. Databases Why NoSQL Databases?
NoSQL databases A brief introduction NoSQL databases1.
Question 1 Which of the following requires object database technology? A. Storing employee picture on disk B. Retrieving an employee picture C. Retrieving.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
BIG DATA. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database.
BIG DATA/ Hadoop Interview Questions.
DATA TODAY L. Grewe. Whole range of solutions This class  Focus on learning Mid-end Traditional Data Systems = Relational Database Systems  Note: if.
1 Analysis on the performance of graph query languages: Comparative study of Cypher, Gremlin and native access in Neo4j Athiq Ahamed, ITIS, TU-Braunschweig.
Microsoft Ignite /28/2017 6:07 PM
1 Cloud-Native Data Warehousing Bob Muglia. 2 Scenarios with affinity for cloud Gartner 2016 Predictions: By 2018, six billion connected things will be.
Big Data-An Analysis. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult.
CS 405G: Introduction to Database Systems
NoSQL: Graph Databases
and Big Data Storage Systems
Big Data Enterprise Patterns
Jeremy Shafer Temple University
Prepared by: Assistant prof. Aslamzai
DSS & Warehousing Systems
CS122B: Projects in Databases and Web Applications Winter 2017
Every Good Graph Starts With
Hybrid Cloud Architecture for Software-as-a-Service Provider to Achieve Higher Privacy and Decrease Securiity Concerns about Cloud Computing P. Reinhold.
Modern Databases NoSQL and NewSQL
Big-Data Fundamentals
NOSQL databases and Big Data Storage Systems
NoSQL Systems Overview (as of November 2011).
Big Data - in Performance Engineering
NOSQL and CAP Theorem.
SQL - כתיבת שאילתות למתחילים
NoSQL Systems Motivation.
Lecture 1: Multi-tier Architecture Overview
Overview of big data tools
Database Architecture
Market-based Dynamic Task Allocation in Mobile Surveillance Systems
Transaction Properties: ACID vs. BASE
Introduction to NoSQL Database Systems
Cloud versus Cloud: How Will Cloud Computing Shape Our World?
Applying Agile Lean to Global Software Development
The Database World of Azure
Big Data.
Presentation transcript:

Bridging SQL and NoSQL Rupal Dhillon U00283533

Introduction In this project, we bring traditional relational databases and non-relational data storage closer together by reducing the developer’s effort required to combine data from both types of databases. Motivated by the recent trend to use non-relational data storage, which leads to separate databases for a single application and the consequential extra work involved with managing multiple data sources.

What is NoSQL? The relational database is the most famous database system. These relational database systems are developed, used, and optimized for decades and offer a solid solution for data storage especially in the area of web applications. Recently a new trend is spotted in web development communities. Besides the traditional relational databases, products like CouchDB, MongoDB, Neo4j, Apache Cassandra, memcached, Redis, JADE, and Apache Hadoop are encountered more often in the context of web development. They are called NoSQL data storage systems.

Definitions SQL : An SQL database is a traditional relational database which can be queried using SQL. NoSQL : A NoSQL database is a database that is not an SQL database. Data is not stored in relations and the main query language to retrieve data is not SQL.

Problem Origin When a single software product requires data storage where a part of the data is ideally stored in a NoSQL database, whereas the rest of the data is perfectly relational and thus well-suited for a traditional SQL database. Raises the question- what type of data storage must be chosen for the application? The obvious compromise is a solution where the relational part of the data is stored in an SQL database, while the non-relational data is stored in NoSQL.

Drawbacks… Scenarios where data from both sources is required. Developers need to access different database systems.

Hybrid Database Abstraction layer on top of the SQL and NoSQL databases. Abstraction layer is responsible for retrieving the relevant data from the underlying SQL and NoSQL databases. Combines the fetched data into a single query result. Desired database abstraction architecture [1]

Approach Separate software layer Load SQL data in NoSQL, either virtual or materialized Load NoSQL data in SQL, either virtual or materialized

Theoretical framework for bridging SQL and NoSQL Architectural workflow illustrating the life of a query [2]

Experimental framework Software: SQL database- PostGRESQL NoSQL database- MongoDB Data : 2 different data sources- Saleable products of a company Twitter data

Conclusion The result of this work is a theoretical framework to bridge SQL and NoSQL. The developer sends a query containing a NoSQL query pattern. This pattern is translated to an equivalent SQL fragment that correctly joins the NoSQL triples retrieved from copies of the triple relation . The triples contained in the relation represent the NoSQL data we wanted to query. The result is a pure SQL query which can be executed like any other query. I have also tried to provide an experimental framework and developed a few ideas which I would like to implement in the near future.

References [1] Roijackers, J., & Fletcher, G. H. L. (2012). Bridging sql and nosql. Master's thesis, Eindhoven University of Technology. [2] Roijackers, J., & Fletcher, G. H. L. (2012). Bridging sql and nosql. Master's thesis, Eindhoven University of Technology.