A Social blog using MongoDB ITEC-810 Final Presentation Lucero Soria - 42403871 Supervisor: Dr. Jian Yang.

Slides:



Advertisements
Similar presentations
Chapter 10: Designing Databases
Advertisements

IMPLEMENTATION OF INFORMATION RETRIEVAL SYSTEMS VIA RDBMS.
Chapter 12 File Processing and Data Management Concepts
C6 Databases.
Management Information Systems, Sixth Edition
The Relational Database Model
1 More MongoDB: Ch 3- 8, plus a little Hadoop CSSE 533 Week 2, Spring, 2015.
7.1 © 2006 by Prentice Hall 7 Chapter Managing Data Resources.
Chapter 11 Data Management Layer Design
U of R eXtensible Catalog Team MetaCat. Problem Domain.
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
Data at the Core of the Enterprise. Objectives  Define of database systems  Introduce data modeling and SQL  Discuss emerging requirements of database.
Chapter 4 Relational Databases Copyright © 2012 Pearson Education 4-1.
What is MongoDB? Developed by 10gen It is a NoSQL database A document-oriented database It uses BSON format.
VICTORIA UNIVERSITY OF WELLINGTON Te Whare Wananga o te Upoko o te Ika a Maui SWEN 432 Advanced Database Design and Implementation MongoDB Data Modeling.
7.1 Copyright © 2005 Pearson Education Canada Inc. Management Information Systems, Second Canadian Edition Chapter 7: Managing Data Resources MANAGING.
Data at the Core of the Enterprise. Objectives  Define of database systems.  Introduce data modeling and SQL.  Discuss emerging requirements of database.
Systems analysis and design, 6th edition Dennis, wixom, and roth
ASP.NET Programming with C# and SQL Server First Edition
PHP Programming with MySQL Slide 8-1 CHAPTER 8 Working with Databases and MySQL.
MongoDB An introduction. What is MongoDB? The name Mongo is derived from Humongous To say that MongoDB can handle a humongous amount of data Document.
NoSQL continued CMSC 461 Michael Wilson. MongoDB  MongoDB is another NoSQL solution  Provides a bit more structure than a solution like Accumulo  Data.
WTT Workshop de Tendências Tecnológicas 2014
Goodbye rows and tables, hello documents and collections.
CHAPTER 8: MANAGING DATA RESOURCES. File Organization Terms Field: group of characters that represent something Record: group of related fields File:
6 Chapter Databases and Information Management. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits.
Chapter 7 Working with Databases and MySQL PHP Programming with MySQL 2 nd Edition.
CHAPTER:14 Simple Queries in SQL Prepared By Prepared By : VINAY ALEXANDER ( विनय अलेक्सजेंड़र ) PGT(CS),KV JHAGRAKHAND.
NOSQL DATABASES Please remember to read the NOSQL Distilled book and the Seven Databases book.
Methodological Foundations of Biomedical Informatics (BMSC-GA 4449) Himanshu Grover.
Relational Database CISC/QCSE 810 some materials from Software Carpentry.
Database Essentials. Key Terms Big Data Describes a dataset that cannot be stored or processed using traditional database software. Examples: Google search.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
Introduction to Databases Trisha Cummings. What is a database? A database is a tool for collecting and organizing information. Databases can store information.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
Okalo Daniel Ikhena Dr. V. Z. Këpuska December 7, 2007.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
What have we learned?. What is a database? An organized collection of related data.
MongoDB is a database management system designed for web applications and internet infrastructure. The data model and persistence strategies are built.
SQL Jan 20,2014. DBMS Stores data as records, tables etc. Accepts data and stores that data for later use Uses query languages for searching, sorting,
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
Introduction to MongoDB
MongoDB Jer-Shuan Lin.
1 © Xchanging 2010 no part of this document may be circulated, quoted or reproduced without prior written approval of Xchanging. MOSS Training – UI customization.
Modeling MongoDB with Relational Model Proposed by Christopher Polanco.
Chapter 18 Object Database Management Systems. Outline Motivation for object database management Object-oriented principles Architectures for object database.
Introduction to MongoDB. Database compared.
Your Data Any Place, Any Time Beyond Relational. Overview of Beyond Relational Applications Today Beyond Relational Feature Overview Whirlwind Feature.
Data and Information Systems Laboratory University of Illinois Urbana-Champaign Data Mining Meeting Mar, From SQL to NoSQL Xiao Yu Mar 2012.
NoSQL databases A brief introduction NoSQL databases1.
1 10 Systems Analysis and Design in a Changing World, 2 nd Edition, Satzinger, Jackson, & Burd Chapter 10 Designing Databases.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
CS422 Principles of Database Systems Introduction to NoSQL Chengyu Sun California State University, Los Angeles.
SQL Basics Review Reviewing what we’ve learned so far…….
Abstract MarkLogic Database – Only Enterprise NoSQL DB Aashi Rastogi, Sanket V. Patel Department of Computer Science University of Bridgeport, Bridgeport,
COMP 430 Intro. to Database Systems MongoDB. What is MongoDB? “Humongous” DB NoSQL, no schemas DB Lots of similarities with SQL RDBMs, but with more flexibility.
Managing Data Resources File Organization and databases for business information systems.
NO SQL for SQL DBA Dilip Nayak & Dan Hess.
and Big Data Storage Systems
MongoDB Er. Shiva K. Shrestha ME Computer, NCIT
NOSQL.
Dineesha Suraweera.
NOSQL databases and Big Data Storage Systems
1 Demand of your DB is changing Presented By: Ashwani Kumar
MANAGING DATA RESOURCES
Teaching slides Chapter 8.
Databases and Information Management
Databases and Information Management
Working with GEOLocation Data
Presentation transcript:

A Social blog using MongoDB ITEC-810 Final Presentation Lucero Soria Supervisor: Dr. Jian Yang

Agenda Introduction Methodology Outcomes Blog implementation MongoDB vs. Relational databases Conclusions 2

Agenda Introduction Methodology Outcomes Blog implementation MongoDB vs. Relational databases Conclusions 3

Problem Specification Relational Databases Management Systems (RDBMS), such as MySQL, do not provide the flexibility and scalability needed to manage social media data NoSQL databases, such as MongoDB, emerged to provide the features that modern applications demand such as flexibility, scalability and productivity 4

Project Aim Analyse the differences between MongoDB and relational databases, especially in supporting social media data 5

Background Sources MongoDB MongoDB Online Manual Online articles Relational databases MySQL 5.5 reference manual Social Media Management Handbook by Robert Wollan Online articles 6

Agenda Introduction Methodology Outcomes Blog implementation MongoDB vs. Relational databases Conclusions 7

Project Approach This project is a combination of analysis and development tasks 8 Research  MongoDB, social media data and relational databases Implement a social blog using MongoDB Based on the implementation and research: Analyse the differences between MongoDB and relational databases

Methodology Incremental methodology was used to implement the social blog Combines waterfall model with iterations 9

Agenda Introduction Methodology Outcomes Blog implementation MongoDB vs. Relational databases Conclusions 10

A social blog with MongoDB 11 Features implemented: Login with facebook to create user’s profile in MongoDB Create, edit and delete posts (text, photos or videos) Add comments Search by tags Sort by blogs with more comments

Analysis Based on our experience implementing the social blog, the most relevant features to manage social media data are: Handle irregular data Handle large binary objects (videos, photos) Operations Metadata Manage huge volume of data Handle geospatial queries 12

Relational data model Fixed-schema Assume well-defined structure data with a fixed number of fields (columns) and relationships Minimize redundancy and dependency  Normalization 13 Source:

Terminology RDBMSMongoDB TableCollection RowsJSON Document Index JoinEmbedding & Linking 14

Document-oriented data model MongoDB uses a document-oriented model using collections Main characteristics: Schema-less Collections can be created on-the-fly when first referenced Capped collections: Fixed size, older records dropped after limit reached Collections store documents 15

MongoDB Document Main characteristics: Are represented in a format called BSON (Binary JSON) Data is de-normalized No joins  Embedding & Linking { author: ‘Lucero', created: Date(‘ '), title: 'Yet another blog post', text: 'Here is the text...', tags: [ 'example', ‘lucero' ], comments: [ { author: 'jim', comment: 'I disagree' }, { author: 'nancy', comment: 'Good post' }] } 16

Storing irregular data Example: Different information in user profiles MongoDB Each document can have different information doc1 = {name: “Joe”, age: ”20”, interest: ”football” } doc2 = {name : “Michele”} Relational database Tables with all attributes NULL value in columns where data was not provided Results: Special queries to handle NULL values  Expensive 17

Managing large binary data MongoDB Divide a large file among multiples documents (GridFS) Include metadata to large files Search files base on its content Retrieve only the first N bytes of a video Relational database Use BLOB (Binary large objects) Inefficient manipulating rich media BLOB cannot be searched or manipulated using standard database command 18

Geospatial Indexes Queries to find the nearest N point to a current location MongoDB Embedded Geospatial features Relational database Spatial extensions MySQL implements a subset of the SQL with Geometry Types environment proposed by Open Geospatial Consortium (OGC) 19

Managing huge volume of data MongoDB High performance No joins and embedding makes reads and writes fast Indexes including indexing of keys from embedded documents and arrays Horizontal scalability Automatic sharding (auto-partitioning of data across servers) Relational database Have shown poor performance on certain data-intensive applications and delivering streaming media  Case study: Foursquare Difficult to scale to multiple servers 20

Agenda Introduction Methodology Outcomes Blog implementation MongoDB vs. Relational databases Conclusions 21

Conclusions Benefits that MongoDB offers over relational database: Flexible schema High performance Manipulation of large object files out of the box Embedded geospatial features However, MongoDB does not replace relational databases MongoDB and relational databases can coexist 22

Thank You! Q&A 23