INF1343, Winter 2011 Data Modeling and Database Design Yuri Takhteyev University of Toronto This presentation is licensed under Creative Commons Attribution.

Slides:



Advertisements
Similar presentations
No SQL is not about SQL No SQL is a Zoo.. Key-Value Stores Wide Column Stores Document Stores Graph Databases.
Advertisements

Session 3: Setting Up and Managing a Course - Supports for staff.
Jennifer Widom NoSQL Systems Overview (as of November 2011 )
Relational Database Alternatives NoSQL. Choosing A Data Model Relational database underpin legacy applications and meet business needs However, companies.
Reporter: Haiping Wang WAMDM Cloud Group
Regions of Interest.  What’s in a ROI?  Use cases  Requirements  Current Storage System  Problems  Alternative Storage.
MapReduce Simplified Data Processing on Large Clusters Google, Inc. Presented by Prasad Raghavendra.
CS 405G: Introduction to Database Systems 24 NoSQL Reuse some slides of Jennifer Widom Chen Qian University of Kentucky.
1 Yasin N. Silva Arizona State University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
B 葉彥廷 B 林廷韋 B 王頃恩. Why we choose this topic Introduction Programming Model Example Implementation Conclusion.
ZhangGang, Fabio, Deng Ziyan /31 NoSQL Introduction to Cassandra Data Model Design Implementation.
SOFTWARE SYSTEMS DEVELOPMENT MAP-REDUCE, Hadoop, HBase.
Database Design Normalisation. Last Session Looked at: –What databases were –Where they are used –How they are used.
NoSQL Or Peles. What is NoSQL A collection of various technologies meant to work around RDBMS limitations (mostly performance) Not much of a definition...
NoSQL Systems Motivation. NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT IT Monitoring WG Technology for Storage/Analysis 28 November 2011.
NoSQL: Graph Databases. Databases Why NoSQL Databases?
Data and Information Systems Laboratory University of Illinois Urbana-Champaign Data Mining Meeting Mar, From SQL to NoSQL Xiao Yu Mar 2012.
NoSQL databases A brief introduction NoSQL databases1.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
CCT396, Fall 2011 Database Design and Implementation Yuri Takhteyev University of Toronto This presentation is licensed under Creative Commons Attribution.
INF1343, Winter 2012 Data Modeling and Database Design Yuri Takhteyev Faculty of Information University of Toronto This presentation is licensed under.
CCT396, Fall 2011 Database Design and Implementation Yuri Takhteyev University of Toronto This presentation is licensed under Creative Commons Attribution.
INF1343, Winter 2012 Data Modeling and Database Design Yuri Takhteyev Faculty of Information University of Toronto This presentation is licensed under.
INF1343, Winter 2012 Data Modeling and Database Design Yuri Takhteyev Faculty of Information University of Toronto This presentation is licensed under.
CCT396, Fall 2011 Database Design and Implementation Yuri Takhteyev University of Toronto This presentation is licensed under Creative Commons Attribution.
CCT395, Week 4 Database Design and ER Modeling This presentation is licensed under Creative Commons Attribution License, v To view a copy of this.
INF1343, Winter 2012 Data Modeling and Database Design Yuri Takhteyev Faculty of Information University of Toronto This presentation is licensed under.
CCT395, Week 12 Storage, Grid Computing, Review This presentation is licensed under Creative Commons Attribution License, v To view a copy of this.
INF1343, Winter 2011 Data Modeling and Database Design Yuri Takhteyev University of Toronto This presentation is licensed under Creative Commons Attribution.
INF1343, Winter 2012 Data Modeling and Database Design Yuri Takhteyev Faculty of Information University of Toronto This presentation is licensed under.
INF1343, Week 6 Implementing a Database with SQL This presentation is licensed under Creative Commons Attribution License, v To view a copy of this.
CCT395, Week 11 Review, Permissions Physical Storage and Performance This presentation is licensed under Creative Commons Attribution License, v. 3.0.
CCT395, Week 6 Implementing a Database with SQL and a Case Study Exercise This presentation is licensed under Creative Commons Attribution License, v.
INF1343, Winter 2011 Data Modeling and Database Design Yuri Takhteyev University of Toronto This presentation is licensed under Creative Commons Attribution.
CCT395, Week 2 Introduction to SQL Yuri Takhteyev September 15, 2010
INF1343, Winter 2012 Data Modeling and Database Design Yuri Takhteyev Faculty of Information University of Toronto This presentation is licensed under.
1 Gaurav Kohli Xebia Breaking with DBMS and Dating with Relational Hbase.
CCT396, Fall 2011 Database Design and Implementation Yuri Takhteyev University of Toronto This presentation is licensed under Creative Commons Attribution.
CCT396, Fall 2011 Database Design and Implementation Yuri Takhteyev University of Toronto This presentation is licensed under Creative Commons Attribution.
INF1343, Week 4 Database Design and ER Modeling This presentation is licensed under Creative Commons Attribution License, v To view a copy of this.
CCT396, Fall 2011 Database Design and Implementation Yuri Takhteyev University of Toronto This presentation is licensed under Creative Commons Attribution.
CCT395, Week 5 Translating ER into Relations; Normalization This presentation is licensed under Creative Commons Attribution License, v To view a.
CCT395, Week 7 SQL with PHP This presentation is licensed under Creative Commons Attribution License, v To view a copy of this license, visit
NoSQL: Graph Databases
CS 405G: Introduction to Database Systems
NoSQL: Graph Databases
Database Design and Implementation
Translating ER into Relations; Normalization
Review Databases and Objects
Software Systems Development
Special Topics in CCIT: Databases
CCT1343, Week 2 Single-Table SQL Yuri Takhteyev University of Toronto
CS122B: Projects in Databases and Web Applications Winter 2017
MongoDB Er. Shiva K. Shrestha ME Computer, NCIT
NOSQL.
Applying the OSS Model to Other Domains
Database Design and Implementation
Data Modeling and Database Design
Data Modeling and Database Design INF1343, Winter 2012 Yuri Takhteyev
Database Design and Implementation
NOSQL databases and Big Data Storage Systems
Data Modeling and Database Design INF1343, Winter 2012 Yuri Takhteyev
NoSQL Systems Overview (as of November 2011).
Entity relationship diagrams
Normalization By Jason Park Fall 2005 CS157A.
NoSQL Databases Antonino Virgillito.
Database Systems Summary and Overview
April 13th – Semi-structured data
Normalization By Jason Park Fall 2005 CS157A.
Presentation transcript:

INF1343, Winter 2011 Data Modeling and Database Design Yuri Takhteyev University of Toronto This presentation is licensed under Creative Commons Attribution License, v To view a copy of this license, visit This presentation incorporates images from the Crystal Clear icon collection by Everaldo Coelho, available under LGPL from

Week 12 “NoSQL” Databases (A 30 Minute Introduction) + a Review Session

Backup mysqldump mysqldump -u kenobio1 -p starwars > starwars.sql mysql < starwars.sql (Add “create database starwars; use starwars;”) hot backup replication off-site copies

The Relational Model What's not to like?

Scaling Facebook.5 billion users 30 billion pieces of content per month 20 billion events per day = 200,000 events per second select url, count(*) from status join liking on status.id = liking.status_id group by status.url; Instead: v= &oid= &comments

“NoSQL” Storage Basic ideas: Key-Value Storage & Retrieval Documents or sparse relations Schema-free Distributed storage “Wide Column Store”: BigTable, HBase, Cassandra “Document Store”: CouchDB, MongoDB

MapReduce Map split the task into many parts all parts should be equivalent e.g.: count “likes” by URL for batches of 1 mln hits Reduce put together the results e.g: merge the batch counts into a total count

Cf. with Rel. Model Pros: horizontal scaling Cons: more work “you are not Google”

The Final Exam Date: Monday, April 4 Place: Bissel Rm. 114 (basement) Time: 9:10 am (arrive by 9 am) Duration: 2 hours 50 minutes Materials: pencils, erasers See the practice exam on the course website.

Kensington Airlines Design a database to record flight bookings, including seat assignments. Assume a single airline. Level 1: Just handle direct flights. Level 2: Handle connections. Level 3: Several people on 1 itinerary. Hint: 4, 5, and 6 entities respectively.

Normalization 2NF: 1NF + “all non-key attributes are functionally dependent on the entire primary key”. 3NF: 2NF + “no transitive dependencies” [of non-key attributes via non-key attributes]. BCNF (for comparison) : 3NF + “all determinants are candidate keys”.

Normalization film (film_id, imdb_id, english_title, original_title, year, directors, genre, duration, poster_image_file_small, poster_image_file_large) 1NF? 2NF? 3NF?

Normalization ingredient (ingredient_id, english_name, french_name, description, vegan?, vegetarian?)

Normalization rating (url, userid, username, rating) user (userid, username, password)

Normalization rating (url, userid, username, admin? rating )

Normalization rating (rating_id, url, userid, username, admin? rating )

Q & A