Very Brief Background on RDBMSs, Big Data/NoSQL Systems, Machine Learning AnHai Doan.

Slides:



Advertisements
Similar presentations
CS 440 Database Management Systems Lecture 10: Transaction Management - Recovery 1.
Advertisements

Distributed databases
COS 461 Fall 1997 Transaction Processing u normal systems lose their state when they crash u many applications need better behavior u today’s topic: how.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 16 – Intro. to Transactions.
Ch1: File Systems and Databases Hachim Haddouti
DBMS Functions Data, Storage, Retrieval, and Update
CPSC-608 Database Systems Fall 2011 Instructor: Jianer Chen Office: HRBB 315C Phone: Notes #13.
BUSINESS DRIVEN TECHNOLOGY
1 Recap Database: –collection of data central to some enterprise that is managed by a Database Management System –reflection of the current state of the.
CSC 2720 Building Web Applications Database and SQL.
CPSC-608 Database Systems Fall 2010 Instructor: Jianer Chen Office: HRBB 315C Phone: Notes 1.
Transaction Management WXES 2103 Database. Content What is transaction Transaction properties Transaction management with SQL Transaction log DBMS Transaction.
COMP 5138 Relational Database Management Systems Semester 2, 2007 Lecture 8A Transaction Concept.
CMU SCS Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications C. Faloutsos – A. Pavlo How to Scale a Database System.
Dr. Kalpakis CMSC 461, Database Management Systems Introduction.
Transaction. A transaction is an event which occurs on the database. Generally a transaction reads a value from the database or writes a value to the.
Chapter 1 Introduction to Databases
Centralized and Client/Server Architecture and Classification of DBMS
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
Database Systems Chapter 1 The Worlds of Database Systems.
Introduction To Databases IDIA 618 Fall 2014 Bridget M. Blodgett.
INTRODUCTION TO TRANSACTION PROCESSING CHAPTER 21 (6/E) CHAPTER 17 (5/E)
Chapter 1: Introduction to DBMS & Databases. Database Management System (DBMS) What is a DBMS? What are some examples of Database Applications?
Databases with Scalable capabilities Presented by Mike Trischetta.
 Introduction Introduction  Purpose of Database SystemsPurpose of Database Systems  Levels of Abstraction Levels of Abstraction  Instances and Schemas.
6-1 DATABASE FUNDAMENTALS Information is everywhere in an organization Information is stored in databases –Database – maintains information about various.
Introduction. 
The Worlds of Database Systems Chapter 1. Database Management Systems (DBMS) DBMS: Powerful tool for creating and managing large amounts of data efficiently.
Functions of a Database Management System
1 CS 430 Database Theory Winter 2005 Lecture 1: Introduction.
Transaction Management: Concurrency Control CS634 Class 16, Apr 2, 2014 Slides based on “Database Management Systems” 3 rd ed, Ramakrishnan and Gehrke.
M1G Introduction to Database Development 6. Building Applications.
Modern Databases NoSQL and NewSQL Willem Visser RW334.
CS 162 Discussion Section Week 9 11/11 – 11/15. Today’s Section ●Project discussion (5 min) ●Quiz (10 min) ●Lecture Review (20 min) ●Worksheet and Discussion.
Relational Databases. Database Large collection of data in an organised format to allow access and control DBMS Database Management System - Special software.
Storing Organizational Information - Databases
1 CS 430 Database Theory Winter 2005 Lecture 16: Inside a DBMS.
The Client/Server Database Environment Ployphan Sornsuwit KPRU Ref.
Personal Computer - Stand- Alone Database  Database (or files) reside on a PC - on the hard disk.  Applications run on the same PC and directly access.
1 Database Concepts 2 Definition of a Database An organized Collection Of related records.
Database Management System (DBMS) an Introduction DeSiaMore 1.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
Introduction to Database Systems1. 2 Basic Definitions Mini-world Some part of the real world about which data is stored in a database. Data Known facts.
INFO1408 Database Design Concepts Week 15: Introduction to Database Management Systems.
1 CS3431 – Database Systems I Introduction Instructor: Mohamed Eltabakh
DATABASE ADMINISTRATION Pertemuan ke-7. Application Performance source : Database Administration the complete guide to practices and procedures chapter.
Programming Logic and Design Fourth Edition, Comprehensive Chapter 16 Using Relational Databases.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.
Chapter 10 Database Management. Data and Information How are data and information related? p Fig Next processing data stored on disk Step.
History & Motivations –RDBMS History & Motivations (cont’d) … … Concurrent Access Handling Failures Shared Data User.
Lecture 4 Page 1 CS 111 Online Modularity and Virtualization CS 111 On-Line MS Program Operating Systems Peter Reiher.
1 Intro stored procedures Declaring parameters Using in a sproc Intro to transactions Concurrency control & recovery States of transactions Desirable.
10 1 Chapter 10 - A Transaction Management Database Systems: Design, Implementation, and Management, Rob and Coronel.
ASET 1 Amity School of Engineering & Technology B. Tech. (CSE/IT), III Semester Database Management Systems Jitendra Rajpurohit.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 16 – Intro. to Transactions.
CS3431: C-Term CS3431 – Database Systems I Introduction Instructor: Mohamed Eltabakh
CSCI-235 Micro-Computers in Science Databases. Database Concepts Data is any unorganized text, graphics, sounds, or videos A database is a collection.
What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently and safely. Provide.
Introduction to Database Systems 1. Pop Quiz Question 1: How often do you use a database system or database system application? a)At least once a day.
COMP 430 Intro. to Database Systems Transactions, concurrency, & ACID.
On-Line Transaction Processing
Chapter 1: Introduction
Modern Databases NoSQL and NewSQL
The Client/Server Database Environment
Predictive Performance
What Should a DBMS Do? How will we do all this??
Transactions and Concurrency
OCR GCSE Computing © Hodder Education 2013 Slide 1
Database management systems
Presentation transcript:

Very Brief Background on RDBMSs, Big Data/NoSQL Systems, Machine Learning AnHai Doan

Database Management Systems At the start, people mixed data and code Soon recognized that many apps must deal with the same amount of data, and it can be huge So better to factor that amount of data into a separate place, to be managed separately => a DBMS Apps then connect to that DBMS A DBMS has several advantages –keep data in a persistent place, does not risk losing data –can be optimized for query answering –can handle multiple concurrent transactions, they do not interfere with one another –security, access control 2

Database Management Systems The first DBMSs assumes the user know how the data is laid out on disk When writing a query the user must figure out how to get data efficiently from disk Example: –note: this example assumes data is in table format so keep things simple –early DBMSs do not assume data is in table format Problems: changing data layout on disk is very hard 3

Example 4

Enter RDBMSs data will be stored in relational tables declarative query language –specify what the user wants –an example SQL query here data is stored in any way the system likes system will take the query and figure out how to answer it efficiently cost-based query optimization –estimate the cost of each plan, then pick the one with the lowest estimated cost well-defined notions of transactions methods to efficiently handle concurrent transactions 5

Examples of Transactions Matt is making too much money Transaction one: take 50K of Matt’s salary and add that to Jane’s salary Transaction two: compute the average salary If not careful, these two will interfere with each other ACID properties –atomic –consistent –isolation –durable These properties are typically ensured using locks on the data 6

Transactional Data Management What you have seen so far is transactional data management A lot of transactions, many concurrent ones Each touching just a small piece of data (read/write) Transaction management, aka concurrency control, is vital Focus on high throughput Also a lot of queries, many concurrent ones Each touching a small piece of data (just read) How to answer a lot of them efficiently (Each query is a transaction) Examples: buying airline tickets, checkout at grocery stores, … This is the typical data management paradigm up until early 90s 7

Web / Social Data Change All These Way more data Need to process them, not transaction centric, rather insight centric First question: where to store them? How to query them quickly? Problems with RDBMSs => NoSQL stores Distributed, no longer enforce ACID properties Second question: how to process them? This can take a very long time Either super computer, or parallel processing on a lot of commodity PCs => Big Data systems Web companies pioneer these, but soon virtually everyone has a lot of data, so everyone needs these 8

Machine Learning Supervised learning: classification Unsupervised learning: clustering 9

–10 Classification Training Predicting Each object is modeled using a set of feature-value pairs Feature engineering is a serious problem labels (weighted by confidence score) X (X 1,C 1 ) (X 2,C 2 )... (X m,C m ) Observed label Training examples Object Classification model (hypothesis)