Download presentation
Presentation is loading. Please wait.
Published byJames Sherman Modified over 9 years ago
1
CPS 216: Advanced Database Systems Shivnath Babu
2
Outline for Today What this class is about: Data management What we will cover in this class Logistics What does a Database System mean to you? (Hint: What are they used for? Give examples)
3
Data Management Data Query User/Application DataBase Management System (DBMS)
4
Example: At a Company IDNameDeptIDSalary… 10Nemo12120K… 20Dory15679K… 40Gill8976K… 52Ray3485K… …………… IDName… 12IT… 34Accounts… 89HR… 156Marketing… ……… EmployeeDepartment Query 1: Is there an employee named “Nemo”? Query 2: What is “Nemo’s” salary? Query 3: How many departments are there in the company? Query 4: What is the name of “Nemo’s” department? Query 5: How many employees are there in the “Accounts” department?
5
DataBase Management System (DBMS)High-level Query Q DBMS Data Answer Translates Q into best execution plan for current conditions, runs plan
6
Example: Store that Sells Cars MakeModelOwnerID HondaAccord12 ToyotaCamry34 MiniCooper89 HondaAccord156 ……… IDNameAge 12Nemo22 34Ray42 89Gill36 156Dory21 ……… Cars Owners Filter (Make = Honda and Model = Accord) Join (Cars.OwnerID = Owners.ID) MakeModelOwnerIDIDNameAge HondaAccord12 Nemo22 HondaAccord156 Dory21 Owners of Honda Accords who are <= 23 years old Filter (Age <= 23)
7
DataBase Management System (DBMS)High-level Query Q DBMS Data Answer Translates Q into best execution plan for current conditions, runs plan Keeps data safe and correct despite failures, concurrent updates, online processing, etc.
8
DBMS is multi-user Example Get account balance from database; If balance > amount of withdrawal then balance = balance - amount of withdrawal; dispense cash; store new balance into database; Homer at ATM1 withdraws $100 Marge at ATM2 withdraws $50 Initial balance = $400, final balance = ? –Should be $250 no matter who goes first
9
Final balance = $250 read balance; $400 if balance > amount then balance = balance - amount; $300 write balance; $300 read balance; $300 if balance > amount then balance = balance - amount; $250 write balance; $250 Homer withdraws $100: Marge withdraws $50:
10
Final balance = $300 read balance; $400 if balance > amount then balance = balance - amount; $300 write balance; $300 read balance; $400 If balance > amount then balance = balance - amount; $350 write balance; $350 Homer withdraws $100: Marge withdraws $50:
11
Final balance = $350 read balance; $400 if balance > amount then balance = balance - amount; $300 write balance; $300 read balance; $400 if balance > amount then balance = balance - amount; $350 write balance; $350 Homer withdraws $100: Marge withdraws $50:
12
Concurrency control in DBMS Similar to concurrent programming problems –But data is not all in main-memory Appears similar to file system concurrent access? –Approach taken by MySQL initially; now MySQL offers better alternatives But want to control at much finer granularity Or else one withdrawal would lock up all accounts!
13
Recovery in DBMS Example: balance transfer decrement the balance of account X by $100; increment the balance of account Y by $100; Scenario 1: Power goes out after the first instruction Scenario 2: DBMS buffers and updates data in memory (for efficiency); before they are written back to disk, power goes out Log updates; undo/redo during recovery
14
DataBase Management System (DBMS)High-level Query Q DBMS Data Answer Translates Q into best execution plan for current conditions, runs plan Keeps data safe and correct despite failures, concurrent updates, online processing, etc.
15
Summary of modern DBMS features Persistent storage of data Logical data model; declarative queries and updates ! physical data independence Multi-user concurrent access Safety from system failures Performance, performance, performance –Massive amounts of data (terabytes ~ petabytes) –High throughput (thousands ~ millions transactions per minute) –High availability (¸ 99.999% uptime)
16
Modern DBMS Architecture Disk(s) Applications OS Parser Query Optimizer Query Executor Storage Manager Logical query plan Physical query plan Access method API calls SQL File system API calls Storage system API calls DBMS
17
Course Outline 40% of the class is about core DBMS concepts –Query execution, query optimization, transactions, recovery, etc. –Textbook material 60% of the class is on “what is happening today in data management” –New developments on textbook material –Data streams –Web search – Google, Yahoo! –Data integration (structured data + unstructured data) –Data mining –Unsolved challenges
18
Using a Traditional DBMS User/Application Loader QueryResult Table R Table S Result…Query…
19
New Approach for Data Streams User/Application Register Continuous Query (Standing Query) Stream Query Processor Input streams Result
20
Example Continuous (Standing) Queries Web –Amazon’s best sellers over last hour Network Intrusion Detection –Track HTTP packets with destination address matching a prefix in given table and content matching “*\.ida” Finance –Monitor NASDAQ stocks between $20 and $200 that have moved down more than 2% in the last 20 minutes
21
New Challenges in DBMSsHigh-level Query Q DBMS Answer Data TeraBytes PetaBytes Empire B. Bob Dylan USA Columbia 10.90
22
Course Logistics Reference: Database Systems: The Complete Book, by H. Garcia-Molina, J. D. Ullman, and J. Widom Web site: http://www.cs.duke.edu/courses/fall07/cps216 Grading: –Project 30% –Homework Assignments 20% –Midterm 20% –Final 30%
23
Summary: Data Management is Important Core aspect of most sciences and engineering today Core need in industry Cool mix of theory and systems Chances are you will find something interesting even if you primary interest is elsewhere
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.