1 CSE444: REVIEW. 2 CSE444 in one slide v Logical : E/R diagram  normalized relations v Physical : files, buffering, and indexes v Logical : Relational.

Slides:



Advertisements
Similar presentations
Concurrency Control Part 2 R&G - Chapter 17 The sequel was far better than the original! -- Nobody.
Advertisements

CS 540 Database Management Systems
1 Supplemental Notes: Practical Aspects of Transactions THIS MATERIAL IS OPTIONAL.
Database Management Systems 3ed, R. Ramakrishnan and Johannes Gehrke1 Evaluation of Relational Operations: Other Techniques Chapter 14, Part B.
Database Management Systems, R. Ramakrishnan and Johannes Gehrke1 Evaluation of Relational Operations: Other Techniques Chapter 12, Part B.
Final Exam Coverage. E/R Converting E/R to Relations. SQL. –Joins and outerjoins –Subqueries –Aggregations –Views –Inserts, updates, deletes –Ordering.
Midterm Review Lecture 14b. 14 Lectures So Far 1.Introduction 2.The Relational Model 3.Disks and Files 4.Relational Algebra 5.File Org, Indexes 6.Relational.
CS263 Lecture 19 Query Optimisation.  Motivation for Query Optimisation  Phases of Query Processing  Query Trees  RA Transformation Rules  Heuristic.
1 Review #1 l Intro stuff –What is a database, 4 parts, 3 users, etc. l Architecture –Data independence –Three levels, two mappings –Jobs of the DBA.
Database management concepts Database Management Systems (DBMS) An example of a database (relational) Database schema (e.g. relational) Data independence.
Query Optimization 3 Cost Estimation R&G, Chapters 12, 13, 14 Lecture 15.
1 Evaluation of Relational Operations: Other Techniques Chapter 12, Part B.
1 Query Processing: The Basics Chapter Topics How does DBMS compute the result of a SQL queries? The most often executed operations: –Sort –Projection,
Query Processing & Optimization
...Looking back Why use a DBMS? How to design a database? How to query a database? How does a DBMS work?
Chapter 8 Physical Database Design. McGraw-Hill/Irwin © 2004 The McGraw-Hill Companies, Inc. All rights reserved. Outline Overview of Physical Database.
Query Processing Presented by Aung S. Win.
Systems analysis and design, 6th edition Dennis, wixom, and roth
Final Exam Review Lecture 31. Administrivia Office hours 1:15 – 2:15 today –also available via
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Query Evaluation Chapter 12: Overview.
Physical Database Design Chapter 6. Physical Design and implementation 1.Translate global logical data model for target DBMS  1.1Design base relations.
1 Object Databases: Introduction. 2 Why OO? v Relational Systems are limited: –Structural restrictions on data –Missing semantics (value-based relationships)
©Silberschatz, Korth and Sudarshan13.1Database System Concepts Chapter 13: Query Processing Overview Measures of Query Cost Selection Operation Sorting.
Query Processing. Steps in Query Processing Validate and translate the query –Good syntax. –All referenced relations exist. –Translate the SQL to relational.
1 CS 430 Database Theory Winter 2005 Lecture 16: Inside a DBMS.
1 Concurrency Control II: Locking and Isolation Levels.
Object Oriented Database By Ashish Kaul References from Professor Lee’s presentations and the Web.
CS 257 Chapter – 15.9 Summary of Query Execution Database Systems: The Complete Book Krishna Vellanki 124.
1 Final Review Tuesday, March 6, The Final Date: Tuesday, March 13, 2007 Time: 6:30 - 8:30 Room: EE 037 You must come to campus Open book exam.
Introduction to Query Optimization, R. Ramakrishnan and J. Gehrke 1 Introduction to Query Optimization Chapter 13.
CS 405G: Introduction to Database Systems
Foundations of Business Intelligence: Databases and Information Management.
Final Exam Review Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY courtesy of Joe Hellerstein for some slides.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Introduction to Query Optimization Chapter 13.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.
Chapter 8 Physical Database Design. Outline Overview of Physical Database Design Inputs of Physical Database Design File Structures Query Optimization.
File Processing : Query Processing 2008, Spring Pusan National University Ki-Joune Li.
1 Overview of Query Evaluation Chapter Outline  Query Optimization Overview  Algorithm for Relational Operations.
Chapter 13: Query Processing
Christoph F. Eick: Final Words COSC Topics Covered in COSC 3480  Data models (ER, Relational, XML)  Using data models; learning how to store real.
Diskusi-08 Jelaskan dan berikan contoh penggunaan theta join, equijoin, natural join, outer join, dan semijoin The slides for this text are organized into.
Record Storage, File Organization, and Indexes
CS 540 Database Management Systems
Crash Recovery The slides for this text are organized into chapters. This lecture covers Chapter 20. Chapter 1: Introduction to Database Systems Chapter.
MODELS OF DATABASE AND DATABASE DESIGN
Storage and Indexes Chapter 8 & 9
Concurrency Control More !
Introduction to Query Optimization
The Entity-Relationship Model
Evaluation of Relational Operations: Other Operations
The Entity-Relationship Model
Crash Recovery The slides for this text are organized into chapters. This lecture covers Chapter 20. Chapter 1: Introduction to Database Systems Chapter.
Database management concepts
File Organizations and Indexing
File Organizations and Indexing
Crash Recovery The slides for this text are organized into chapters. This lecture covers Chapter 20. Chapter 1: Introduction to Database Systems Chapter.
Selected Topics: External Sorting, Join Algorithms, …
Introduction to Database Systems CSE 444 Lecture 23: Final Review
Final Review Topics Chapter 4 SQL,
Final Review Datalog (Ch 10) Rules and queries
Database management concepts
Evaluation of Relational Operations: Other Techniques
CSE594: REVIEW.
Database Systems (資料庫系統)
Lecture 24: Final Review Friday, March 10, 2006.
Introduction to Database Systems CSE 444 Lecture 23: Final Review
Final Review Friday, December 8, 2006.
Review #1 Intro stuff What is a database, 4 parts, 3 users, etc.
Evaluation of Relational Operations: Other Techniques
Presentation transcript:

1 CSE444: REVIEW

2 CSE444 in one slide v Logical : E/R diagram  normalized relations v Physical : files, buffering, and indexes v Logical : Relational algebra and SQL v Physical : join methods, optimization v CC/R choices affect correctness, performance v Object Database s extend, improve relations –… OODBMS, ORDBMS are converging? –… OQL, SQL3 are converging?

3 The E/R model v Entities vs. relationships v Attributes for entities and relationships v Keys and weak entities (no foreign keys!!) v Cardinality constraints v Participation (key) constraints v Translation to relations: –Entity becomes relation –Relationship becomes relation iff it is many-many –Other relationships: key of “parent” goes with “child”

4 FDs and Normal Forms v Determine Candidate Keys (CKs) v A field is prime iff it is in some CK v 1NF : All relations are in 1NF v 2NF : For each FD X  A: –A is prime OR X is not a proper subset of any CK v 3NF : For each FD X  A: –X is a superkey OR A is prime OR A  X (trivial) v BCNF : For each FD X  A: –X is a superkey OR A  X (trivial) v Normalize by decomposing R (ABC) to R1(AB) and R2 (BC), where B  C violates a NF

5 Physical DB Design v File organizations : heapfile, sorted file, hashed file v Disks : speed and physical characteristics v Buffer pool can vary in size; frame holds page v Replacement policy choice is important v Indexes can speed up data access: –Clustered vs. unclustered –Primary vs. secondary –Dense vs. sparse –B+ Trees are the most common: log(N) searches

6 Queries: Logical Aspects v Relational Algebra – , , , ,  –Joins kind of important too!!! u General join u Equijoin u Natural join –Transformations for optimization v SQL –Declarative –Relational calculus + algebra + grouping, etc. –Computationally in complete –Query execution: relational algebra operations

7 Queries: Physical Aspects v Implementation of relational algebra –File scan vs. index lookup vs. binary search –Exact-match vs. range queries –Impact of clustered vs. non-clustered indices –Join methods: u Block-Oriented Nested Loops (good idea, if buffers avail.) u Index Nested Loops (good idea, if index avail.) u Sort-Merge (good idea, if sorted or needs to be sorted) v Query optimization : cost estimates hard!!! –Large space of physical, logical alternatives –Prune space by considering only left-deep plans –Enables pipelined execution

8 Concurrency Control and Recovery v ACID transactions enforce CC&R v Strict 2PL “guarantees” serializability (C, I) –Deadlocks possible (detect and abort a XACT) –Phantom tuples possible (fix with index locking) v ARIES guarantees XACTS are A, D –Write-ahead logging –Log UNDO actions to allow complete replaying of history –Recovery phases: u Analyze (rebuild main memory structures) u REDO (rebuild data in buffer frames) u UNDO (abort uncommitted XACTS)

9 Object Databases v ORDBMS (e.g. Oracle 8i) vs. OODBMS (e.g. O2) v Structural aspects –Collection-valued fields –OIDs –Freely composable type constructors v Behavioral aspects (methods) v Adds new dimensions to RDBMS problems v SQL3, OQL standards are emerging

10 Current and Future DBMS Issues v New applications yield new techniques v New techniques yield new applications v Some “new” applications: –Data warehousing –On-line analytical processing (OLAP) –Data mining –Distributed data –Heterogeneous data and data integration –Scientific/sequential/ordered data –Partial or approximate query answers

11 Current and Future Issues (cont.) –Active DBs: rule management (ICs and triggers) –Real-time DBMS –Web-based DBMS –XML and semi-structured data –Spatial and high-dimensional data (lots of columns) –Special-purpose DBMSs –Digital Libraries –Geographic Information Systems –etc…..

12 Current and Future Issues (cont.) v Some “new” techniques: –New kinds of indices –Improved B Trees –Faster aggregation algorithms –New QP algorithms –Better optimization techniques –Data broadcasting –Generic data models –Faster sorting algorithms –New query languages –Deductive DBMSs

13 Current and Future Issues (cont.) –Object databases –New algebras –Query cost estimation –New locking and commit protocols –Main-memory databases –CC/R techniques for non-relational settings –DBMS interfaces, visualization tools –DBMS development tools –etc…. v BOTTOM LINE: Lots of opportunities for jobs, research, development, and fun !!!