Temple University – CIS Dept. CIS616– Principles of Database Systems V. Megalooikonomou Relational Model (based on notes by Silberchatz,Korth, and Sudarshan.

Slides:



Advertisements
Similar presentations
Ver 1,12/09/2012Kode :CCs 111,sistem basisdataFASILKOM Chapter 2: Relational Model Database System Concepts, 5th Ed. ©Silberschatz, Korth and Sudarshan.
Advertisements

©Silberschatz, Korth and Sudarshan3.1Database System Concepts Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Tuple Relational.
Chapter 2 Relational Model (part II) Hankz Hankui Zhuo
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Extended.
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Tuple Relational.
Midterm 2 Revision Prof. Sin-Min Lee Department of Computer Science San Jose State University.
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Extended Relational-Algebra-Operations Generalized Projection Outer Join Aggregate Functions.
Temple University – CIS Dept. CIS331– Principles of Database Systems V. Megalooikonomou Query by example (based on notes by Silberchatz,Korth, and Sudarshan.
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Extended.
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Tuple Relational.
Chapter 3: Relational Model
Database System Concepts, 5 th Ed. ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com Chapter 2: Relational.
SPRING 2004CENG 3521 E-R Diagram for the Banking Enterprise.
Slides adapted from A. Silberschatz et al. Database System Concepts, 5th Ed. SQL - part 2 - Database Management Systems I Alex Coman, Winter 2006.
Database System Concepts, 5 th Ed. ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com Chapter 2: Relational.
Midterm 2 Revision Prof. Sin-Min Lee Department of Computer Science San Jose State University.
©Silberschatz, Korth and Sudarshan1Database System Concepts Tuple and Domain Calculus Tuple Relational Calculus Domain Relational Calculus.
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Calculus Objectives  Tuple Calculus  Domain Calculus  QBE & SQL  Related Examples.
Slides adapted from A. Silberschatz et al. Database System Concepts, 5th Ed. Relational Model Database Management Systems I Alex Coman, Winter 2006.
Relational Model. 2 Structure of Relational Databases Fundamental Relational-Algebra-Operations Additional Relational-Algebra-Operations Extended Relational-Algebra-Operations.
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Tuple Relational.
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Tuple Relational.
Temple University – CIS Dept. CIS331– Principles of Database Systems V. Megalooikonomou Relational Model III (based on notes by Silberchatz,Korth, and.
Database System Concepts, 5 th Ed. ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com Chapter 2: Relational.
©Silberschatz, Korth and Sudarshan2.1Database System Concepts - 5 th Edition, Oct 5, 2006 Outer Join n An extension of the join operation that avoids loss.
Database System Concepts, 5 th Ed. ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com Chapter 2: Relational.
CIS552Relational Model1 Structure of Relational Database Relational Algebra Extended Relational-Algebra-Operations Modification of the Database.
CMU SCS Carnegie Mellon Univ. Dept. of Computer Science Database Applications Lecture#6: Relational calculus.
3.1Database System Concepts Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Tuple Relational Calculus Domain Relational.
Midterm 2 Revision Prof. Sin-Min Lee Department of Mathematics and Computer Science San Jose State University.
Computing & Information Sciences Kansas State University Monday, 08 Sep 2008CIS 560: Database System Concepts Lecture 5 of 42 Monday, 08 September 2008.
Relational Model By Dr.S.Sridhar, Ph.D.(JNUD), RACI(Paris, NICE), RMR(USA), RZFM(Germany) DIRECTOR ARUNAI ENGINEERING COLLEGE TIRUVANNAMALAI.
Yufis Azhar – Teknik Informatika – UMM.  Aggregation function takes a collection of values (of a single attribute) and returns a single value as a result.
Advanced Database Systems By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
©Silberschatz, Korth and Sudarshan5.1Database System Concepts Chapter 5: Other Relational Query Languages Tuple Relational Calculus Domain Relational Calculus.
ICOM 5016 – Introduction to Database Systems Lecture 8 Dr. Manuel Rodriguez Department of Electrical and Computer Engineering University of Puerto Rico,
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Extended.
Chapter 2: Relational Model. 2.2 Chapter 2: Relational Model Structure of Relational Databases Fundamental Relational-Algebra-Operations Additional Relational-Algebra-Operations.
Chapter 2: Relational Model. 2.2Unite International CollegeDatabase Management Systems Chapter 2: Relational Model Structure of Relational Databases Fundamental.
Carnegie Mellon Carnegie Mellon Univ. Dept. of Computer Science Database Applications C. Faloutsos Relational model.
Temple University – CIS Dept. CIS331– Principles of Database Systems V. Megalooikonomou Relational Model I (based on notes by Silberchatz,Korth, and Sudarshan.
2.1 Chapter 2: Relational Model. 2.2 Chapter 2: Relational Model Structure of Relational Databases Fundamental Relational-Algebra-Operations Additional.
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Extended.
Carnegie Mellon Carnegie Mellon Univ. Dept. of Computer Science Database Applications C. Faloutsos Relational domain calculus.
CMU SCS Carnegie Mellon Univ. Dept. of Computer Science /615 – DB Applications C. Faloutsos & A. Pavlo Lecture#5: Relational calculus.
Database System Concepts, 5th Ed. Bin Mu at Tongji University Chapter 5: Other Relational Languages.
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Extended Relational-Algebra-Operations Generalized Projection Aggregate Functions Outer Join.
Computing & Information Sciences Kansas State University Friday, 08 Sep 2006CIS 560: Database System Concepts Lecture 07 of 42 Tuesday, 30 January 2007.
Midterm 2 Revision Prof. Sin-Min Lee Department of Computer Science San Jose State University.
Database System Concepts, 5 th Ed. Bin Mu at Tongji University Chapter 2: Relational Model.
Computing & Information Sciences Kansas State University Wednesday, 03 Sep 2008CIS 560: Database System Concepts Lecture 3 of 42 Wednesday, 03 September.
Carnegie Mellon Carnegie Mellon Univ. Dept. of Computer Science Database Applications C. Faloutsos Relational tuple calculus.
IS 230Lecture 7Slide 1 Relational Algebra Lecture 8 Text Book – Chapter.
Chapter 3: Relational Model III Additional Relational Algebra Operations Additional Relational Algebra Operations Views Views.
International Computer Institute, Izmir, Turkey Relational Model Asst.Prof.Dr.İlker Kocabaş UBİ502 at
©Silberschatz, Korth and Sudarshan3.1Database System Concepts Chapter 3: Relational Model Structure of Relational Databases Relational Algebra Tuple Relational.
PMIT-6102 Advanced Database Systems By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
Database System Concepts, 5 th Ed. ©Silberschatz, Korth and Sudarshan Chapter 2: Relational Model.
International Computer Institute, Izmir, Turkey Relational Model Asst.Prof.Dr.İlker Kocabaş UBİ502 at
Midlands State University Topic: Relational DB Model
Temple University – CIS Dept. CIS661– Principles of Database Systems
Relational Model By Dr.S.Sridhar, Ph.D.(JNUD), RACI(Paris, NICE), RMR(USA), RZFM(Germany)
Chapter 3: Relational Model III
Lecture #4: Relational Algebra
Chapter 2: Relational Model
Database Applications (15-415) Relational Calculus Lecture 6, September 6, 2016 Mohammad Hammoud.
Lecture 4 of 42 Relational Joins Wednesday, 30 January 2008
Lecture#5: Relational calculus
Chapter 2: Relational Model
CS 405G: Introduction to Database Systems
Presentation transcript:

Temple University – CIS Dept. CIS616– Principles of Database Systems V. Megalooikonomou Relational Model (based on notes by Silberchatz,Korth, and Sudarshan and notes by C. Faloutsos at CMU)

Overview history concepts Formal query languages relational algebra rel. tuple calculus rel. domain calculus

History before: records, pointers, sets etc introduced by E.F. Codd ( ) in 1970 revolutionary!!! first systems: (System R; Ingres) Turing award in 1981

Concepts Database: a set of relations (= tables) rows: tuples columns: attributes (or keys) superkey, candidate key, primary key

Example Database:

Example: cont’d Database: rel. schema (attr+domains) tuple k-th attribute (Dk domain)

Example: cont’d rel. schema (attr+domains) instance

Example: cont’d rel. schema (attr+domains) instance Di: the domain of the I-th attribute (eg., char(10) Formally: an instance is a subset of (D1 x D2 x …x Dn)

Example: cont’d superkey (eg., ‘ssn, name’): determines record cand. key (eg., ‘ssn’, or ‘st#’): minimal superkey (no subset of it is a superkey) primary key: one of the cand. keys

Another example Example: if Customer-name = {Jones, Smith, Curry, Lindsay} Customer-street = {Main, North, Park} Customer-city = {Harrison, Rye, Pittsfield} Then r = { (Jones, Main, Harrison), (Smith, North, Rye), (Curry, North, Rye), (Lindsay, Park, Pittsfield)} is a relation over Customer-name x Customer-street x Customer-city

Relations, tuples Relation Schema: A 1, A 2, …, A n are attributes R = (A 1, A 2, …, A n ) is a relation schema E.g. Customer-schema = (customer-name, customer-street, customer-city) r(R) is a relation on the relation schema R E.g.customer (Customer-schema) Relations are unordered Order of tuples is irrelevant (tuples may be stored in an arbitrary order)

Database A database consists of multiple relations Information about an enterprise is broken up into parts, with each relation storing one part of the information E.g.: account : stores information about accounts depositor : stores information about which customer owns which account customer : stores information about customers Storing all information as a single relation such as bank(account-number, balance, customer-name,..) results in repetition of information (e.g. two customers own an account) the need for null values (e.g. represent a customer without an account) Normalization theory (discuss later) deals with how to design relational schemas

Overview history concepts Formal query languages relational algebra rel. tuple calculus rel. domain calculus

Formal query languages How do we collect information? Eg., find ssn’s of people in cis331 (recall: everything is a set!) One solution: Relational algebra, i.e., set operators (procedural language) Q1: Which operators?? Q2: What is a minimal set of operators?

. set union U set difference ‘-’ Relational operators

Example: Q: find all students (part or full time) A: PT-STUDENT union FT-STUDENT

Observations: two tables are ‘union compatible’ if they have the same attributes (i.e., same arity: number of attributes and same ‘domains’) Q: how about intersection ? U

Observations: A: redundant: STUDENT intersection STAFF = STUDENT - (STUDENT - STAFF) STUDENT STAFF

. set union set difference ‘-’ Relational operators U

Other operators? E.g., find all students on ‘Main street’ A: ‘selection’

Other operators? Notice: selection (and rest of operators) expect tables, and produce tables --> can be cascaded!! For selection, in general:

Selection - examples Find all ‘Smiths’ on ‘Forbes Ave’ ‘condition’ can be any boolean combination of ‘=‘, ‘>’, ‘>=‘,...

selection. set union set difference R - S Relational operators R U S

selection picks rows - how about columns? A: ‘projection’ - eg.: finds all the ‘ssn’ - removing duplicates Relational operators

Cascading: ‘find ssn of students on ‘forbes ave’ Relational operators

selection projection. set union set difference R - S Relational operators R U S

Are we done yet? Q: Give a query we can not answer yet! Relational operators

A: any query across two or more tables, eg., ‘find names of students in cis351’ Q: what extra operator do we need?? A: surprisingly, the cartesian product is enough! Relational operators TAKES SSNc-idgrade 123cis331A 234cis331B

Cartesian product E.g., dog-breeding: MALE x FEMALE gives all possible couples x =

so what? Eg., how do we find names of students taking cis351? TAKES SSNc-idgrade 123cis331A 234cis331B

Cartesian product A: SsnNameAddressssncidgrade 123smithmain str123cis331A 234jonesforbes ave123cis331A 123smithmain str234cis331B 234jonesforbes ave234cis331B

Cartesian product SsnNameAddressssncidgrade 123smithmain str123cis331A 234jonesforbes ave123cis331A 123smithmain str234cis331B 234jonesforbes ave234cis331B

SsnNameAddressssncidgrade 123smithmain str123cis331A 234jonesforbes ave123cis331A 123smithmain str234cis331B 234jonesforbes ave234cis331B

selection projection cartesian product MALE x FEMALE set union set difference R - S FUNDAMENTAL Relational operators R U S

Relational ops Surprisingly, they are enough, to help us answer almost any query we want!! derived operators, for convenience set intersection join (theta join, equi-join, natural join) ‘rename’ operator division

Joins Equijoin:

Cartesian product A: SsnNameAddressssncidgrade 123smithmain str123cis331A 234jonesforbes ave123cis331A 123smithmain str234cis331B 234jonesforbes ave234cis331B

Joins Equijoin: theta-joins: generalization of equi-join - any condition

Joins very popular: natural join: R S like equi-join, but it drops duplicate columns: STUDENT(ssn, name, address) TAKES(ssn, cid, grade)

Joins nat. join has 5 attributes SsnNameAddressssncidgrade 123smithmain str123cis331A 234jonesforbes ave123cis331A 123smithmain str234cis331B 234jonesforbes ave234cis331B equi-join: 6

Natural Joins - nit-picking if no attributes in common between R, S: nat. join -> cartesian product:

Overview - rel. algebra fundamental operators derived operators joins etc rename division examples

rename op. Q: why? A: Shorthand (BEFORE can be a relational algebra expression) self-joins; … for example, find the grand-parents of ‘Tom’, given PC(parent-id, child-id)

rename op. PC(parent-id, child-id)

rename op. first, WRONG attempt: (why? how many columns?) Second WRONG attempt:

rename op. we clearly need two different names for the same table - hence, the ‘rename’ op.

Overview - rel. algebra fundamental operators derived operators joins etc rename division examples

Division Rarely used, but powerful. Suited for queries that include the phrase “for all” Example: find suspicious suppliers, i.e., suppliers that supplied all the parts in A_BOMB

Division

Observations: ~reverse of cartesian product It can be derived from the 5 fundamental operators (!!) How?

Division Answer: gives those tuples t in such that for some tuple u in S, tu not in R.

Overview - rel. algebra fundamental operators derived operators joins etc rename division examples

Sample schema CLASS c-idc-nameunits cis331d.b.2 cis321o.s.2 TAKES SSNc-idgrade 123cis331A 234cis331B find names of students that take cis351

Examples find names of students that take cis351

Sample schema find course names of ‘smith’ CLASS c-idc-nameunits cis331d.b.2 cis321o.s.2 TAKES SSNc-idgrade 123cis331A 234cis331B

Examples find course names of ‘smith’

Examples find ssn of ‘overworked’ students, ie., that take cis331, cis342, cis350

Examples find ssn of ‘overworked’ students, ie., that take cis331, cis342, cis350: almost correct answer: )( )( )( TAKES namec c c        

Examples find ssn of ‘overworked’ students, ie., that take cis331, cis342, cis350 - Correct answer: )]([ ([ ([ TAKES namecssn namecssn namecssn        

Examples find ssn of students that work at least as hard as ssn=123 (ie., they take all the courses of ssn=123, and maybe more)

Sample schema CLASS c-idc-nameunits cis331d.b.2 cis321o.s.2 TAKES SSNc-idgrade 123cis331A 234cis331B

Examples find ssn of students that work at least as hard as ssn=123 (ie., they take all the courses of ssn=123, and maybe more

Conclusions Relational model: only tables (‘relations’) relational algebra: powerful, minimal: 5 operators can handle almost any query! most non-trivial op.: join

The bank database

Temple University – CIS Dept. CIS616– Principles of Database Systems V. Megalooikonomou Relational Model II (based on notes by Silberchatz,Korth, and Sudarshan and notes by C. Faloutsos at CMU)

Overview history concepts Formal query languages relational algebra rel. tuple calculus rel. domain calculus

Extended Relational-Algebra- Operations Generalized Projection Outer Join Aggregate Functions

Generalized Projection Extends the projection operation by allowing arithmetic functions to be used in the projection list  F1, F2, …, Fn (E) E is any relational-algebra expression Each of F 1, F 2, …, F n are arithmetic expressions involving constants and attributes in the schema of E. Given relation credit-info(customer-name, limit, credit-balance), find how much more each person can spend:  customer-name, limit – credit-balance (credit-info)

Aggregate Functions and Operations Aggregation function takes a collection of values and returns a single value as a result ?

Aggregate Functions and Operations Aggregation function takes a collection of values and returns a single value as a result avg: average value min: minimum value max: maximum value sum: sum of values count: number of values Aggregate operation in relational algebra G 1, G 2, …, G n g F 1 ( A 1), F 2 ( A 2),…, F n ( A n) (E) E is any relational-algebra expression G 1, G 2 …, G n is a list of attributes on which to group (can be empty) Each F i is an aggregate function Each A i is an attribute name

Aggregate Operation – Example Relation r : AB   C g sum(c) (r) sum-C 27

Aggregate Operation – Example Relation account grouped by branch-name: branch-name g sum(balance) (account) branch-nameaccount-numberbalance Perryridge Brighton Redwood A-102 A-201 A-217 A-215 A branch-namebalance Perryridge Brighton Redwood

Aggregate Functions (Cont.) Result of aggregation does not have a name Use rename operation For convenience, we permit renaming as part of aggregate operation branch-name g sum(balance) as sum-balance (account)

Outer Join An extension of the join operation that avoids loss of information Computes the join and then adds tuples from one relation that does not match tuples in the other relation to the result of the join Uses null values: null signifies that the value is unknown or does not exist All comparisons involving null are (roughly speaking) false by definition precise meaning of comparisons with nulls later

Outer Join – Example Relation loan loan-numberamount L-170 L-230 L Relation borrower customer-nameloan-number Jones Smith Hayes L-170 L-230 L-155 branch-name Downtown Redwood Perryridge

Outer Join – Example Inner Join loan borrower loan borrower Left Outer Join loan-numberamount L-170 L customer-name Jones Smith branch-name Downtown Redwood loan-numberamount L-170 L-230 L customer-name Jones Smith null branch-name Downtown Redwood Perryridge

Outer Join – Example Right Outer Join loan borrower Full Outer Join loan-numberamount L-170 L-230 L null customer-name Jones Smith Hayes branch-name Downtown Redwood null loan-numberamount L-170 L-230 L-260 L null customer-name Jones Smith null Hayes branch-name Downtown Redwood Perryridge null

Null Values It is possible for tuples to have a null value, denoted by null, for some of their attributes null signifies unknown value or a value that does not exist The result of any arithmetic expression involving null is null Aggregate functions ignore null values An arbitrary decision. Could have returned null as result instead Follow semantics of SQL in its handling of null values For duplicate elimination and grouping, null is treated like any other value, and two nulls are assumed to be the same Alternative: assume each null is different from each other Both arbitrary decisions, we follow SQL

Null Values Comparisons with null values return the special truth value unknown If false was used instead of unknown, then not (A = 5 Three-valued logic using the truth value unknown: OR: (unknown or true) = true, (unknown or false) = unknown (unknown or unknown) = unknown AND: (true and unknown) = unknown, (false and unknown) = false, (unknown and unknown) = unknown NOT: (not unknown) = unknown In SQL “P is unknown” evaluates to true if predicate P evaluates to unknown Result of select predicate is treated as false if it evaluates to unknown

Modification of the Database The content of the database may be modified using the following operations: Deletion Insertion Updating All these operations are expressed using the assignment operator

The bank database

Deletion Expressed similarly to a query Instead of displaying tuples, the selected tuples are removed from the database Can delete only whole tuples cannot delete values on only particular attributes A deletion is expressed in relational algebra by: r  r – E where r is a relation and E is a relational algebra query.

Deletion Examples Delete all account records in the Perryridge branch account  account –  branch-name = “Perryridge” (account) Delete all loan records with amount in the range of 0 to 50 loan  loan –   amount  0  and amount  50 (loan) Delete all accounts at branches located in Needham r 1    branch-city = “Needham” (account branch) r 2   branch-name, account-number, balance (r 1 ) r 3   customer-name, account-number (r 2 depositor) account  account – r 2 depositor  depositor – r 3

Insertion To insert data into a relation, we either: specify a tuple to be inserted write a query whose result is a set of tuples to be inserted in relational algebra, an insertion is expressed by?

Insertion To insert data into a relation, we either: specify a tuple to be inserted write a query whose result is a set of tuples to be inserted in relational algebra, an insertion is expressed by: r  r  E where r is a relation and E is a relational algebra expression Insertion of a single tuple: let E be a constant relation containing one tuple

Insertion Examples Insert information in the database specifying that Smith has $1200 in account A-973 at the Perryridge branch. account  account  {(“Perryridge”, A-973, 1200)} depositor  depositor  {(“Smith”, A-973)} Provide as a gift for all loan customers in the Perryridge branch, a $200 savings account. Let the loan number serve as the account number for the new savings account. r 1  (  branch-name = “Perryridge” (borrower loan)) account  account  (  branch-name, loan-number (r 1 )) x {(200)}) OR account  account   branch-name, account-number,200 (r 1 ) depositor  depositor   customer-name, loan-number,(r 1 )

Updating A mechanism to change a value in a tuple without charging all values in the tuple Use the generalized projection operator r   F1, F2, …, FI, (r) Each F, is either the i-th attribute of r, if the i-th attribute is not updated, or, if the attribute is to be updated F i is an expression, involving only constants and the attributes of r, which gives the new value for the attribute

Update Examples Make interest payments by increasing all balances by 5 percent account   AN, BN, BAL * 1.05 (account) where AN, BN and BAL stand for account-number, branch- name and balance, respectively Pay all accounts with balances over $10,000 6 percent interest and pay all others 5 percent account   AN, BN, BAL * 1.06 (  BAL  (account))   AN, BN, BAL * 1.05 (  BAL  (account))

Views … if it is not desirable for all users to see the entire logical model (i.e., all the relations stored in the database) A person who needs to know a customer’s loan number but has no need to see the loan amount can see the relation:  customer-name, loan-number (borrower loan) Any relation that is not of the conceptual model but is made visible to a user as a “virtual relation” is called a view.

View Definition A view is defined using the create view statement which has the form create view v as where is any legal relational algebra query expression. The view name, v, can be used to refer to the virtual relation View definition  creating a new relation by evaluating the query expression. Rather, a view definition causes the saving of an expression to be substituted into queries using the view Materialized views

View Examples Consider the view (named all-customer) consisting of branches and their customers create view all-customer as  branch-name, customer-name (depositor account)   branch-name, customer-name (borrower loan) We can find all customers of the Perryridge branch by writing:  customer-name (  branch-name = “Perryridge” (all-customer))

Updates Through View Database modifications expressed as views must be translated to modifications of the actual relations in the database E.g., a person who needs to see all loan data in the loan relation except amount. The view is defined as: create view branch-loan as  branch-name, loan-number (loan) Since we allow a view name to appear wherever a relation name is allowed, the person may write: branch-loan  branch-loan  {(“Perryridge”, L-37)}

Updates Through Views (Cont’d.) The previous insertion must be represented by an insertion into the actual relation loan from which the view branch- loan is constructed An insertion into loan requires a value for amount. The insertion can be dealt with by either: rejecting the insertion and returning an error message to the user inserting a tuple (“L-37”, “Perryridge”, null ) into the loan relation Some updates through views are impossible to translate into database relation updates create view v as  branch-name = “Perryridge” (account)) v  v  (L-99, Downtown, 23) Others cannot be translated uniquely all-customer  all-customer  (Perryridge, John) Have to choose loan or account, and create a new loan/account number!

Views Defined Using Other Views One view may be used in the expression defining another view A view relation v 1 is said to depend directly on a view relation v 2 if v 2 is used in the expression defining v 1 A view relation v 1 is said to depend on view relation v 2 if either v 1 depends directly to v 2 or there is a path of dependencies from v 1 to v 2 A view relation v is said to be recursive if it depends on itself

View Expansion Define the meaning of views in terms of other views… Let view v 1 be defined by an expression e 1 that may itself contain uses of view relations View expansion of an expression repeats the following replacement step: repeat Find any view relation v i in e 1 Replace the view relation v i by the expression defining v i until no more view relations are present in e 1 This loop will terminate as long as …?

View Expansion Define the meaning of views in terms of other views… Let view v 1 be defined by an expression e 1 that may itself contain uses of view relations View expansion of an expression repeats the following replacement step: repeat Find any view relation v i in e 1 Replace the view relation v i by the expression defining v i until no more view relations are present in e 1 This loop will terminate as long as the view definitions are not recursive

General Overview - rel. model history concepts Formal query languages relational algebra rel. tuple calculus rel. domain calculus

Overview - detailed Relational tuple calculus Why do we need it? Details Examples Equivalence with rel. algebra More examples; ‘safety’ of expressions Rel. domain calculus + QBE

Motivation Q: weakness of rel. algebra? A: procedural describes the steps (i.e., ‘how’) … still useful, for query optimization though

Solution: rel. calculus describes what we want two equivalent flavors: ‘tuple’ calculus and ‘domain’ calculus basis for SQL and QBE, resp.

Rel. tuple calculus (RTC) first order logic ‘Give me tuples ‘t’, satisfying predicate P – e.g.:

..or Tuple Relational Calculus A nonprocedural query language, where each query is of the form {t | P (t) } It is the set of all tuples t such that predicate P is true for t t is a tuple variable, t [A] denotes the value of tuple t on attribute A t  r denotes that tuple t is in relation r P is a formula similar to that of the predicate calculus

Details symbols allowed: quantifiers: “for all”, “there exists” Atom:

Specifically Formula: Atom if P1, P2 are formulas, so are if P(s) is a formula, so are

Specifically Reminders: DeMorgan implication: double negation: ‘every human is mortal : no human is immortal’

Reminder: our Mini-U db CLASS c-idc-nameunits cis331d.b.2 cis321o.s.2 TAKES SSNc-idgrade 123cis331A 234cis331B

Examples find all student records output tuple of type ‘STUDENT’

Examples (selection) find student record with ssn=123

Examples (selection) find student record with ssn=123

Examples (projection) find name of student with ssn=123

Examples (projection) find name of student with ssn=123 ‘t’ has only one column

‘Tracing’ s t

Examples cont’d (union) get records of both PT and FT students

Examples cont’d (union) get records of both PT and FT students

Examples difference: find students that are not staff (assuming that STUDENT and STAFF are union-compatible)

Examples difference: find students that are not staff

Cartesian product e.g., dog-breeding: MALE x FEMALE gives all possible couples x =

Cartesian product find all the pairs of (male, female)

‘Proof’ of equivalence rel. algebra rel. tuple calculus

Overview - detailed rel. tuple calculus why? details examples equivalence with rel. algebra more examples; ‘safety’ of expressions re. domain calculus + QBE

More examples join: find names of students taking cis351

Reminder: our Mini-U db CLASS c-idc-nameunits cis331d.b.2 cis321o.s.2 TAKES SSNc-idgrade 123cis331A 234cis331B

More examples join: find names of students taking cis351

More examples join: find names of students taking cis351 projection selection join

More examples 3-way join: find names of students taking a 2-unit course

Reminder: our Mini-U db

More examples 3-way join: find names of students taking a 2-unit course selection projection join

More examples 3-way join: find names of students taking a 2-unit course - in rel. algebra??

Even more examples: self -joins: find Tom’s grandparent(s)

Even more examples: self -joins: find Tom’s grandparent(s)

Hard examples: DIVISION find suppliers that shipped all the ABOMB parts

Hard examples: DIVISION find suppliers that shipped all the ABOMB parts

General pattern three equivalent versions: 1) if it’s bad, he shipped it 2)either it was good, or he shipped it 3) there is no bad shipment that he missed

More on division find (SSNs of) students that take all the courses that ssn=123 does (and maybe even more) find students ‘s’ so that if 123 takes a course => so does ‘s’

More on division find students that take all the courses that ssn=123 does (and maybe even more)

Safety of expressions FORBIDDEN: It has infinite output!! Instead, always use

Safety of expressions Possible to write tuple calculus expressions that generate infinite relations, e.g., {t |  t  r } results in an infinite relation if the domain of any attribute of relation r is infinite To guard against the problem, we restrict the set of allowable expressions to safe expressions. An expression {t | P (t) } in the tuple relational calculus is safe if every component of t appears in one of the relations, tuples, or constants that appear in P

More examples: Banking example branch (branch-name, branch-city, assets) customer (customer-name, customer-street, customer-city) account (account-number, branch-name, balance) loan (loan-number, branch-name, amount) depositor (customer-name, account-number) borrower (customer-name, loan-number)

Example Queries Find the loan-number, branch-name, and amount for loans of over $1200 {t | t  loan  t [amount]  1200} Find the loan number for each loan of an amount greater than $1200 {t |  s  loan (t [loan-number] = s [loan-number]  s [amount]  1200} Notice that a relation on schema [loan-number] is implicitly defined by the query

Example Queries Find the names of all customers having a loan, an account, or both at the bank {t |  s  borrower(t[customer-name] = s[customer-name])   u  depositor(t[customer-name] = u[customer- name]) Find the names of all customers who have a loan and an account at the bank {t |  s  borrower(t[customer-name] = s[customer-name])   u  depositor(t[customer-name] = u[customer- name])

Example Queries Find the names of all customers having a loan at the Perryridge branch {t |  s  borrower(t[customer-name] = s[customer-name]   u  loan(u[branch-name] = “Perryridge”  u[loan-number] = s[loan-number]))} Find the names of all customers who have a loan at the Perryridge branch, but no account at any branch of the bank {t |  s  borrower(t[customer-name] = s[customer-name]   u  loan(u[branch-name] = “Perryridge”  u[loan-number] = s[loan-number]))  not  v  depositor (v[customer-name] = t[customer-name]) }

Example Queries Find the names of all customers having a loan from the Perryridge branch, and the cities they live in {t |  s  loan(s[branch-name] = “Perryridge”   u  borrower (u[loan-number] = s[loan-number]  t [customer-name] = u[customer-name])   v  customer (u[customer-name] = v[customer-name]  t[customer-city] = v[customer-city])))}

Example Queries Find the names of all customers who have an account at all branches located in Brooklyn: {t |  c  customer (t[customer.name] = c[customer-name])   s  branch(s[branch-city] = “Brooklyn”   u  account ( s[branch-name] = u[branch-name]   s  depositor ( t[customer-name] = s[customer-name]  s[account-number] = u[account-number] )) )}

Temple University – CIS Dept. CIS616– Principles of Database Systems V. Megalooikonomou Relational Model III (based on notes by Silberchatz,Korth, and Sudarshan and notes by C. Faloutsos at CMU)

Overview history concepts Formal query languages relational algebra rel. tuple calculus rel. domain calculus

General Overview - rel. model history concepts Formal query languages relational algebra rel. tuple calculus rel. domain calculus

Overview - detailed rel. tuple calculus why? details examples equivalence with rel. algebra more examples; ‘safety’ of expressions rel. domain calculus + QBE

Safety of expressions FORBIDDEN: It has infinite output!! Instead, always use

Safety of expressions Possible to write tuple calculus expressions that generate infinite relations, e.g., {t |  t  r } results in an infinite relation if the domain of any attribute of relation r is infinite To guard against the problem, we restrict the set of allowable expressions to safe expressions. An expression {t | P (t) } in the tuple relational calculus is safe if every component of t appears in one of the relations, tuples, or constants that appear in P

More examples: Banking example branch (branch-name, branch-city, assets) customer (customer-name, customer-street, customer-city) account (account-number, branch-name, balance) loan (loan-number, branch-name, amount) depositor (customer-name, account-number) borrower (customer-name, loan-number)

Example Queries Find the loan-number, branch-name, and amount for loans of over $1200 {t | t  loan  t [amount]  1200} Find the loan number for each loan of an amount greater than $1200 {t |  s  loan (t [loan-number] = s [loan-number]  s [amount]  1200} Notice that a relation on schema [loan-number] is implicitly defined by the query

Example Queries Find the names of all customers having a loan, an account, or both at the bank {t |  s  borrower(t[customer-name] = s[customer-name])   u  depositor(t[customer-name] = u[customer- name]) Find the names of all customers who have a loan and an account at the bank {t |  s  borrower(t[customer-name] = s[customer-name])   u  depositor(t[customer-name] = u[customer- name])

Example Queries Find the names of all customers having a loan at the Perryridge branch {t |  s  borrower(t[customer-name] = s[customer-name]   u  loan(u[branch-name] = “Perryridge”  u[loan-number] = s[loan-number]))} Find the names of all customers who have a loan at the Perryridge branch, but no account at any branch of the bank {t |  s  borrower(t[customer-name] = s[customer-name]   u  loan(u[branch-name] = “Perryridge”  u[loan-number] = s[loan-number]))  not  v  depositor (v[customer-name] = t[customer-name]) }

Example Queries Find the names of all customers having a loan from the Perryridge branch, and the cities they live in {t |  s  loan(s[branch-name] = “Perryridge”   u  borrower (u[loan-number] = s[loan-number]  t [customer-name] = u[customer-name])   v  customer (u[customer-name] = v[customer-name]  t[customer-city] = v[customer-city])))}

Example Queries Find the names of all customers who have an account at all branches located in Brooklyn: {t |  c  customer (t[customer.name] = c[customer-name])   s  branch(s[branch-city] = “Brooklyn”   u  account ( s[branch-name] = u[branch-name]   s  depositor ( t[customer-name] = s[customer-name]  s[account-number] = u[account-number] )) )}

General Overview relational model Formal query languages relational algebra rel. tuple calculus rel. domain calculus

Overview - detailed rel. tuple calculus dfn details equivalence to rel. algebra rel. domain calculus + QBE

Rel. domain calculus (RDC) Q: why? A: slightly easier than RTC, although equivalent - basis for QBE idea: domain variables (w/ F.O.L.) – e.g.: ‘find STUDENT record with ssn=123’

Rel. Dom. Calculus find STUDENT record with ssn=123’

Details Like R.T.C - symbols allowed: quantifiers

Details but: domain (= column) variables, as opposed to tuple variables, e.g.: ssn name address

Domain Relational Calculus A nonprocedural query language equivalent in power to the tuple relational calculus Each query is an expression of the form: {  x 1, x 2, …, x n  | P (x 1, x 2, …, x n )} x 1, x 2, …, x n represent domain variables P represents a formula similar to that of the predicate calculus

Example queries Find the branch-name, loan-number, and amount for loans of over $1200 {  l, b, a  |  l, b, a   loan  a > 1200} Find the names of all customers who have a loan of over $1200 {  c  |  l, b, a (  c, l   borrower   l, b, a   loan  a > 1200)} Find the names of all customers who have a loan from the Perryridge branch and the loan amount: {  c, a  |  l (  c, l   borrower   b(  l, b, a   loan  b = “Perryridge”))} or {  c, a  |  l (  c, l   borrower   l, “Perryridge”, a   loan)}

Example Queries Find the names of all customers having a loan, an account, or both at the Perryridge branch: {  c  |  l ({  c, l   borrower   b,a (  l, b, a   loan  b = “Perryridge”))   a (  c, a   depositor   b,n (  a, b, n   account  b = “Perryridge”))} Find the names of all customers who have an account at all branches located in Brooklyn: {  c  |  n (  c, s, n   customer)   x,y,z (  x, y, z   branch  y = “Brooklyn”)   a,b (  x, y, z   account   c,a   depositor)}

Reminder: our Mini-U db CLASS c-idc-nameunits cis331d.b.2 cis321o.s.2 TAKES SSNc-idgrade 123cis331A 234cis331B

Examples find all student records RTC:

Examples (selection) find student record with ssn=123

Examples (selection) find student record with ssn=123 RTC: or

Examples (projection) find name of student with ssn=123

Examples (projection) find name of student with ssn=123 need to ‘restrict’ “a” RTC:

Examples (projection) find name of student with ssn=123 - RTC:

Examples cont’d (union) get records of both PT and FT students RTC:

Examples cont’d (union) get records of both PT and FT students

Examples difference: find students that are not staff RTC:

Examples difference: find students that are not staff

Cartesian product eg., dog-breeding: MALE x FEMALE gives all possible couples x =

Cartesian product find all the pairs of (male, female) - RTC:

Cartesian product find all the pairs of (male, female) - RDC:

‘Proof’ of equivalence rel. algebra rel. domain calculus rel. tuple calculus

Overview - detailed rel. domain calculus why? details examples equivalence with rel. algebra more examples; ‘safety’ of expressions

More examples join: find names of students taking cis351

Reminder: our Mini-U db

More examples join: find names of students taking cis351 - in RTC

More examples join: find names of students taking cis351 - in RDC

Sneak preview of QBE: TAKES SSNc-idgrade _xcis351

Sneak preview of QBE: TAKES SSNc-idgrade _xcis351 very user friendly heavily based on RDC very similar to MS Access interface

More examples 3-way join: find names of students taking a 2-unit course - in RTC: selection projection join

Reminder: our Mini-U db CLASS c-idc-nameunits cis331d.b.2 cis321o.s.2 grade TAKES SSNc-id 123cis331A 234cis331B _x.P _x_y 2

More examples 3-way join: find names of students taking a 2-unit course

More examples 3-way join: find names of students taking a 2-unit course

Even more examples: self -joins: find Tom’s grandparent(s)

Even more examples: self -joins: find Tom’s grandparent(s)

Even more examples: self -joins: find Tom’s grandparent(s)

Even more examples: self -joins: find Tom’s grandparent(s)

Hard examples: DIVISION find suppliers that shipped all the ABOMB parts

Hard examples: DIVISION find suppliers that shipped all the ABOMB parts

Hard examples: DIVISION find suppliers that shipped all the ABOMB parts

More on division find students that take all the courses that ssn=123 does (and maybe even more)

More on division find students that take all the courses that ssn=123 does (and maybe even more)

Safety of expressions similar to RTC FORBIDDEN:

Safety of Expressions {  x 1, x 2, …, x n  | P(x 1, x 2, …, x n )} is safe if all of the following hold: 1. All values that appear in tuples of the expression are values from dom a(P) (that is, the values appear either in P or in a tuple of a relation mentioned in P ) 2.For every “there exists” subformula of the form  x (P 1 (x)), the subformula is true if and only if there is a value x in dom (P 1 ) such that P 1 (x) is true. 3. For every “for all” subformula of the form  x (P 1 (x)), the subformula is true if and only if P 1 (x) is true for all values x from dom (P 1 ).

Overview - detailed rel. domain calculus + QBE dfn details equivalence to rel. algebra

table