Published by Nigel McKinney. Modified over 8 years ago.
1
Chapters 3-6: Relational Data Models, Relational Constraints, and Relational Algebra
(Chapters 5-8)
Flat file: a two-dimensional array of attributes or data items.
  ProductX          1    Bellaire    5
  ProductY          2    Sugarland   5
  ProductZ          3    Houston     5
  Computerization   10   Stafford    4
  Reorganization    20   Houston     1
  Newbenefits       30   Stafford    4
Database Management System (DBMS): a generalized software system that is used to create, manage, and protect databases.
3
Attribute: a named characteristic or property of an entity (= column header).
Entity: a “thing” in the real world with an independent existence, for example something with a physical existence: person, student, car.
4
Domain: the set of valid atomic values for an attribute in a relation, e.g.
  SSN: the set of 9-digit numbers
  GPA: 0 <= GPA <= 4.0
Atomic: each value in the domain is indivisible.
  Name (Fname, Minit, Lname): not atomic
  Fname: atomic
  Minit: atomic
  Lname: atomic
5
Relational Model Concepts
A relation is a mathematical concept based on the ideas of sets.
The model was first proposed by Dr. E.F. Codd of IBM Research in 1970 in the paper "A Relational Model of Data for Large Shared Data Banks," Communications of the ACM, June 1970.
That paper caused a major revolution in the field of database management and earned Dr. Codd the coveted ACM Turing Award.
6
Informal Definitions
Informally, a relation looks like a table of values.
A relation typically contains a set of rows.
The data elements in each row represent certain facts that correspond to a real-world entity or relationship.
  In the formal model, rows are called tuples.
Each column has a column header that gives an indication of the meaning of the data items in that column.
  In the formal model, the column header is called an attribute name (or just attribute).
7
Formal Definitions - Schema
The schema (or description) of a relation:
  Denoted by R(A1, A2, ..., An)
  R is the name of the relation
  The attributes of the relation are A1, A2, ..., An
Example: CUSTOMER (Cust-id, Cust-name, Address, Phone#)
  CUSTOMER is the relation name
  Defined over the four attributes: Cust-id, Cust-name, Address, Phone#
Each attribute has a domain, or a set of valid values.
  For example, the domain of Cust-id is 6-digit numbers.
8
Formal Definitions - Tuple
A tuple is an ordered set of values (enclosed in angle brackets '< ... >').
Each value is derived from an appropriate domain.
A row in the CUSTOMER relation is a 4-tuple: it consists of four values, one each for Cust-id, Cust-name, Address, and Phone#.
A relation is a set of such tuples (rows).
9
Formal Definitions - Domain
A domain has a logical definition:
  Example: “USA_phone_numbers” is the set of 10-digit phone numbers valid in the U.S.
A domain also has a data type or a format defined for it.
  USA_phone_numbers may have the format (ddd)ddd-dddd, where each d is a decimal digit.
  Dates have various formats, such as year, month, day formatted as yyyy-mm-dd, or as dd mm, yyyy, etc.
The attribute name designates the role played by a domain in a relation:
  It is used to interpret the meaning of the data elements corresponding to that attribute.
  Example: the domain Date may be used to define two attributes named “Invoice-date” and “Payment-date”, with different meanings.
10
Formal Definitions - State
The relation state is a subset of the Cartesian product of the domains of its attributes.
  Each domain contains the set of all possible values the attribute can take.
Example: attribute Cust-name is defined over the domain of character strings of maximum length 25.
  dom(Cust-name) is varchar(25)
The role these strings play in the CUSTOMER relation is that of the name of a customer.
11
Formal Definitions - Summary
Formally, given R(A1, A2, ..., An):
  r(R) ⊆ dom(A1) X dom(A2) X ... X dom(An)
R(A1, A2, ..., An) is the schema of the relation.
R is the name of the relation.
A1, A2, ..., An are the attributes of the relation.
r(R) is a specific state (or "value" or “population”) of relation R; it is a set of tuples (rows):
  r(R) = {t1, t2, ..., tm}, where each ti is an n-tuple
  ti = <v1, v2, ..., vn>, where each vj is an element of dom(Aj)
12
Formal Definitions - Example
Let R(A1, A2) be a relation schema:
  Let dom(A1) = {0, 1}
  Let dom(A2) = {a, b, c}
Then dom(A1) X dom(A2) is all possible combinations:
  {<0,a>, <0,b>, <0,c>, <1,a>, <1,b>, <1,c>}
The relation state r(R) ⊆ dom(A1) X dom(A2).
For example, r(R) could be {<0,a>, <0,b>, <1,c>}.
  This is one possible state (or “population” or “extension”) r of the relation R, defined over A1 and A2.
  It has three 2-tuples: <0,a>, <0,b>, <1,c>
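The example above can be checked mechanically. The following is a minimal Python sketch, not part of the original slides: tuples are modelled as Python tuples and a relation state as a set, and the variable names are illustrative only.

```python
from itertools import product

dom_a1 = {0, 1}
dom_a2 = {"a", "b", "c"}

# dom(A1) X dom(A2): every possible 2-tuple over the two domains.
all_tuples = set(product(dom_a1, dom_a2))

# One possible relation state: any subset of the Cartesian product.
r = {(0, "a"), (0, "b"), (1, "c")}

print(len(all_tuples))   # 6
print(r <= all_tuples)   # True: r(R) is a subset of the product
```

Any of the 2^6 = 64 subsets of the product, including the empty set, would be a valid state of R.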
13
Definition Summary
  Informal Terms               Formal Terms
  Table                        Relation
  Column Header                Attribute
  All possible Column Values   Domain
  Row                          Tuple
  Table Definition             Schema of a Relation
  Populated Table              State of the Relation
14
Super key: an attribute or a set of attributes that identifies an entity uniquely (may not be a minimal set), e.g.
  {SSN}
  {SSN, NAME}
  {SSN, NAME, MAJOR}
15
Candidate key: a super key such that no proper subset of its attributes is itself a super key. So a candidate key is a minimal identifier, e.g.
  STUID
  SSN
Primary key: the candidate key that is chosen to identify tuples in a relation. It must be unique and must exist (it cannot be null).
Alternate key: a candidate key in a relation that is not selected as the primary key, e.g. if the primary key is SSN, then STUID is an alternate key.
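The difference between a super key and a candidate key can be sketched in Python. The STUDENT rows and the helper names `is_superkey` and `candidate_keys` are hypothetical. Note that a key is a constraint on every legal state of a relation, so testing a single sample state, as done here, can only refute a key, not prove one.

```python
from itertools import combinations

# Hypothetical STUDENT rows; the attribute names follow the slides.
STUDENT = [
    {"STUID": "S1", "SSN": "111", "NAME": "Ann", "MAJOR": "CS"},
    {"STUID": "S2", "SSN": "222", "NAME": "Bob", "MAJOR": "CS"},
    {"STUID": "S3", "SSN": "333", "NAME": "Ann", "MAJOR": "CS"},
]

def is_superkey(rows, attrs):
    """True if no two rows agree on every attribute in attrs."""
    seen = set()
    for row in rows:
        key = tuple(row[a] for a in attrs)
        if key in seen:
            return False
        seen.add(key)
    return True

def candidate_keys(rows, all_attrs):
    """Super keys of this state that contain no smaller super key."""
    found = []
    for size in range(1, len(all_attrs) + 1):
        for attrs in combinations(all_attrs, size):
            if any(set(k) <= set(attrs) for k in found):
                continue  # contains a smaller key, so it is not minimal
            if is_superkey(rows, attrs):
                found.append(attrs)
    return found

print(is_superkey(STUDENT, ("SSN", "NAME")))    # True, but not minimal
print(is_superkey(STUDENT, ("NAME", "MAJOR")))  # False
print(candidate_keys(STUDENT, ("STUID", "SSN", "NAME", "MAJOR")))
# [('STUID',), ('SSN',)]
```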
16
Concatenated (composite) key: a primary key that is composed of two or more attributes or data items, e.g.
  GRADE_REPORT(STUID, COURSE#, GRADE)
17
Foreign key: a non-key attribute in one relation that appears as the primary key (or part of the key) in another relation, e.g.
  EMPLOYEE(SSN, FNAME, MINIT, DNO)
  DEPARTMENT(DNUMBER, DNAME, MANAGER)
Here DNO in EMPLOYEE refers to DNUMBER in DEPARTMENT.
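A foreign key implies a referential integrity check: every non-null DNO in EMPLOYEE should match some DNUMBER in DEPARTMENT. Below is a minimal Python sketch. The helper `dangling_rows` is hypothetical, and the sample rows are assumptions; the last employee row is deliberately invalid.

```python
# Assumed sample rows; only the key columns matter for the check.
DEPARTMENT = [{"DNUMBER": 5, "DNAME": "Research"}]
EMPLOYEE = [
    {"SSN": "123456789", "FNAME": "John", "DNO": 5},
    {"SSN": "333445555", "FNAME": "Franklin", "DNO": 5},
    {"SSN": "000000000", "FNAME": "Test", "DNO": 9},  # hypothetical bad row
]

def dangling_rows(child, fk, parent, pk):
    """Child rows whose non-null foreign key matches no parent key."""
    parent_keys = {row[pk] for row in parent}
    return [
        row for row in child
        if row[fk] is not None and row[fk] not in parent_keys
    ]

violations = dangling_rows(EMPLOYEE, "DNO", DEPARTMENT, "DNUMBER")
print([row["SSN"] for row in violations])  # ['000000000']
```

A DBMS enforces the same rule declaratively through a foreign key constraint, rejecting the bad row at insert time.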
18
Secondary key: a field that can have duplicate values and that can be used as a search path by the users.
20
Figure: Referential integrity constraints for the COMPANY database
22
Relational Algebra Overview
Relational algebra is the basic set of operations for the relational model.
These operations enable a user to specify basic retrieval requests (or queries).
The result of an operation is a new relation, which may have been formed from one or more input relations.
  This property makes the algebra “closed” (all objects in relational algebra are relations).
23
Relational Algebra Overview (continued)
The algebra operations thus produce new relations.
  These can be further manipulated using operations of the same algebra.
A sequence of relational algebra operations forms a relational algebra expression.
  The result of a relational algebra expression is also a relation, which represents the result of a database query (or retrieval request).
24
Relational Algebra Overview
Relational algebra consists of several groups of operations:
  Unary relational operations
    SELECT (symbol: σ (sigma))
    PROJECT (symbol: π (pi))
  Relational algebra operations from set theory
    UNION (∪), INTERSECTION (∩), DIFFERENCE (or MINUS, –)
    CARTESIAN PRODUCT (x)
  Binary relational operations
    JOIN (several variations of JOIN exist)
    DIVISION
  Additional relational operations
    OUTER JOINS, OUTER UNION
    AGGREGATE FUNCTIONS (these compute summaries of information, for example SUM, COUNT, AVG, MIN, MAX)
25
Unary Relational Operations: SELECT
The SELECT operation (denoted by σ (sigma)) is used to select a subset of the tuples from a relation based on a selection condition.
  The selection condition acts as a filter.
  It keeps only those tuples that satisfy the qualifying condition.
  Tuples satisfying the condition are selected, whereas the other tuples are discarded (filtered out).
Examples:
  Select the EMPLOYEE tuples whose department number is 4:
    σ DNO = 4 (EMPLOYEE)
  Select the EMPLOYEE tuples whose salary is greater than $30,000:
    σ SALARY > 30,000 (EMPLOYEE)
26
Unary Relational Operations: SELECT
In general, the SELECT operation is denoted by σ<selection condition>(R), where
  the symbol σ (sigma) denotes the select operator
  the selection condition is a Boolean (conditional) expression specified on the attributes of relation R
  tuples that make the condition true are selected and appear in the result of the operation
  tuples that make the condition false are filtered out and discarded from the result of the operation
27
Unary Relational Operations: SELECT (contd.)
SELECT operation properties:
  The SELECT operation σ<selection condition>(R) produces a relation S that has the same schema (same attributes) as R.
  SELECT is commutative:
    σ<condition1>(σ<condition2>(R)) = σ<condition2>(σ<condition1>(R))
  Because of the commutativity property, a cascade (sequence) of SELECT operations may be applied in any order:
    σ<cond1>(σ<cond2>(σ<cond3>(R))) = σ<cond2>(σ<cond3>(σ<cond1>(R)))
  A cascade of SELECT operations may be replaced by a single selection with a conjunction of all the conditions:
    σ<cond1>(σ<cond2>(σ<cond3>(R))) = σ<cond1> AND <cond2> AND <cond3>(R)
  The number of tuples in the result of a SELECT is less than (or equal to) the number of tuples in the input relation R.
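The SELECT properties above can be observed on a small example. This is a sketch only: the `select` helper, the two condition functions, and the sample EMPLOYEE rows (names, departments, salaries) are all assumptions made for illustration.

```python
# Assumed sample rows; only DNO and SALARY matter here.
EMPLOYEE = [
    {"LNAME": "Smith", "DNO": 5, "SALARY": 30000},
    {"LNAME": "Wong", "DNO": 5, "SALARY": 40000},
    {"LNAME": "English", "DNO": 5, "SALARY": 25000},
    {"LNAME": "Doe", "DNO": 4, "SALARY": 43000},
]

def select(rows, condition):
    """Relational SELECT: keep the rows for which condition(row) is true."""
    return [row for row in rows if condition(row)]

def in_dept_5(row):
    return row["DNO"] == 5

def high_paid(row):
    return row["SALARY"] > 28000

one_order = select(select(EMPLOYEE, in_dept_5), high_paid)
other_order = select(select(EMPLOYEE, high_paid), in_dept_5)
conjunction = select(EMPLOYEE, lambda row: in_dept_5(row) and high_paid(row))

print(one_order == other_order == conjunction)  # True
print([row["LNAME"] for row in conjunction])    # ['Smith', 'Wong']
print(len(conjunction) <= len(EMPLOYEE))        # True
```

The result rows keep all attributes of the input, which mirrors the statement that SELECT preserves the schema of R.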
28
Select
Works on a single table: takes the rows that meet a specified condition and copies them into a new table.
  σ Condition(s) (Table name)
SQL (Structured Query Language):
  SELECT * FROM (table name) WHERE condition 1 AND condition 2 AND condition 3 ...
29
Figure: query tree (Table, Condition(s))
30
Find employees who work for department number 5.
  σ DNO = 5 (employee)
SQL:
  SELECT * FROM employee WHERE dno = 5;
32
Figure: query tree (Employee, σ DNO=5)
33
σ (DNO=4 AND SALARY>25000) OR (DNO=5 AND SALARY>30000) (EMPLOYEE)
---------------------------------------------------------
σ<cond1>(σ<cond2>(R)) = σ<cond2>(σ<cond1>(R))
σ<cond1>(σ<cond2>(...(σ<condn>(R))...)) = σ<cond1> AND <cond2> AND ... AND <condn>(R)
34
Project
Operates on a single table and produces a vertical subset of the table:
  extracts the values of the specified columns
  eliminates duplicate rows
  places the values in a new table
  π column1, column2, column3, ... (table name)
35
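SQL:
  SELECT column1, column2, column3, ... FROM (table name)
Note: SQL keeps duplicate rows unless DISTINCT is specified, so SELECT DISTINCT column1, ... matches PROJECT exactly.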
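A minimal Python sketch of PROJECT with duplicate elimination follows. The `project` helper is hypothetical; the four rows are the department 5 employees that appear in the view output later in this deck.

```python
EMPLOYEE = [
    {"FNAME": "John", "LNAME": "Smith", "DNO": 5},
    {"FNAME": "Franklin", "LNAME": "Wong", "DNO": 5},
    {"FNAME": "Ramesh", "LNAME": "Narayan", "DNO": 5},
    {"FNAME": "Joyce", "LNAME": "English", "DNO": 5},
]

def project(rows, attrs):
    """Relational PROJECT: keep only attrs and drop duplicate rows."""
    seen, result = set(), []
    for row in rows:
        values = tuple(row[a] for a in attrs)
        if values not in seen:  # duplicate elimination
            seen.add(values)
            result.append(dict(zip(attrs, values)))
    return result

print(project(EMPLOYEE, ["DNO"]))                   # [{'DNO': 5}]
print(len(project(EMPLOYEE, ["FNAME", "LNAME"])))   # 4
```

Projecting on DNO collapses four input rows into one, which is the behaviour of SELECT DISTINCT dno rather than plain SELECT dno.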
36
Figure: query tree (Table, π column(s))
37
E.g. show the names of all employees.
  π fname, minit, lname (employee)
SQL:
  SELECT fname, minit, lname FROM employee;
39
Figure: query tree (Employee, π fname, minit, lname)
40
Select & project
Show the names of all employees who work for department number 5.
  π fname, minit, lname (σ dno = 5 (employee))
SQL:
  SELECT fname, minit, lname FROM employee WHERE dno = 5;
42
Figure: query tree (Employee, σ DNO = 5, then π fname, minit, lname)
43
Figure: Examples of applying SELECT and PROJECT operations
44
PRODUCT (or Cartesian product): R1 x R2
R1 X R2 is a table whose width is the width of R1 plus the width of R2, and whose columns are the columns of R1 followed by the columns of R2.
If R1 has X rows and M columns, and R2 has Y rows and N columns, then R1 X R2 has X * Y rows and M + N columns.
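The row and column counts can be confirmed with a short Python sketch. Rows are modelled as plain tuples, the `cartesian_product` helper is hypothetical, and the data is taken from the student1 and credit_hours tables created later in this deck.

```python
STUDENT1 = [
    ("101", "Jim", "Smith"),
    ("102", "Tim", "Brown"),
    ("103", "Babara", "Houston"),
]
CREDIT_HOURS = [("101", 60), ("102", 85)]

def cartesian_product(r1, r2):
    """Pair every row of r1 with every row of r2 (columns of r1 first)."""
    return [t1 + t2 for t1 in r1 for t2 in r2]

rows = cartesian_product(STUDENT1, CREDIT_HOURS)
print(len(rows))     # 6 rows    = 3 * 2
print(len(rows[0]))  # 5 columns = 3 + 2
```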
45
Figure: Cartesian Product
46
Query Tree for Cartesian Product
Figure: query tree (Table1 X Table2)
47
Figure: Example of Query Tree
48
Theta Join
The result of performing a SELECT operation, using a comparison operator theta (such as =, <, >), on the product.
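A theta join can be sketched in Python as a product followed by a filter. The `theta_join` helper is hypothetical and assumes the two tables have no attribute names in common; the data comes from the student1 and credit_hours tables created later in this deck. Passing a different operator gives a different join: `operator.gt` corresponds to ID > STUID and `operator.eq` to the equijoin.

```python
import operator

STUDENT1 = [
    {"id": "101", "fname": "Jim", "lname": "Smith"},
    {"id": "102", "fname": "Tim", "lname": "Brown"},
    {"id": "103", "fname": "Babara", "lname": "Houston"},
]
CREDIT_HOURS = [
    {"stuid": "101", "hours": 60},
    {"stuid": "102", "hours": 85},
]

def theta_join(r1, r2, left_attr, theta, right_attr):
    """Product of r1 and r2, keeping pairs where left theta right holds."""
    return [
        {**a, **b}
        for a in r1
        for b in r2
        if theta(a[left_attr], b[right_attr])
    ]

greater = theta_join(STUDENT1, CREDIT_HOURS, "id", operator.gt, "stuid")
equal = theta_join(STUDENT1, CREDIT_HOURS, "id", operator.eq, "stuid")

print([(r["id"], r["stuid"]) for r in greater])
# [('102', '101'), ('103', '101'), ('103', '102')]
print([(r["id"], r["stuid"]) for r in equal])
# [('101', '101'), ('102', '102')]
```

The ids are three-character strings, as in the char(3) columns, so string comparison orders them the same way as their numeric values.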
49
Figure: Theta Join (>)
50
Figure: Theta Join (ID > STUID)
51
Query Tree for Theta Join
Figure: query tree (Student X Credit_Hours, then σ ID > STUID)
52
Equijoin
A theta join in which “theta” is equality.
53
Figure: Equijoin
54
Figure: Equijoin
55
Query Tree for Equijoin
Figure: query tree (Student X Credit_Hours, then σ ID = STUID)
56
Natural Join |X|
An equijoin in which the repeated column is eliminated.
The join is usually performed over columns with the same names.
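The following Python sketch shows a natural join over same-named columns. The `natural_join` helper is hypothetical, and for this illustration the stuid column of credit_hours is assumed to have been renamed to id so that the two tables share a column name.

```python
STUDENT1 = [
    {"id": "101", "fname": "Jim", "lname": "Smith"},
    {"id": "102", "fname": "Tim", "lname": "Brown"},
    {"id": "103", "fname": "Babara", "lname": "Houston"},
]
# Assumption: stuid renamed to id so the join column names match.
CREDIT_HOURS = [
    {"id": "101", "hours": 60},
    {"id": "102", "hours": 85},
]

def natural_join(r1, r2):
    """Join on all same-named columns; each shared column appears once."""
    result = []
    for a in r1:
        for b in r2:
            common = a.keys() & b.keys()
            if all(a[c] == b[c] for c in common):
                result.append({**a, **b})  # merging keeps one copy of id
    return result

joined = natural_join(STUDENT1, CREDIT_HOURS)
print(len(joined))  # 2
print(joined[0])
# {'id': '101', 'fname': 'Jim', 'lname': 'Smith', 'hours': 60}
```

Student 103 has no credit_hours row, so it does not appear in the result.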
57
Figure: Equi-join (repeated column marked "Remove")
58
Figure: Remove this column
60
Query Tree for Natural Join
Figure: query tree (Student |X| Credit_Hours)
61
Semi-join
If R1 and R2 are tables, the semijoin of R1 and R2 is the natural join of R1 and R2, followed by projecting the result onto the attributes of R1.
Semijoin is not commutative.
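A short Python sketch shows why swapping the operands of a semijoin changes the result. The `semijoin` helper is hypothetical, and the join column of credit_hours is assumed to be named id so that the tables share a column name.

```python
STUDENT1 = [
    {"id": "101", "fname": "Jim", "lname": "Smith"},
    {"id": "102", "fname": "Tim", "lname": "Brown"},
    {"id": "103", "fname": "Babara", "lname": "Houston"},
]
# Assumption: the join column is named id in both tables.
CREDIT_HOURS = [
    {"id": "101", "hours": 60},
    {"id": "102", "hours": 85},
]

def semijoin(r1, r2):
    """Rows of r1 that match some row of r2 on their shared columns."""
    return [
        a for a in r1
        if any(all(a[c] == b[c] for c in a.keys() & b.keys()) for b in r2)
    ]

left = semijoin(STUDENT1, CREDIT_HOURS)    # student columns only
right = semijoin(CREDIT_HOURS, STUDENT1)   # credit_hours columns only

print([row["id"] for row in left])  # ['101', '102']
print(sorted(left[0]))              # ['fname', 'id', 'lname']
print(sorted(right[0]))             # ['hours', 'id']
print(left == right)                # False
```

Both results contain ids 101 and 102, but each keeps only the attributes of its first operand, so the two semijoins are different relations.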
62
Create tables
  create table student1 (id char(3) primary key, fname char(10), lname char(10));
  insert into student1 values('101', 'Jim', 'Smith');
  insert into student1 values('102', 'Tim', 'Brown');
  insert into student1 values('103', 'Babara', 'Houston');
  ---------------------
  create table credit_hours (stuid char(3) primary key, hours number(3));
  insert into credit_hours values('101', 60);
  insert into credit_hours values('102', 85);
63
Figure: Left Semi-Join
64
Figure: Right Semi-Join
65
Outer Join
An extension of a THETA JOIN, an EQUIJOIN, or a NATURAL JOIN.
An outer join consists of all rows that appear in the usual theta join, plus an additional row for each tuple from the original tables that does not participate in the theta join.
In each of those additional rows, the unmatched original tuple is extended by assigning null values to the other attributes.
66
Left outer join: unmatched rows from the first (left) table appear in the resulting table.
Right outer join: unmatched rows from the second (right) table appear in the resulting table.
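Left and right outer joins can be sketched in Python as follows. Both helpers are hypothetical, the null value is represented by None, and the data comes from the student1 and credit_hours tables created earlier in this deck.

```python
STUDENT1 = [
    {"id": "101", "fname": "Jim", "lname": "Smith"},
    {"id": "102", "fname": "Tim", "lname": "Brown"},
    {"id": "103", "fname": "Babara", "lname": "Houston"},
]
CREDIT_HOURS = [
    {"stuid": "101", "hours": 60},
    {"stuid": "102", "hours": 85},
]

def left_outer_join(left, right, left_attr, right_attr, right_columns):
    """Equijoin that also keeps unmatched left rows, padded with None."""
    result = []
    for a in left:
        matches = [b for b in right if a[left_attr] == b[right_attr]]
        if not matches:
            matches = [dict.fromkeys(right_columns)]  # all None (NULL)
        result.extend({**a, **b} for b in matches)
    return result

def right_outer_join(left, right, left_attr, right_attr, left_columns):
    """Mirror image: keep every row of the right table."""
    return left_outer_join(right, left, right_attr, left_attr, left_columns)

lo = left_outer_join(STUDENT1, CREDIT_HOURS, "id", "stuid", ["stuid", "hours"])
ro = right_outer_join(STUDENT1, CREDIT_HOURS, "id", "stuid",
                      ["id", "fname", "lname"])

print(len(lo))   # 3: student 103 is kept
print(lo[2])
# {'id': '103', 'fname': 'Babara', 'lname': 'Houston', 'stuid': None, 'hours': None}
print(len(ro))   # 2: every credit_hours row already has a match
```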
67
Figure: Left Outer Join; Right Outer Join
68
Outer Join -- Oracle
Left outer join:
  select * from student, credit_hours where id = stuid(+);
  SELECT E.FNAME, E.LNAME, dependent_name FROM EMPLOYEE E, DEPENDENT D WHERE E.SSN = D.ESSN(+);
69
Right outer join:
  select * from student, credit_hours where id(+) = stuid;
70
Sample SQL
Create view:
  create view v_emp_dno as select fname, lname, dno from employee;
  select * from v_emp_dno;
  create view v_department as select dnumber, dname from department;
  select * from v_department;
Cartesian product:
  select * from v_emp_dno, v_department;
Natural join (written as an equijoin; select * keeps both dno and dnumber):
  select * from v_emp_dno, v_department where dno = dnumber;
Left outer join (keeps every employee):
  select fname, lname, ssn, essn, dependent_name from employee, dependent where ssn = essn (+);
Right outer join (keeps every dependent):
  select essn, dependent_name, fname, lname, ssn from employee, dependent where ssn (+) = essn;
71
Set operations: Union, Difference, Intersection, Division
Union (U): the tables must be union-compatible, i.e. they must have the same basic structure, and corresponding attributes must have the same domains.
The union of two relations is the set of tuples in either or both relations.
72
Figure: Example to illustrate the result of UNION, INTERSECT, and DIFFERENCE
73
SQL -- Union
  select ssn from employee where dno = 5
  union
  select distinct(essn) from dependent;
Result:
  SSN where dno = 5 (4 rows selected):  123456789, 333445555, 666884444, 453453453
  U
  ESSN (3 rows selected):               123456789, 333445555, 987654321
  =
  SSN (5 rows selected):                123456789, 333445555, 453453453, 666884444, 987654321
74
Difference (-)
The difference between two relations is the set of tuples that belong to the first relation but not to the second relation.
75
SQL -- Minus
  select ssn from employee
  minus
  select distinct(essn) from dependent;
Result:
  SSN (8 rows selected):   123456789, 333445555, 999887777, 987654321, 666884444, 453453453, 987987987, 888665555
  -
  ESSN (3 rows selected):  123456789, 333445555, 987654321
  =
  SSN (5 rows selected):   453453453, 666884444, 888665555, 987987987, 999887777
76
Intersection (∩)
The intersection of two relations is the set of tuples that belong to both relations simultaneously.
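Union, difference, and intersection map directly onto Python set operators. The sketch below uses the SSN and ESSN values from the Union and Minus result listings in this deck; the variable names are illustrative.

```python
ALL_SSN = {
    "123456789", "333445555", "999887777", "987654321",
    "666884444", "453453453", "987987987", "888665555",
}
DNO5_SSN = {"123456789", "333445555", "666884444", "453453453"}
ESSN = {"123456789", "333445555", "987654321"}

union = DNO5_SSN | ESSN          # in either or both relations
difference = ALL_SSN - ESSN      # in the first but not the second
intersection = DNO5_SSN & ESSN   # in both relations

print(len(union))            # 5
print(sorted(difference))
# ['453453453', '666884444', '888665555', '987987987', '999887777']
print(sorted(intersection))  # ['123456789', '333445555']
```

The union and difference sizes agree with the "5 rows selected" results listed for the Union and Minus queries.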
77
Figure: Intersection
78
Division (÷)
A binary operation that can be defined on two relations where the entire structure of one (the divisor) is a portion of the structure of the other (the dividend).
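Division answers "which values are paired with every row of the divisor?". The Python sketch below uses hypothetical data: WORKS_ON pairs an employee SSN with a project number, and PROJECTS is the divisor. The `divide` helper is illustrative and handles only a two-column dividend with a one-column divisor.

```python
# Hypothetical dividend: (essn, pno) pairs.
WORKS_ON = {
    ("123456789", 1), ("123456789", 2),
    ("453453453", 1),
    ("333445555", 1), ("333445555", 2),
}
# Hypothetical divisor: the project numbers that must all be matched.
PROJECTS = {1, 2}

def divide(dividend, divisor):
    """Values x such that (x, y) is in dividend for every y in divisor."""
    candidates = {x for x, _ in dividend}
    return {
        x for x in candidates
        if all((x, y) in dividend for y in divisor)
    }

print(sorted(divide(WORKS_ON, PROJECTS)))
# ['123456789', '333445555']
```

Employee 453453453 works only on project 1, so it is excluded from the quotient.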
79
Figure: Division
80
Figure: Example of DIVISION
81
Aggregate Functions and Grouping
Notation: <grouping attributes> ℱ <function list> (R), where ℱ is "script F"
Functions: sum, average, maximum, minimum, count
82
All Employees (No Group By)
  SELECT sum(salary), max(salary), min(salary), avg(salary) FROM employee;

  SUM(SALARY) MAX(SALARY) MIN(SALARY) AVG(SALARY)
  ----------- ----------- ----------- -----------
       281000       55000       25000       35125
83
Example: retrieve the department number, number of employees, and average salary in the department (Group By)
  RESULT(DNO, NUMBER_OF_EMPLOYEES, AVG_SAL) ← DNO ℱ COUNT SSN, AVERAGE SALARY (EMPLOYEE)
SQL:
  SELECT dno, count(ssn), avg(salary) FROM employee GROUP BY dno ORDER BY dno;

  DNO COUNT(SSN) AVG(SALARY)
  --- ---------- -----------
    1          1       55000
    4          3       31000
    5          4       33250
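Grouping with aggregate functions can be sketched in Python. The `group_aggregate` helper is hypothetical. The individual salaries and the department assignments outside department 5 are assumptions, chosen so that the per-department counts and averages match the result table above.

```python
from collections import defaultdict

# (ssn, dno, salary); individual salaries are assumed values.
EMPLOYEE = [
    ("123456789", 5, 30000),
    ("333445555", 5, 40000),
    ("666884444", 5, 38000),
    ("453453453", 5, 25000),
    ("999887777", 4, 25000),
    ("987654321", 4, 43000),
    ("987987987", 4, 25000),
    ("888665555", 1, 55000),
]

def group_aggregate(rows):
    """DNO script-F COUNT SSN, AVERAGE SALARY: one result row per DNO."""
    groups = defaultdict(list)
    for _ssn, dno, salary in rows:
        groups[dno].append(salary)
    return {
        dno: (len(salaries), sum(salaries) / len(salaries))
        for dno, salaries in sorted(groups.items())
    }

print(group_aggregate(EMPLOYEE))
# {1: (1, 55000.0), 4: (3, 31000.0), 5: (4, 33250.0)}
```

Sorting the groups by key plays the role of ORDER BY dno in the SQL version.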
84
Group By
  SELECT dno, sum(salary), max(salary), min(salary), avg(salary) FROM employee GROUP BY dno;

  DNO SUM(SALARY) MAX(SALARY) MIN(SALARY) AVG(SALARY)
  --- ----------- ----------- ----------- -----------
    1       55000       55000       55000       55000
    5      133000       40000       25000       33250
    4       93000       43000       25000       31000
85
DNO ℱ COUNT SSN, AVERAGE SALARY (EMPLOYEE)
86
If grouping attributes are not specified:
  ℱ COUNT SSN, AVERAGE SALARY (EMPLOYEE)
87
  SELECT sum(salary), max(salary), min(salary), avg(salary) FROM employee, department WHERE dno = dnumber AND dname = 'Research';

  SUM(SALARY) MAX(SALARY) MIN(SALARY) AVG(SALARY)
  ----------- ----------- ----------- -----------
       133000       40000       25000       33250
88
View
  create view V_Dno5 as (select fname, lname, dno from employee where dno = 5);
  --------------
  view V_DNO5 created.
  --------------
  select * from V_DNO5;

  FNAME           LNAME           DNO
  --------------- --------------- ---
  John            Smith             5
  Franklin        Wong              5
  Ramesh          Narayan           5
  Joyce           English           5