Download presentation
Presentation is loading. Please wait.
Published byJemima Simon Modified over 9 years ago
1
THE RELATIONAL MODEL II IST 210: Organization of Data IST210 1
2
How to design a well-formed model? Revisit assignment 1 Break list into tables How to split? Functional Dependency important criteria Normalization Process process to design a well-formed model IST210 2
3
Functional Dependency A relationship between attributes: some attribute(s) determine the value of other attribute(s) in the same table These attributes we name them determinant The other attributes are functionally dependent on this determinant IST210 3 CookieCost = NumberOfBoxes * $5 NumberOfBoxes CookieCost CookieCost = NumberOfBoxes * UnitPrice (UnitPrice, NumberOfBoxes) CookieCost determinant The composite forms determinant Functionally dependent on NumberofBoxes
4
Determinant in Relations IST210 4 StudentI D FirstNameLastNameDOB 9123450JohnSmithJan. 1, 1989 9123451JohnAdamJun. 1, 1988 9123452JaneAdamAug. 1,1989 9123453JoshCohenAug. 1,1989 ClubIDClubNamePresident 12Football9123450 13Medical9123453 15Dance9123452 Determinant: StudentID Determinant: ClubID Determinant: CourseID Determinant = Candidate key?
5
Review: Candidate Key A candidate key is a special key Any subset of a candidate key is NOT a key (StudentID, FirstName) is not a candidate key, because StudentID is a key IST210 5
6
Determinant = Candidate Key? Student ID Student Name Student's DepartmentEmailCourseIDInstructorCourseNameLocation 1BobIST bbb@psu.edu 210JessieOrganization of Data110IST 5LisaIST lll@psu.edu 210JessieOrganization of Data110IST 2SarahIST sss@psu.edu 210JessieOrganization of Data110IST 3JimCSE jjj@psu.edu 210JessieOrganization of Data110IST 1BobIST bbb@psu.edu 220JohnNetwork208IST 3JimCSE jjj@psu.edu 220JohnNetwork208IST 10KateIST kkk@psu.edu 230DavidComputer Language206IST IST210 6 DeterminantsCandidate Key (StudentID, CourseID) StudentID CourseID (StudentID, CourseID) In a relation, a candidate key must be a determinant In a relation, a determinant may not be a candidate key If every determinant is a candidate key, it is a well-formed relation.
7
Normalization Normalization is a process of analyzing a relation to ensure that it is well formed No data redundancy among the nonkey attributes Data can be inserted, deleted, or modified without creating update anomalies IST210 7
8
Normalization Principles Relational design principle for normalized relations: To be a well-formed relation, every determinant must be a candidate key Any relation that is not well formed should be broken into two or more well-formed relations IST210 8
9
Normalization Process 1. Identify all the candidate keys of the relation. 2. Identify all the functional dependencies in the relation. 3. Examine the determinants of the functional dependencies. If any determinant is not a candidate key, the relation is not well formed. In this case: a. Place the columns of the functional dependency in a new relation of their own. b. Make the determinant of the functional dependency the primary key of the new relation. c. Leave a copy of the determinant as a foreign key in the original relation. d. Create a referential integrity constraint between the original relation and the new relation. 4. Repeat step 3 as many times as necessary until every determinant of every relation is a candidate key. IST210 9
10
Normalization Process: Example 1: Step 1 IST210 10 Step 1. Identify all the candidate keys of the relation. PrescriptionNumber
11
Normalization Process: Example 1: Step 2 IST210 11 Step 2. Identify all the functional dependencies in the relation. Drug Dosage? No CustomerEmail (CustomerName, CustomerPhone)? Yes, in this example PrescriptionNumber (Date, Drug, Dosage, CustomerName, CustomerPhone, CustomerEmail) This is a trivial dependency by the definition of candidate key
12
Normalization Process: Example 1: Step 3 IST210 12 Step 3. If any determinant is not a candidate key, the relation is not well formed. Determinant: PrescriptionNumber, CustomerEmail Candidate key: PrescriptionNumber CustomerEmail is NOT a candidate key, so the relation NOT well formed Then, we will normalize it (break it into multiple relations)
13
Normalization Process: Example 1: Step 3 IST210 13 Step 3. Examine the determinants of the functional dependencies. If any determinant is not a candidate key, the relation is not well formed. In this case: a.Place the columns of the functional dependency in a new relation of their own. CUSTOMER (CustomerEmail, CustomerName, CustomerPhone) b.Make the determinant of the functional dependency the primary key of the new relation. CUSTOMER (CustomerEmail, CustomerName, CustomerPhone) c.Leave a copy of the determinant as a foreign key in the original relation. PRESCRIPTION(PrescriptionNumber, Date, Drug, Dosage, CustomerEmail) d.Create a referential integrity constraint between the original relation and the new relation. CustomerEmail in PRESCRIPTION must exist in CustomerEmail in CUSTOMER
14
Normalization Process: Example 1: Step 4 IST210 14 Step 4. Repeat step 3 as many times as necessary until every determinant of every relation is a candidate key. CUSTOMER (CustomerEmail, CustomerName, CustomerPhone) PRESCRIPTION(PrescriptionNumber, Date, Drug, Dosage, CustomerEmail) CustomerEmail in PRESCRIPTION must exist in CustomerEmail in CUSTOMER Well-formed relational model design
15
Normalization Process: Example 2 IST210 15 Step 1. Candidate keys Step 2. Functional dependencies Step 3. Split the relation
16
Normalization Process: Example 2: Step 1 IST210 16 Step 1. Identify all the candidate keys of the relation. StudentNumber
17
Normalization Process: Example 2: Step 2 IST210 17 Step 2. Identify all the functional dependencies in the relation. DormName DormCost Trivial dependency: StudentNumber (LastName, FirstName, DormName, DormCost)
18
Normalization Process: Example 2: Step 3 IST210 18 Step 3. If any determinant is not a candidate key, the relation is not well formed. StudentNumber StudentNumber (LastName, FirstName, DormName, DormCost) DormName DormCost
19
Normalization Process: Example 2: Step 3 IST210 19 Step 3. Examine the determinants of the functional dependencies. If any determinant is not a candidate key, the relation is not well formed. In this case: a.Place the columns of the functional dependency in a new relation of their own. DORM(DormName, DormCost) b.Make the determinant of the functional dependency the primary key of the new relation. DORM(DormName, DormCost) c.Leave a copy of the determinant as a foreign key in the original relation. STU_DORM(StudentNumber, LastName, FirstName, DormName) d.Create a referential integrity constraint between the original relation and the new relation. DormName in STU_DORM must exist in DormName in DORM
20
Normalization Process: Example 2: Step 4 IST210 20 Step 4. Repeat step 3 as many times as necessary until every determinant of every relation is a candidate key. STU_DORM(StudentNumber, LastName, FirstName, DormName) DORM(DormName, DormCost) DormName in STU_DORM must exist in DormName in DORM Well-formed relational model design
21
Normalization Process: Example 3 IST210 21 Step 1. Candidate keys Step 2. Functional dependencies Step 3. Split the relation
22
Normalization Process: Example 3: Step 1 IST210 22 Step 1. Identify all the candidate keys of the relation. (Attorney, ClientNumber, MeetingDate)
23
Normalization Process: Example 3: Step 2 IST210 23 Step 2. Identify all the functional dependencies in the relation. ClientNumber ClientName Trivial dependency: (Attorney, ClientNumber, MeetingDate) (ClientName, Duration)
24
Normalization Process: Example 3: Step 3 IST210 24 Step 3. If any determinant is not a candidate key, the relation is not well formed. (Attorney, ClientNumber, MeetingDate) (Attorney, ClientNumber, MeetingDate) (ClientName, Duration) ClientNumber ClientName
25
Normalization Process: Example 3: Step 3 IST210 25 Step 3. Examine the determinants of the functional dependencies. If any determinant is not a candidate key, the relation is not well formed. In this case: a.Place the columns of the functional dependency in a new relation of their own. CLIENT(ClientNumber, ClientName) b.Make the determinant of the functional dependency the primary key of the new relation. CLIENT(ClientNumber, ClientName) c.Leave a copy of the determinant as a foreign key in the original relation. MEETING(Attorney, ClientNumber, MeetingDate, Duration) ClientNumber: A foreign key and also part of primary key d.Create a referential integrity constraint between the original relation and the new relation. ClientNumber in MEETING must exist in ClientNumber in CLIENT
26
Normalization Process: Example 3: Step 4 IST210 26 Step 4. Repeat step 3 as many times as necessary until every determinant of every relation is a candidate key. CLIENT(ClientNumber, ClientName) MEETING(Attorney, ClientNumber, MeetingDate, Duration) ClientNumber in MEETING must exist in ClientNumber in CLIENT Well-formed relational model design
27
Normalization Process: Example 4 IST210 27 Student ID Student Name Student's DepartmentEmailCourseIDInstructorCourseNameLocation 1BobIST bbb@psu.edu 210JessieOrganization of Data110IST 5LisaIST lll@psu.edu 210JessieOrganization of Data110IST 2SarahIST sss@psu.edu 210JessieOrganization of Data110IST 3JimCSE jjj@psu.edu 210JessieOrganization of Data110IST 1BobIST bbb@psu.edu 220JohnNetwork208IST 3JimCSE jjj@psu.edu 220JohnNetwork208IST 10KateIST kkk@psu.edu 230DavidComputer Language206IST Step 1. Candidate keys Step 2. Functional dependencies Step 3. Split the relation
28
Normalization Process: Example 4 IST210 28 Student ID Student Name Student's DepartmentEmailCourseIDInstructorCourseNameLocation 1BobIST bbb@psu.edu 210JessieOrganization of Data110IST 5LisaIST lll@psu.edu 210JessieOrganization of Data110IST 2SarahIST sss@psu.edu 210JessieOrganization of Data110IST 3JimCSE jjj@psu.edu 210JessieOrganization of Data110IST 1BobIST bbb@psu.edu 220JohnNetwork208IST 3JimCSE jjj@psu.edu 220JohnNetwork208IST 10KateIST kkk@psu.edu 230DavidComputer Language206IST Step 1. Candidate keys (StudentID, CourseID)(StudentID, CourseID) all other columns StudentID (StudentName, Student’s Department, Email) CourseID (Instructor, CourseName, Location) STUDENT(StudentID, StudentName, Student’sDepartment, Email) COURSE(CourseID, Instructor, CourseName, Location) REGISTRATION(StudentID, CourseID) StudentID in REGISTRATION must exist in StudentID in STUDENT COURSEID in REGISTRATION must exist in CourseID in COURSE Step 2. Functional dependencies Step 3. Split the relation
29
Normalization Process: Example 5 IST210 29 GRADE(ClassName, Section, Term, Grade, StudentNumber, StudentName, Professor, Department, ProfessorEmail)
30
Normalization Process: Example 5: Step 1 IST210 30 Step 1. Identify all the candidate keys of the relation. (ClassName, Section, Term, StudentNumber) GRADE(ClassName, Section, Term, Grade, StudentNumber, StudentName, Professor, Department, ProfessorEmail)
31
Normalization Process: Example 5: Step 2 IST210 31 Step 2. Identify all the functional dependencies in the relation. StudentNumber StudentName Professor (Department, ProfessorEmail) (Classname, Section, Term) Professor GRADE(ClassName, Section, Term, Grade, StudentNumber, StudentName, Professor, Department, ProfessorEmail) Trivial dependency: (ClassName, Section, Term, StudentNumber) (Grade, StudentName, Professor, Department, ProfessorEmail)
32
Normalization Process: Example 5: Step 3 IST210 32 GRADE(ClassName, Section, Term, Grade, StudentNumber, StudentName, Professor, Department, ProfessorEmail) Step 3. If any determinant is not a candidate key, the relation is not well formed. (ClassName, Section, Term, StudentNumber) (ClassName, Section, Term, StudentNumber) all other columns StudentNumber StudentName Professor (Department, ProfessorEmail) (Classname, Section, Term) Professor
33
Normalization Process: Example 5: Step 3 IST210 33 Step 3. Examine the determinants of the functional dependencies. If any determinant is not a candidate key, the relation is not well formed. In this case: STUDENT(StudentNumber, StudentName) PROFESSOR(Professor, Department, ProfessorEmail) CLASS_PROFESSOR(ClassName, Section, Term, Professor) GRADE_1(ClassName, Section, Term, Grade, StudentNumber) StudentNumber in GRADE_1 must exist in StudentNumber in STUDENT Proessor in CLASS_PROFESSOR must exist in Professor in PROFESSOR (ClassName, Section, Term) in GRADE_1 must exist in (ClassName, Section, Term) of CLASS_PROFESSOR StudentNumber StudentName Professor (Department, ProfessorEmail) (Classname, Section, Term) Professor (ClassName, Section, Term, StudentNumber) all other columns
34
Normal Form The normal forms (abbrev. NF) of relational database theory provide criteria for determining a table's degree of vulnerability to logical inconsistencies and anomalies. In textbook, briefly mentioned in P. 261 This is an important theory in database http://en.wikipedia.org/wiki/Database_normalization#Normal_forms IST210 34
35
IST210 35 Normal formBrief definition First normal formFirst normal form (1NF) Table faithfully represents a relation and has no repeating groupsrelation Second normal formSecond normal form (2NF) No non-prime attribute in the table is functionally dependent on a proper subset of any candidate keyfunctionally dependentproper subsetcandidate key Third normal formThird normal form (3NF) Every non-prime attribute is non-transitively dependent on every candidate key in the table. The attributes that do not contribute to the description of the primary key are removed from the table. In other words, no transitivity dependency is allowed.candidate key Elementary Key Normal Form Elementary Key Normal Form (EKNF) Every non-trivial functional dependency in the table is either the dependency of an elementary key attribute or a dependency on a superkey Boyce–Codd normal form Boyce–Codd normal form (BCNF) Every non-trivial functional dependency in the table is a dependency on a superkeysuperkey Fourth normal formFourth normal form (4NF) Every non-trivial multivalued dependency in the table is a dependency on a superkeymultivalued dependency Fifth normal formFifth normal form (5NF) Every non-trivial join dependency in the table is implied by the superkeys of the tablejoin dependency Domain/key normal form Domain/key normal form (DKNF) Every constraint on the table is a logical consequence of the table's domain constraints and key constraintslogical consequence Sixth normal formSixth normal form (6NF) Table features no non-trivial join dependencies at all (with reference to generalized join operator)
36
First Normal Form First normal form (1NF) is a property of a relation in a relational database. A relation is in first normal form if the domain of each attribute contains only atomic values, and the value of each attribute contains only a single value from that domain IST210 36 Fail to comply 1NF Comply 1NF
37
Second Normal Form A table is in 2NF if and only if it is in 1NF and no non prime attribute is dependent on any proper subset of any candidate key of the table. IST210 37 Fail to comply 2NF Comply 2NF
38
Second Normal Form (cont.) A table is in 2NF if and only if it is in 1NF and no non prime attribute is dependent on any proper subset of any candidate key of the table. IST210 38 Comply 2NF?
39
Third Normal Form A table is in 3NF if and only if both of the following conditions hold: The relation R (table) is in second normal form (2NF) Every non-prime attribute of R is non-transitively dependent (i.e. directly dependent) on every super key of R. IST210 39 Fail to comply 3NF (Tournament, Year) Winner Winner Winner DOB Comply 3NF
40
Review Question A relation is well-formed if A. Every determinant is a candidate key. B. Every candidate key is a determinant. IST210 40
41
Summary of Chapter 2 Learn the concept of the relational model Learn the meaning and importance of keys, candidate keys, primary keys, foreign keys, and related terminology Learn the meaning of functional dependencies Learn to apply a process for normalizing relations IST210 41
42
Reminder Homework 2 is out – due Sept 14 th at 11:59PM No Class on Labor day IST210 42
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.