Spatial Databases: Building Spatial DB

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Relational Database and Data Modeling
1 Review Visual System Modeling Tools Todd Bacastow Penn State University Geospatial System Analysis & Design.
Introduction to UML Todd Bacastow Penn State University Geography 583 Geospatial System Analysis & Design.
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Ch 12: Object-Oriented Analysis
ETEC 100 Information Technology
Database Design Conceptual –identify important entities and relationships –determine attribute domains and candidate keys –draw the E-R diagram Logical.
Geographic Information Systems
Physical design. Stage 6 - Physical Design Retrieve the target physical environment Create physical data design Create function component implementation.
Training of master Trainers Workshop e-Services Design and Delivery
UML Overview. UML Diagrams to be Covered Class Diagrams Use Case Diagrams Collaboration Diagrams Sequence Diagrams Package Diagrams Component Diagrams.
Information storage: Introduction of database 10/7/2004 Xiangming Mu.
Systems Analysis and Design in a Changing World, Fifth Edition
ITEC224 Database Programming
Databases ? 2014, Fall Pusan National University Ki-Joune Li.
1 Introduction to Database Systems. 2 Database and Database System / A database is a shared collection of logically related data designed to meet the.
9/14/2012ISC329 Isabelle Bichindaritz1 Database System Life Cycle.
1 Chapter 15 Methodology Conceptual Databases Design Transparencies Last Updated: April 2011 By M. Arief
Spatial Databases: Data Collection Spring, 2015 Ki-Joune Li.
5 Systems Analysis and Design in a Changing World, Fourth Edition.
1/26/2004TCSS545A Isabelle Bichindaritz1 Database Management Systems Design Methodology.
Spatial Databases: Building Spatial DB Spring, 2015 Ki-Joune Li.
Databases Shortfalls of file management systems Structure of a database Database administration Database Management system Hierarchical Databases Network.
5 Systems Analysis and Design in a Changing World, Fifth Edition.
Spatial Databases - Introduction Spring, 2015 Ki-Joune Li.
©NIIT Normalizing and Denormalizing Data Lesson 2B / Slide 1 of 18 Objectives In this section, you will learn to: Describe the Top-down and Bottom-up approach.
Databases : Database Building Procedures 2007, Fall Pusan National University Ki-Joune Li.
CISB113 Fundamentals of Information Systems IS Development.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Introduction to UML Todd Bacastow Rational Unified Process A process for the effective implementation of key “Best Practices” Control Changes Manage.
1 More About UML Todd Bacastow Penn State University Geospatial System Analysis & Design.
Retele de senzori Curs 2 - 1st edition UNIVERSITATEA „ TRANSILVANIA ” DIN BRAŞOV FACULTATEA DE INGINERIE ELECTRICĂ ŞI ŞTIINŢA CALCULATOARELOR.
SQL Basics Review Reviewing what we’ve learned so far…….
1 Documenting Solutions Todd Bacastow Penn State University Geog 468 GIS Analysis & Design.
What is a database? (a supplement, not a substitute for Chapter 1…) some slides copied/modified from text Collection of Data? Data vs. information Example:
Geographic Information Systems GIS Data Databases.
5 Systems Analysis and Design in a Changing World, Fourth Edition.
Fundamental of Database Systems
Databases and DBMSs Todd S. Bacastow January 2005.
Database Systems: Design, Implementation, and Management Tenth Edition
Database Development Lifecycle
UML Diagrams: Class Diagrams The Static Analysis Model
DATA REQIREMENT ANALYSIS
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
Spatial Databases: Building Spatial DB
Physical Changes That Don’t Change the Logical Design
MODELS OF DATABASE AND DATABASE DESIGN
Quiz Questions Q.1 An entity set that does not have sufficient attributes to form a primary key is a (A) strong entity set. (B) weak entity set. (C) simple.
Physical Structure of GDB
Chapter 12 Information Systems.
Abstract descriptions of systems whose requirements are being analysed
Geographic Information Systems
Object-Oriented Design of Spatial Entities Todd Bacastow
Data, Databases, and DBMSs
2018, Fall Pusan National University Ki-Joune Li
Spatial Databases: Building Spatial DB
Spatial Databases - Introduction
URBDP 422 Urban and Regional Geo-Spatial Analysis
Data Model.
Business Application Development
File Processing : Index and Hash
CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS
Spatial Databases - Introduction
Relational Database Design
Spatial Databases - Representation
Spatial Databases - Representation
Question 1: Basic Concepts (45 %)
Review #1 Intro stuff What is a database, 4 parts, 3 users, etc.
Geographic Information Systems
Presentation transcript:

Spatial Databases: Building Spatial DB Spring, 2017 Ki-Joune Li

Importance of Database Application of Spatial Databases (e.g. GIS) Garbage-In Garbage-Out About 70% of GIS Development Cost: DB Cost

Comparison with Software Lifecycle Requirement Analysis Modeling Schema Design DB Environments Data Collection and Input Quality Control Maintenance DB Life Cycle Requirement Analysis Functional Specification Design Development Environments Coding Test Maintenance Software Life Cycle – Waterfall Model

Requirement Analysis Analysis of Status Output of Analysis as it is and as it shall be. Output of Analysis Use-Case Diagram of UML: Workflow Analysis Data items that have been maintained and to be maintained Description of each item: Data Dictionary Relationships and Constraints on items Required accuracy Spatial Precision Temporal Precision Current State: As it is As it must be

Data Dictionary Definitions and Representation of Data Items such as Precise definition of data elements Integrity constraints or Constrains Stored procedures and trigger rules Specification of Producer and Consumer of data element Why it is so important? Common understanding on data items Consistency of databases Important input to data modeling

Data Modeling Data Modeling 4 steps Understanding the real world and application A very small piece of the real world According to viewpoint Determined by applications Drawing what you have understood in formal method Class Diagram in UML 4 steps Definition of Entities Attributes of each Entity Relationships Constraints

-- can optionally be described here. Class Diagram: Basic MyClassName +SomePublicAttribute : SomeType -SomePrivateAttribute : SomeType #SomeProtectedAttribute : SomeType +ClassMethodOne() +ClassMethodTwo() Responsibilities -- can optionally be described here. Multiplicity Customer 1 Simple Aggregation Class Abstract Class Rental Invoice Rental Item {abstract} 1..* 1 0..1 Composition (Dependency) Simple Association Generalization Checkout Screen DVD Movie VHS Movie Video Game

Definition of Entities Extract nouns from Problem statement Use-Case Diagram Delete unnecessary entities Duplication Attributes rather than entity ex. Loan amount Definition of Features Geographic Entity Granularity MyClassName

Definition of Features Meaningful Object of GIS in real world Must have a geometry Point, Line, Polygon, etc.. How to define the Granularity of Features Example How to define “a” coastal line? The highway from Pusan to Seoul is a long feature ? How to separate this long road?

Definition of Attributes Attributes of Feature Geometric type: Spatial Attribute Non-Spatial Attributes Geometric Type Different Levels of Detail (LOD) Building Polygon in 1/1,000 scale Point in 1/1,000,000 scale Road Polyline in 1/1,000,000 scale MyClassName +SomePublicAttribute : SomeType -SomePrivateAttribute : SomeType #SomeProtectedAttribute : SomeType +GeometricAttribute

Relationship Relationship Non-Spatial Relationship Spatial Relationship: Topology

Constraints Example Implementation No building on road surface More than 50 meters between two poles Implementation Internal Functions for checking constraints (or constructor) Spatial OCL (Object Constraint Language) More detail and complete constraint Better quality of DB

Quality Control for Data Modeling For the quality control, A Simulation with a pre-defined test scenario

Schema Design Automatic Conversion from Data Modeling to Schema Check Points: Performance Issues Materialization Index Geographic Distribution of DB: Clustering Based on Workload Analysis Distribution of operations Distribution of values

Materialization In SQL, view is a virtual table derived from a Select statement Eample CREATE VIEW ExcellentStudents AS SELECT Name, Department, Score FROM Students WHERE Score > 4.0 SELCT Name FROM ExcellentStudents Where Department=‘CS’ ExcellentStudents Materialization Invoke

Materialize or Not ? Materialization Duplication Not 3NF (BCNF) Cause an inconsistency between the original and derived tables Update: Overhead due to update propagation Extra Space Requirements Should be determined depending on the WORKLOAD Frequency of updates Cost for update propagation Especially when materialized view is geographically distributed

Spatial Index Index: Accelerate Search Spatial Index Spatial predicates: contain, overlapping, k-NN Much improves the query processing performance Has a performance overhead for insertion/deletion 2nd Phase Search Block Number Search Condition { Block# } Database on Disk 1st Phase

Clustering: Placement of records Vertical Fragmentation vs. Horizontal Fragmentation Vertical Fragmentation: Decomposition of table Horizontal Fragmentation: Placement of objects Consideration on Workload Vertical Fragmentation Horizontal Fragmentation

Clustering Clustering: Grouping objects so as to maximize Prob(a C, bC), when OK=a and OK+1=b for any two objects a and b of the same group C. Spatial Clustering Basic Assumption: If dist(a,b) < dist(a,c), Prob(OK=a, OK+1=b) > Prob(OK=a, OK+1=c) Two consecutive accesses a b c

Spatial Clustering Methods k-Means CLARANS in IEEE TKDE 2002, 14(5) BIRCH in proc. VLDB 1996 DBSCAN in proc. KDD 1996 SMTIN in proc. ACM-GIS 1997