‘Modern’ Databases Database Systems Lecture 18 Natasha Alechina.

Slides:



Advertisements
Similar presentations
Chapter 10: Designing Databases
Advertisements

XML: Extensible Markup Language
Lecture-7/ T. Nouf Almujally
Concepts of Database Management Seventh Edition
Agenda from now on Done: SQL, views, transactions, conceptual modeling, E/R, relational algebra. Starting: XML To do: the database engine: –Storage –Query.
Chapter Object-Oriented Practices. Agenda Object-Oriented Concepts Terminology Object-Oriented Modeling Tips Object-Oriented Data Models and DBMSs.
Databases Chapter Distinguish between the physical and logical view of data Describe how data is organized: characters, fields, records, tables,
1 Lecture 10 XML Wednesday, October 18, XML Outline XML (4.6, 4.7) –Syntax –Semistructured data –DTDs.
File Systems and Databases
1 COS 425: Database and Information Management Systems XML and information exchange.
Winter 2002Arthur Keller – CS 18018–1 Schedule Today: Mar. 12 (T) u Semistructured Data, XML, XQuery. u Read Sections Assignment 8 due. Mar. 14.
ICS (072)Database Systems Background Review 1 Database Systems Background Review Dr. Muhammad Shafique.
Fall 2001Arthur Keller – CS 18017–1 Schedule Nov. 27 (T) Semistructured Data, XML. u Read Sections Assignment 8 due. Nov. 29 (TH) The Real World,
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
1 XML Semistructured Data Extensible Markup Language Document Type Definitions.
The University of Akron Dept of Business Technology Computer Information Systems Database Management Approaches 2440: 180 Database Concepts Instructor:
Copyright 2003 The McGraw-Hill Companies, Inc CHAPTER Application Software computing ESSENTIALS    
WWW and Internet The Internet Creation of the Web Languages for document description Active web pages.
Lecture Two Database Environment Based on Chapter Two of this book:
Object-Oriented Methods: Database Technology An introduction.
RIZWAN REHMAN, CCS, DU. Advantages of ORDBMSs  The main advantages of extending the relational data model come from reuse and sharing.  Reuse comes.
1 Advanced Topics XML and Databases. 2 XML u Overview u Structure of XML Data –XML Document Type Definition DTD –Namespaces –XML Schema u Query and Transformation.
Introduction and Conceptual Modeling
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
1 XML Semistructured Data Extensible Markup Language Document Type Definitions.
4/20/2017.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Systems analysis and design, 6th edition Dennis, wixom, and roth
CSS/417 Introduction to Database Management Systems Workshop 5.
16-1 The World Wide Web The Web An infrastructure of distributed information combined with software that uses networks as a vehicle to exchange that information.
Module Title? DBMS Introduction to Database Management System.
Internet Basics Dr. Norm Friesen June 22, Questions What is the Internet? What is the Web? How are they different? How do they work? How do they.
Chapter 2 Database Environment
Web-Enabled Decision Support Systems
Postacademic Interuniversity Course in Information Technology – Module C1p1 Contents Data Communications Applications –File & print serving –Mail –Domain.
1 CS 430 Database Theory Winter 2005 Lecture 1: Introduction.
CSCE 520- Relational Data Model Lecture 2. Relational Data Model The following slides are reused by the permission of the author, J. Ullman, from the.
HTML, XHTML, and CSS Sixth Edition Chapter 1 Introduction to HTML, XHTML, and CSS.
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
Winter 2006Keller, Ullman, Cushing18–1 Plan 1.Information integration: important new application that motivates what follows. 2.Semistructured data: a.
1 CS 430 Database Theory Winter 2005 Lecture 17: Objects, XML, and DBMSs.
Concepts of Database Management Sixth Edition Chapter 9 Database Management Approaches.
Modern Databases Willem Visser RW334. The Web is Changing the Game Databases used to be the domain of corporations with limited amounts of data and limited.
Discovering Computers Fundamentals Fifth Edition Chapter 9 Database Management.
1 CS1368 Introduction* Relational Model, Schemas, SQL Semistructured Model, XML * The slides in this lecture are adapted from slides used in Standford's.
MULTIMEDIA DATABASES -Define data -Define databases.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
Jeff Ullman: Introduction to XML 1 XML Semistructured Data Extensible Markup Language Document Type Definitions.
Semistructured Data Extensible Markup Language Document Type Definitions Zaki Malik November 04, 2008.
The Semistructured-Data Model Programming Languages for XML Spring 2011 Instructor: Hassan Khosravi.
Object Oriented Database By Ashish Kaul References from Professor Lee’s presentations and the Web.
OWL Representing Information Using the Web Ontology Language.
CSCE 520- Relational Data Model Lecture 2. Oracle login Login from the linux lab or ssh to one of the linux servers using your cse username and password.
Object storage and object interoperability
Fall CSE330/CIS550: Introduction to Database Management Systems Prof. Susan Davidson Office: 278 Moore Office hours: TTh
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
Working with XML. Markup Languages Text-based languages based on SGML Text-based languages based on SGML SGML = Standard Generalized Markup Language SGML.
SEMI-STRUCTURED DATA (XML) 1. SEMI-STRUCTURED DATA ER, Relational, ODL data models are all based on schema Structure of data is rigid and known is advance.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
LECTURE TWO Introduction to Databases: Data models Relational database concepts Introduction to DDL & DML.
Data Models. 2 The Importance of Data Models Data models –Relatively simple representations, usually graphical, of complex real-world data structures.
1 © 2013 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the.
Datab ase Systems Week 1 by Zohaib Jan.
Distributed web based systems
Chapter 27 WWW and HTTP.
Data Model.
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Presentation transcript:

‘Modern’ Databases Database Systems Lecture 18 Natasha Alechina

'Modern' Databases In This Lecture 'Modern' Databases Distributed DBs Web-based DBs Object Oriented DBs Semistructured Data and XML Multimedia DBs For more information Connolly and Begg chapters Ullman and Widom chapter 4

'Modern' Databases Other Sorts of DB We have looked mainly at relational databases Relational model SQL Design techniques Transactions Many of these topics relied on relational concepts There are several other types of DB in use today Distributed DBs Object DBs Multimedia DBs Temporal DBs Logic DBs

'Modern' Databases Distributed Databases A distributed DB system consists of several sites Sites are connected by a network Each site can hold data and process it It shouldn’t matter where the data is - the system is a single entity Distributed database management system (DDBMS) A DBMS (or set of them) to control the databases Communication software to handle interaction between sites

'Modern' Databases Client/Server Architecture The client/server architecture is a general model for systems where a service is provided by one system (the server) to another (the client) Server Hosts the DBMS and database Stores the data Client User programs that use the database Use the server for database access

'Modern' Databases Distributed Databases Network Client(s) Server Client(s) Server Client(s) Server Client(s) Server Client(s) Server

'Modern' Databases Web-based Databases Database access over the internet Web-based clients Web server Database server(s) Web server serves pages to browsers (clients) and can access database(s) Typical operation Client sends a request for a page to the web server Web server sends SQL to database The web server uses results to create page The page is returned to the client

'Modern' Databases Web-based Databases Client (Browser) Web Server Database Server HTTP request SQL query SQL result HTML page

'Modern' Databases Web-based Databases Advantages World-wide access Internet protocols (HTTP, SSL, etc) give uniform access and security Database structure is hidden from clients Uses a familiar interface Disadvantages Security can be a problem if you are not careful Interface is less flexible using standard browsers Limited interactivity over slow connections

'Modern' Databases Object Oriented Databases Relational DBs The database can’t see data’s internal structure so can’t use complex data Relational model gives a simple, and quite powerful, structure - but is quite rigid Object Oriented DBs Use concepts from object oriented design/programming OO concepts Encapsulation Inheritance Polymoprhism OODBMS

'Modern' Databases Object Oriented Databases An object oriented database (OODB) is a collection of persistent objects Objects - instances of a defined class Persistent - object exist independently of any program An object oriented DBMS Manages a collection of objects Allows objects to be made persistent Permits queries to be made of the objects Does all the normal DBMS things as well

'Modern' Databases OODB example In lecture 10 we had a store with different sorts of products Books CDs DVDs This lead to missing data among the various types OODB solution We make an abstract Product class Book, CD, and DVD are each a concrete subclass of Product The database is a persistent collection of Products

'Modern' Databases OODB Example Product is abstract You cannot make a Product directly You can, however, make a Book, CD, or DVD, and these are Products Product BookCDDVD Price Title Shipping Author Publisher ArtistProducer Director

'Modern' Databases Object Oriented Databases Advantages Good integration with Java, C++, etc Can store complex information Fast to recover whole objects Has the advantages of the (familiar) object paradigm Disadvantages There is no underlying theory to match the relational model Can be more complex and less efficient OODB queries tend to be procedural, unlike SQL

'Modern' Databases Object Relational Databases Extend a RDBMS with object concepts Data values can be objects of arbitrary complexity These objects have inheritance etc. You can query the objects as well as the tables An object relational database Retains most of the structure of the relational model Needs extensions to query languages (SQL or relational algebra)

'Modern' Databases Semistructured data Semistructured Data : A new data model designed to cope with problems of information integration. XML : A standard language for describing semistructured data schemas and representing data.

'Modern' Databases The Information-Integration Problem Related data exists in many places and could, in principle, work together. But different databases differ in: Model (relational, object-oriented?). Schema (normalised/ not normalized?). Terminology: are consultants employees? Retirees? Subcontractors? Conventions (meters versus feet?). How do we model information residing in heterogeneous sources (if we cannot combine it all in a single new database)?

'Modern' Databases Example Suppose we are integrating information about bars in some town. Every bar has a database. One may use a relational DBMS; another keeps the menu in an MS-Word document. One stores the phones of distributors, another does not. One distinguishes ales from other beers, another doesn’t. One counts beer inventory by bottles, another by cases.

'Modern' Databases Semistructured Data Purpose: represent data from independent sources more flexibly than either relational or object-oriented models. Think of objects, but with the type of each object its own business, not that of its “class.” Labels to indicate meaning of substructures. Data is self-describing: structural information is part of the data.

'Modern' Databases Graphs of Semistructured Data Nodes = objects. Labels on arcs (attributes, relationships). Atomic values at leaf nodes (nodes with no arcs out). Flexibility: no restriction on: Labels out of a node. Number of successors with a given label.

'Modern' Databases Example: Data Graph Bud A.B. Gold1995 MapleJoe’s M’lob beer bar manf servedAt name addr prize yearaward root The bar object for Joe’s Bar The beer object for Bud Notice a new kind of data.

'Modern' Databases XML XML = Extensible Markup Language. While HTML uses tags for formatting (e.g., “italic”), XML uses tags for semantics (e.g., “this is an address”). Key idea: create tag sets for a domain (e.g., bars), and translate all data into properly tagged XML documents. Well formed XML - XML which is syntactically correct; tags and their nesting totally arbitrary. Valid XML - XML which has DTD (document type definition); imposes some structure on the tags, but much more flexible than relational database schema.

'Modern' Databases XML and Semistructured Data Well-Formed XML with nested tags is exactly the same idea as trees of semistructured data. XML also enables non-tree structures (with references to IDs of nodes), as does the semistructured data model.

'Modern' Databases Example: Well-Formed XML Joe’s Bar Bud 2.50 Miller 3.00 …

'Modern' Databases Example The XML document is: Joe’s Bar Bud2.50Miller3.00 PRICE BAR BARS NAME... BAR PRICE NAME BEER NAME

'Modern' Databases XPATH and XQUERY XPATH is a language for describing paths in XML documents. Really think of the semistructured data graph and its paths. Why do we need path description language: can’t get at the data using just Relation.Attribute expressions. XQUERY is a full query language for XML documents with power similar to OQL (Object Query Language, query language for object-oriented databases).

'Modern' Databases Multimedia Databases Multimedia DBs can store complex information Images Music and audio Video and animation Full texts of books Web pages They can be used in a wide range of application areas Entertainment Marketing Medical imaging Digital publishing Geographic Information Systems

'Modern' Databases Querying Multimedia DBs Metadata searches Information about the multimedia data (metadata) is stored This can be kept in a standard relational database and queried normally Limited by the amount of metadata available Content searches The multimedia data is searched directly Potential for much more flexible search Depends on the type of data being used Often difficult to determine what the ‘correct’ results are

'Modern' Databases Metadata Searches Example - indexing films we might store Title Year Genre(s) Actor(s) Director(s) Producer(s) We can then search for things like Films starring Kevin Spacey Films directed by Peter Jackson Dramas produced in 2000 We don’t actually search the films

'Modern' Databases Metadata Searches Advantages Metadata can be structured in a traditional DBMS Metadata is generally concise and so efficient to store Metadata enriches the content Disadvantages Metadata can’t always be found automatically, and so requires data entry It restricts the sorts of queries that can be made

'Modern' Databases Content Searches An alternative to metadata is to search the content directly Multimedia is less structured than metadata It is a richer source of information but harder to process Example of content based retrieval Find images similar to a given sample Hum a tune and find out what it is Search for features, such as cuts or transitions in films

'Modern' Databases Content-Based Retrieval QBIC™ (Query By Image Content) from IBM - searches for images having similar colour or layout

'Modern' Databases Content-Based Retrieval Image retrieval is hard It is often not clear when two images are ‘similar’ Image interpretation is unsolved and expensive Different people expect different things Do we look for? Images of roses Images of red things? Images of flowers? Images of red flowers? Images of red roses?

'Modern' Databases Other Topics Temporal DBs Storing data that changes over time Can ask about the history of the DB rather than just the current state System time vs real time Logic DBs A database is a set of facts and rules for manipulating them (like a Prolog program) The DBMS maintains and controls these facts and rules A ‘query’ is made by applying the rules to the facts

'Modern' Databases Next week Wednesday 12-1: Revision and module evaluation lecture If you have suggestions for revision questions, please me! Monday the 28th 9-10: Java and SQL lecture (optional – not in the exam)