Is the WWW a DBMS? = Fairly sophisticated search available

Slides:



Advertisements
Similar presentations
1 Introduction to Database Systems CSE444 Instructor: Scott Vandenberg University of Washington Winter 2000.
Advertisements

Database Management Systems 1 Ramakrishnan & Gehrke Introduction to Database Systems Chapter 1 Instructor: Mirsad Hadzikadic.
Chapter 1 Instructor: Murali Mani Database Management Systems.
1 541: Database Systems S. Muthu Muthukrishnan. 2 Some Data Collections I Have Played With….  Wireless call detail records.  U. S. Patents.  AskJeeves.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Introduction to Database Systems Chapter 1 Instructor: Wang-Chien Lee
Database: A collection of related data [Elmasri]. A database represents some aspect of real world called “miniworld” [Elmasri] or “enterprise” [Ramakrishnan].
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Database Management Systems Chapter 1 Instructor: Deborah Strahman
H.Lu/HKUST L1: Course Overview & Review. L01: RDBMS REVIEW -- 2 H.Lu/HKUST The Teaching Staff  Instructor: Lu Hongjun  Office: 3543 (Lift 25-26), HKUST.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Database Management Systems Chapter 1.
1 Introduction to Database Systems Ref. Ramakrishnan & Gehrke Chapter 1.
CS186 - Introduction to Database Systems Spring Semester 2006 Prof. Michael Franklin “Knowledge is of two kinds: we know a subject ourselves, or we know.
1 CENG 302 Introduction to Database Management Systems Nihan Kesim Çiçekli URL:
CS Data Management and Information Processing.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Database Management Systems Chapter 1 Instructor: Ethan Jackson
CSCD34 - Data Management Systems,- A. Vaisman1 CSC D34 - Data Management Systems Instructor: Alejandro Vaisman University of Toronto.
CSC 411/511: DBMS Design 1 1 Dr. Nan WangCSC411_L1_Overview 1 Overview of Database Systems (Chapter 1)
CSC2012 Database Technology & CSC2513 Database Systems.
Database Management Systems 1 Introduction to Database Systems Instructor: Xintao Wu Ramakrishnan & Gehrke.
COP Introduction to Database Systems Prof. Feifei Li.
CS6530 Graduate-level Database Systems Prof. Feifei Li.
 DATABASE DATABASE  DATABASE ENVIRONMENT DATABASE ENVIRONMENT  WHY STUDY DATABASE WHY STUDY DATABASE  DBMS & ITS FUNCTIONS DBMS & ITS FUNCTIONS 
1 CS862 - Advanced Database Systems Sang H. Son
Database Management Systems 1 Ramakrishnan & Gehrke Introduction to Database Systems Chpt 1 Instructor: Xintao Wu.
Database Management Systems 1 Ramakrishnan & Gehrke Introduction to Database Systems Chpt 1 Instructor: Weichao Wang.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Database Management Systems Chapter 1 Overview of Database Systems.
ICS 321 Fall 2009 Introduction to Database Systems Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa.
Introduction to Database Systems1. 2 Basic Definitions Mini-world Some part of the real world about which data is stored in a database. Data Known facts.
1 What Is a DBMS?  A very large, integrated collection of data.  Models real-world enterprise.  Entities (e.g., students, courses)  Relationships (e.g.,
 TERMINOLOGY TERMINOLOGY DATA INFORMATION  NEED OF INFORMATION NEED OF INFORMATION  QUALITIES OF INFORMATION QUALITIES OF INFORMATION  FILE SYSTEM.
1 Geog 357: Data models and DBMS. Geographic Decision Making.
1 CS462- Database Systems Sang H. Son
CS 405G: Introduction to Database Systems Lecture 1: Introduction.
CS 405G: Introduction to Database Systems Lecture 1: Introduction.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Database Management Systems Chapter 1.
1 CENG 351 CENG 351 Introduction to Data Management and File Structures Department of Computer Engineering METU.
Database Management Systems 1 Ramakrishnan & Gehrke Introduction to Database Systems Chpt 1 Instructor: Xin Zhang.
1 Database Management Systems Introduction Instructor: Oliver Schulte
Fundamental of Database Systems
CS186 - Introduction to Database Systems Spring Semester 2007 Professor Eben Haber Professor Mary Roth “Knowledge is of two kinds: we know a subject.
COP Introduction to Database Structures
DATABASE MANAGEMENT SYSTEMS
Introduction to Database Systems Chapter 1
CS4222 Principles of Database System
DATABASE MANAGEMENT SYSTEMS
Outline Types of Databases and Database Applications Basic Definitions
Introduction to Database Systems
Database Management System
Instructor: Elke Rundensteiner
Database Management Systems Chapter 1
CS Data Management and Information Processing
Fundamentals of Information Systems, Sixth Edition
Introduction to Database Systems
Overview of Database Systems Chpt 1
Tools for Memory: Database Management Systems
Instructor: Murali Mani
Database Management Systems Chapter 1
CS186 - Introduction to Database Systems Spring Semester 2003 Prof
DATABASE SYSTEM.
Database Management Systems
CS Data Management and Information Processing
Database Management Systems CSE594
CS186: Introduction to Database Systems
CS 405G: Introduction to Database Systems
Sang H. Son CS6750: Database Systems The slides for this text are organized into chapters. This lecture covers Chapter 1. Chapter 1: Introduction.
DATABASE TECHNOLOGIES
Introduction to Database Systems
CS 405G Introduction to Database Systems
Database Management Systems Chapter 1
Introduction to Database Systems Chpt 1
Presentation transcript:

Is the WWW a DBMS? = Fairly sophisticated search available crawler indexes pages on the web Keyword-based search for pages But, currently data is mostly unstructured and untyped search only: can’t modify the data can’t get summaries, complex combinations of data few guarantees provided for freshness of data, consistency across data items, fault tolerance, … Web sites typically have a DBMS in the background to provide these functions. The picture is changing New standards e.g., XML, Semantic Web can help data modeling Research groups (e.g., at Berkeley) are working on providing some of this functionality across multiple web sites.

“Search” vs. Query What if you wanted to find out which actors donated to John Kerry’s presidential campaign? Try “actors donated to john kerry” in your favorite search engine.

A “Database Query” Approach

Is a File System a DBMS? = A) Yours B) Partner’s C) Both D) Neither Thought Experiment 1: You and your project partner are editing the same file. You both save it at the same time. Whose changes survive? A) Yours B) Partner’s C) Both D) Neither E) ??? Thought Experiment 2: You’re updating a file. The power goes out. Which of your changes survive? Q: How do you write programs over a subsystem when it promises you only “???” ? A: Very, very carefully!! A) All B) None C) All Since Last Save D) ???

Current Commercial Outlook A major part of the software industry: Oracle, IBM, Microsoft, Sybase also Informix (now IBM), Teradata smaller players: java-based dbms, devices, OO, … Well-known benchmarks (esp. TPC) Lots of related industries data warehouse, document management, storage, backup, reporting, business intelligence, app integration Relational products dominant and evolving adapting for extensibility (user-defined types), adding native XML support. Open Source coming on strong MySQL, PostgreSQL, BerkeleyDB

Why Study Databases?? ? Shift from computation to information always true for corporate computing Web made this point for personal computing more and more true for scientific computing Need for DBMS has exploded in the last years Corporate: retail swipe/clickstreams, “customer relationship mgmt”, “supply chain mgmt”, “data warehouses”, etc. Scientific: digital libraries, Human Genome project, NASA Mission to Planet Earth, physical sensors, grid physics network DBMS encompasses much of CS in a practical discipline OS, languages, theory, AI, multimedia, logic Yet traditional focus on real-world apps

What’s the intellectual content? representing information data modeling languages and systems for querying data complex queries with real semantics* over massive data sets concurrency control for data manipulation controlling concurrent access ensuring transactional semantics reliable data storage maintain data semantics even if you pull the plug * semantics: the meaning or relationship of meanings of a sign or set of signs 1. Representing info: both critically useful and profound. where bits and bytes meet “reality” – computational models of reality where bits and bytes meet mental models of reality: thought (because who/what are we modeling for? for other people, who will ask questions to learn more) not unlike data structures, but imagine a 20-year data structure! Correctness, evolution, communication 2. Languages: deeply tied to models for non-programmers, and must support evolution of models. So: declarative, not imperative Query processing: data access and operations for things MUCH bigger than memory. Must generate correct answers! 3. CC:: ensuring correctness in the face of concurrent update. First question: WHAT IS CORRECT? Second: how to ensure that without one-at-a-time operation? 4. Reliability: in the face of user error, software error, hardware failure, catastrophe!

Describing Data: Data Models A data model is a collection of concepts for describing data. A schema is a description of a particular collection of data, using a given data model. The relational model of data is the most widely used model today. Main concept: relation, basically a table with rows and columns. Every relation has a schema, which describes the columns, or fields. 5

Levels of Abstraction Users DB Physical Schema Conceptual Schema Views describe how users see the data. Conceptual schema defines logical structure Physical schema describes the files and indexes used. (sometimes called the ANSI/SPARC model) Physical Schema Conceptual Schema View 1 View 2 View 3 DB 6

Example: University Database Physical Schema Conceptual Schema View 1 View 2 View 3 DB Conceptual schema: Students(sid: string, name: string, login: string, age: integer, gpa:real) Courses(cid: string, cname:string, credits:integer) Enrolled(sid:string, cid:string, grade:string) External Schema (View): Course_info(cid:string,enrollment:integer) Physical schema: Relations stored as unordered files. Index on first column of Students. 7