Wrapup Amol Deshpande CMSC424. “Inventing the Future” Wednesday at 3:30pm 1115 CSIC Exam.

Slides:



Advertisements
Similar presentations
Database Systems: Design, Implementation, and Management Tenth Edition
Advertisements

Big Data: Analytics Platforms Donald Kossmann Systems Group, ETH Zurich 1.
IiWAS2002, Bandung, Indonesia Teaching and Learning Databases Dr. Stéphane Bressan National University of Singapore.
Introduction to Databases
CMSC424: Database Design Instructor: Amol Deshpande
Chapter Physical Database Design Methodology Software & Hardware Mapping Logical Design to DBMS Physical Implementation Security Implementation Monitoring.
File Systems and Databases
Benchmarking XML storage systems Information Systems Lab HS 2007 Final Presentation © ETH Zürich | Benchmarking XML.
CMSC424: Database Design Instructor: Amol Deshpande
CMSC424: Database Design Instructor: Amol Deshpande
CMSC724: Database Management Systems Instructor: Amol Deshpande
CMSC424: Review. Database Management Systems Manage data ► Store data ► Update data ► Answer questions about the data.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 11 Database Performance Tuning and Query Optimization.
CMSC424, Spring 2005 CMSC424: Database Design Instructor: Amol Deshpande
Chapter 14 The Second Component: The Database.
Some slides adapted from A. Silberschatz et al. Database System Concepts, 5th Ed. Review Database Management Systems I Alex Coman, Winter 2006.
...Looking back Why use a DBMS? How to design a database? How to query a database? How does a DBMS work?
Compe 301 ER - Model. Today DBMS Overview Data Modeling Going from conceptual requirements of a application to a concrete data model E/R Model.
Data Base Management System
Introduction to Database Concepts
Database System Concepts, 5th Ed. ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com Chapter 1: Introduction.
Databases From A to Boyce Codd. What is a database? It depends on your point of view. For Manovich, a database is a means of structuring information in.
Week 1 Lecture MSCD 600 Database Architecture Samuel ConnSamuel Conn, Asst. Professor Suggestions for using the Lecture Slides.
Introduction. 
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 11 Database Performance Tuning and Query Optimization.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
Course Introduction Introduction to Databases Instructor: Joe Bockhorst University of Wisconsin - Milwaukee.
Database Architecture Introduction to Databases. The Nature of Data Un-structured Semi-structured Structured.
Database Organization and Design
Databases From A to Boyce Codd. What is a database? It depends on your point of view. For Manovich, a database is a means of structuring information in.
NoSQL Databases NoSQL Concepts SoftUni Team Technical Trainers Software University
I Information Systems Technology Ross Malaga 4 "Part I Understanding Information Systems Technology" Copyright © 2005 Prentice Hall, Inc. 4-1 DATABASE.
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
Physical Database Design I, Ch. Eick 1 Physical Database Design I About 25% of Chapter 20 Simple queries:= no joins, no complex aggregate functions Focus.
1 CS 430 Database Theory Winter 2005 Lecture 16: Inside a DBMS.
Principles of Information Systems, Sixth Edition Organizing Data and Information Chapter 5.
HND Agri DBMS Introduction MH Mohamed Nafas 1. Why DBMS? 2  Suppose we need to develop a Information system.  How do we  store the data? (use file.
COMU114: Introduction to Database Development 1. Databases and Database Design.
INFO1408 Database Design Concepts Week 15: Introduction to Database Management Systems.
Data resource management
1 CS3431 – Database Systems I Introduction Instructor: Mohamed Eltabakh
Computer Science and Engineering Computer System Security CSE 5339/7339 Session 21 November 2, 2004.
1 Final Review Tuesday, March 6, The Final Date: Tuesday, March 13, 2007 Time: 6:30 - 8:30 Room: EE 037 You must come to campus Open book exam.
Introduction HNDIT DBMS 1. Database Management Systems Module code HNDIT Module title Database Management Systems Credits2HoursLectures15.
Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.
1 CSE444: REVIEW. 2 CSE444 in one slide v Logical : E/R diagram  normalized relations v Physical : files, buffering, and indexes v Logical : Relational.
CPT-S Advanced Databases 11 Yinghui Wu EME 49.
Fall CSE330/CIS550: Introduction to Database Management Systems Prof. Susan Davidson Office: 278 Moore Office hours: TTh
1 Geog 357: Data models and DBMS. Geographic Decision Making.
CS 540 Database Management Systems
Lecture 15: Query Optimization. Very Big Picture Usually, there are many possible query execution plans. The optimizer is trying to chose a good one.
CS3431: C-Term CS3431 – Database Systems I Introduction Instructor: Mohamed Eltabakh
1 Introduction to Database Systems, CS420 SQL Views and Indexes.
Database Management Systems By Dinesha L Lecturer, Dept. Of CSE SSIT, Tumkur Chapter-1.
Chapter 1 Overview of Databases and Transaction Processing.
Christoph F. Eick: Final Words COSC Topics Covered in COSC 3480  Data models (ER, Relational, XML)  Using data models; learning how to store real.
CS 540 Database Management Systems
Chapter 1: Introduction
Introduction Instructor: Mohamed Eltabakh
Modeling Your Data Chapter 2 cs542
 DATAABSTRACTION  INSTANCES& SCHEMAS  DATA MODELS.
Basic Concepts in Data Management
Topics Covered in COSC 6340 Data models (ER, Relational, XML (short))
Topics Covered in COSC 6340 Data models (ER, Relational, XML)
Introduction to Database Systems CSE 444 Lecture 23: Final Review
Query Optimization.
Introduction Instructor: Mohamed Eltabakh
Introduction to Database Systems CSE 444 Lecture 23: Final Review
Final Review Friday, December 8, 2006.
Presentation transcript:

Wrapup Amol Deshpande CMSC424

“Inventing the Future” Wednesday at 3:30pm 1115 CSIC Exam

DBMS at a glance Data Models  Conceptual representation of the data Data Retrieval  How to ask questions of the database  How to answer those questions Data Storage  How/where to store data, how to access it Data Integrity  Manage crashes, concurrency  Manage semantic inconsistencies Not fully disjoint categorization !!

DBMS at a glance Data Models  E/R Model, Relational model  Very simple and hence effective  Easy to make things complicated, very hard to keep them simple  No other data model has survived for so long  What is the future of XML ?

DBMS at a glance Data Retrieval  How to ask questions of the database  Declarative languages are great  Hide complexity from users, can optimize things, can evolve easily  SQL –More or less declarative  How to answer those questions  Parsing --> Optimization --> Processing  Operators: Hashing, sorting, joins, aggregation  Data structures –Hash indexes: Good for equality queries –Tree indexes: For everything else  Optimization: Complex, but key piece of a database system

DBMS at a glance Data Storage  How/where to store data, how to access it  Need to be cognizant of the memory hierarchy  Memory is cheap, disk is very expensive to access  Further disk is cheap to access sequentially, much more expensive to access randomly –Many of our decisions are influenced by this  RAID: Surviving failures  Accessing data: Indexes  What happens if a new form of storage comes along with different properties (say holographic storage ?)  We will need to rethink the tradeoffs, but we now know the approach

DBMS at a glance Data Integrity  Manage crashes, concurrency  Transactions, 2-phase locking  Write-ahead logging  DBMS pretty much the last word on concurrency/recovery  OSs don’t come close to supporting anything like that  Manage semantic inconsistencies  Normalization, FDs  Not easy to identify tools, but we have learned how to think about them –Try to capture them in the E/R diagram as much as possible

Motivation: Data Overload We began the first lecture with discussing the data overload  Huge amounts of data generated every day  Much faster than our ability to process it  Increasing ability to capture more enterprise data  Web, blogs, RSS Feeds etc  Multimedia –Flickr and cellphone cameras has led a revolution in how people take pictures –Videos will be next –Not hard to imagine capturing every moment of your life  Sensor/RFID data –Tiny sensors/RFID just beginning to become ubiquitous –Billions of these generating a tiny amount of data every second is still too much  Biological/Scientific data

Motivation: Data Overload Relational databases help for structured data  But increasingly not sufficient  The things we want to do with data can’t be expressed in SQL  E.g. with biological data, web  Too much unstructured data  Distributed data generation creates additional headaches  Almost impossible to try to collect the data in one location Making sense of this requires not only advances in data processing, but also in data understanding/mining  Interdisciplinary efforts

Some Lessons from RDBMS But can use the lessons learned from developing RDBMS  Data independence / abstraction is good  Hide details, even if initially it leads to inefficiency  Look for structure  Every seemingly highly unstructured data might have structure  Look for patterns in usage  Relational database are fast because query processing is predictable –Unlike say OS workloads which are very hard to optimize for  If you can identify patterns, you can probably optimize them  Declarative languages are great  Say what you want, not how to get it