Fall 2013, Databases, Exam 2 Questions for the second exam. Your answers are due by Dec. 18 at 4PM. (This is the final exam slot.) And please type your.

Slides:



Advertisements
Similar presentations
Chapter 10: Designing Databases
Advertisements

What is a Database By: Cristian Dubon.
Lukas Blunschi Claudio Jossen Donald Kossmann Magdalini Mori Kurt Stockinger.
C6 Databases.
Fall 2014, project. XAMPP (or other stack) Go to Install a MySQL, Apache,
Management Information Systems, Sixth Edition
COLUMN-BASED DBS BigTable, HBase, SimpleDB, and Cassandra.
CMPT 354, Simon Fraser University, Fall 2008, Martin Ester 52 Database Systems I Relational Algebra.
OLAP. Overview Traditional database systems are tuned to many, small, simple queries. Some new applications use fewer, more time-consuming, analytic queries.
Chapter 12: ADO.NET and ASP.NET Programming with Microsoft Visual Basic.NET, Second Edition.
Mgt 240 Lecture MS Excel and Access: Introduction to Databases September 23, 2004.
1 Lecture 13: Database Heterogeneity Debriefing Project Phase 2.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
The University of Akron Dept of Business Technology Computer Information Systems Database Management Approaches 2440: 180 Database Concepts Instructor:
Rutgers University Relational Algebra 198:541 Rutgers University.
SESSION 7 MANAGING DATA DATARESOURCES. File Organization Terms and Concepts Field: Group of words or a complete number Record: Group of related fields.
Relational Algebra, R. Ramakrishnan and J. Gehrke (with additions by Ch. Eick) 1 Relational Algebra.
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
XML, distributed databases, and OLAP/warehousing The semantic web and a lot more.
ONTOLOGY SUPPORT For the Semantic Web. THE BIG PICTURE  Diagram, page 9  html5  xml can be used as a syntactic model for RDF and DAML/OIL  RDF, RDF.
Concepts of Database Management, Fifth Edition
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
Databases From A to Boyce Codd. What is a database? It depends on your point of view. For Manovich, a database is a means of structuring information in.
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3.
Managing Data Resources
Systems analysis and design, 6th edition Dennis, wixom, and roth
16-1 The World Wide Web The Web An infrastructure of distributed information combined with software that uses networks as a vehicle to exchange that information.
On-Line Analytic Processing Chetan Meshram Class Id:221.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Chapter 9 Database Management Discovering Computers Fundamental.
1 Adapted from Pearson Prentice Hall Adapted form James A. Senn’s Information Technology, 3 rd Edition Chapter 7 Enterprise Databases and Data Warehouses.
PostgreSQL and relational databases As well as assignment 4…
OnLine Analytical Processing (OLAP)
NOSQL DATABASES Please remember to read the NOSQL Distilled book and the Seven Databases book.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
1 Data Warehouses BUAD/American University Data Warehouses.
Database A database is a collection of data organized to meet users’ needs. In this section: Database Structure Database Tools Industrial Databases Concepts.
1.file. 2.database. 3.entity. 4.record. 5.attribute. When working with a database, a group of related fields comprises a(n)…
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
Info Systems Fall 2013 . The modern role of often not-so-modern database technology  We will look at MySQL SQL PHP  NoSQL DBs Mongo and GUIs for it.
BI Terminologies.
Chapter 4 Data and Databases. Learning Objectives Upon successful completion of this chapter, you will be able to: Describe the differences between data,
CHAPTER 3 DATABASES AND DATA WAREHOUSES. 2 OPENING CASE STUDY Chrysler Spins a Competitive Advantage with Supply Chain Management Software Chapter 2 –
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
1 CS 430 Database Theory Winter 2005 Lecture 2: General Concepts.
Database Systems Basic Data Management Concepts
By N.Gopinath AP/CSE. There are 5 categories of Decision support tools, They are; 1. Reporting 2. Managed Query 3. Executive Information Systems 4. OLAP.
Fall 2013, Databases, Exam 2 Questions for the second exam…
UNIT-II Principles of dimensional modeling
Object Oriented Database By Ashish Kaul References from Professor Lee’s presentations and the Web.
Building Dashboards SharePoint and Business Intelligence.
Department of Industrial Engineering Sharif University of Technology Session# 9.
Object storage and object interoperability
Sesame A generic architecture for storing and querying RDF and RDFs Written by Jeen Broekstra, Arjohn Kampman Summarized by Gihyun Gong.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
3-1 Chapter 3 Data and Knowledge Management
Managing Data Resources File Organization and databases for business information systems.
N5 Databases Notes Information Systems Design & Development: Structures and links.
Intro to MIS – MGS351 Databases and Data Warehouses
Databases and Data Warehouses Chapter 3
Relational Algebra 461 The slides for this text are organized into chapters. This lecture covers relational algebra, from Chapter 4. The relational calculus.
MANAGING DATA RESOURCES
Database Vs. Data Warehouse
Building your First Cube with SSAS
INTRODUCTION A Database system is basically a computer based record keeping system. The collection of data, usually referred to as the database, contains.
Presentation transcript:

Fall 2013, Databases, Exam 2 Questions for the second exam. Your answers are due by Dec. 18 at 4PM. (This is the final exam slot.) And please type your answers. (You can include hand drawn images if you want.)

1. Cassandra  Describe a database-centric application that would be an excellent match for developing with Cassandra as its data store.(You can use Hbase or Mongo instead; but you must use one of these 3.)  What business owns this application? What product or service to they sell?  What does the data represent in the real world?  Most likely, some of their data is standard transaction processing data that they have to manipulate on an ongoing basis – assume that this is not your concern.  But some of the data they must process every day consists of documents that have internal structure. This is what you are concerned with.  Why would Cassandra be the best thing for them?  List the reasons  Why not a relational DBMS?  Why not Hbase or Mongo?

Cassandra, continued  Draw out the column families (in the data model of Cassandra) for the application  Make them specific and detailed  Make sure your specification isn’t trivial  Write 3 queries for your Cassandra application using CQL  One that creates data with an Insert  One that performs an SQL like summary task  One that creates the column families for your application

2. An XML language  Create an XML language for the general application domain that you used to describe the business or organization you invented in problem 1.  Your goal is to create a shared language that can be used to describe other companies that manage data of the sort sold by your chosen environment.  Note: in problem 1, you described a particular business or organization, and you built a specific data structure for an application, but your XML language needs to be generic, suitable any other companies or organizations similar to the one you described in problem 1.  IMPORTANT: make sure that some of your elements refer to data types and that some of them refer to operations that can be performed on the data. YOU WILL NEED both of THESE IN PROBLEM 4.)

An XML language, continued  For the company that you invented in problem 1, use your new XML language to create a data description for the data you defined structurally in question 1. In other words, you are specifying the XML version of the Cassandra structures you specified in problem 1.  Keep in mind that your language is just a list of elements and attribute types, and that users of your language will created hierarchies and lists of elements by using your primitives; in other words, they can build xml documents that embed your elements inside each other.  Make sure that…  … your language is not trivial. It should have at least 5 sorts of elements and 3 sorts of attributes  … the structure you define by using your XML language uses ALL of the elements and attributes you specified in your XML language.  …the structure you define has at least 2 different element instances with nested elements inside of them.

3. Relational DBs and OLAP  Consider problem 1. You did not have to concern yourself with the transaction processing data in problem 1. Now, you will. Create a relational schema for the online transaction processing data from the app you invented in problem 1.  Specify all Primary Keys and FK’s  Specify all relevant FD’s and MVD’s – but you do not have to normalize your schema  Write 2 OLTP queries for your schema  One does an insert into two tables  One does a general set oriented query with at least one join in it.  Your schema must use only terms from your XML language.

OLAP and OLTP, continued  Create a data warehousing schema to go with your relational schema  Add to your relational schema if you need to do so in order to make your warehousing schema non-trivial. You do not have to go back and add anything new to problems 1 or 2.  Assume that the basic “fact” data being studied over time is money.  Assume that time is one of the dimensions, and break your time data down by year/month/day.  Create three different transactions that can be run in order to look for trends over time. You do not have to use any OLAP operators, regular SQL is fine.

OLAP and OLTP, continued  Carefully describe the data that is taken from the transaction database and how it is aggregated into the dimension tables of the warehouse to create new time slices.  Write a couple of sentences describing the reason for your warehouse and queries – why are these very different from the OLTP transactions?

4. Assertions and inferences  For the application you created in question 1, write a set of assertions.  Create at least 10 triples  The “lefts” and “rights” refer to data; the “middles” refer to the operations that can be applied to the data lefts and the rights.  Make sure that all of the “middles” of your assertions are from the xml language you specified. You can add to your XML language if you want. These are the operations you placed in your XML language.  Make sure that some of the “lefts” and “rights” of your triples come from your XML language (these are the data types you placed in your XML language), and that some of the others consist of URLs pointing to specific documents.  In other words, you will have two kinds of triples, generic ones that are true for all data of certain types (like Cows give Milk), and data-focused ones that relate specific documents to each other (like Betsy-the-cow give (s) chocolate-milk). Only your data will consist of documents, not farm animals.

Assertions and inferences, continued  Create at least two inferences based on your assertions  one that is generic and states a new assertion about how all data of these types are related …  …and one that is based on specific documents data types in the Cassandra database, i.e., creates a new assertion that is based only on specific documents.  In other words, your assertions and your inferences will be of two categories: relating data types and relating individual documents. Important: you do not have to use the same operations in both kinds of assertions. It might be that cows give milk, but that Betsy-the-cow is the color brown.

5. Distributed DBs  Create a heterogeneous distributed database  Use your relational schema from problem 3 as one of them  Specify two other schemas that concern the same general application domain, but do NOT use any of your XML language terms.  Instead, they use their own terms, but make some of them similar to your XML language terms and make others quite different.  For your 2 new schemas, specify their keys, FK’s, FDs and MVDs and deliberately make them conflict with the original schema and with each other

Distributed DBs, continued  Build a set of primitives that you might use to translate these conflicting schemas into a single schema.  For example, you will specify a primitive that can be used to to pick an FD from one schema over an FD from another schema (given that the two FDs conflict).  Be creative and think about other conflicts that must be settled.  IMPORTANT: Your primitives are generic and not based on the terms in the 3 schemas. You are in essence laying out some primitives that can be used to merge two or more schemas  Use your new primitives to turn your 3 schemas into 1. In other words, write a problem with your primitives that merges the 3 schemas into 1.

6. Ambient Intelligence  Describe an instance of ambient intelligence that would be useful to some sizeable and diverse group of people who are in a public place like a train station or a bank.  In what public place is this system located?  Make sure that computer-based device is end-user friendly. Describe what it does. You can draw one or more picture of the interface, if you want.  Choose an application where people are aware that they are accessing a computer; i.e., don’t use a hidden imbedded computer, like one controlling fuel usage in a car.  How is your device able to instantly adapt to these diverse people?  Would it require special hardware, like gaming hardware? If so, describe it. It’s perfectly fine, and maybe a good idea if you use specialized hardware.

Ambient Intelligence, continued  Make sure that it serves hands-on users, but is also web-connected.  How would the connectivity to the web enhance your systems value to people?  How might this connectivity help whoever it is that has placed this ambient intelligence device in this public place?  In other words, the web connectivity could be in the best interest of the users and/or the people who own the device.  Pick a database system (among the ones we have talked about in class) that will store the data that is collected from this device and uploaded into the web server.  Why is this the best database to use? Be specific.  It’s okay to suggest that more than one database system should be used, but you have to defend both and explain why.

7. Database Systems Comparison  Create a chart that includes  Neo4J  Mongo  MySQL  Cassandra  Hbase  In your chart, the vertical axis will have the databases on them and the horizontal axis will have (at least 4) properties that specifically focus on the way these database systems differ in their capabilities (not the kinds of apps they are meant for, but in their actual functionalities).