Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases Chris Stolte and Pat Hanrahan Computer Science Department.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Area: Data visualization & Interface design
TU e technische universiteit eindhoven / department of mathematics and computer science Modeling User Input and Hypermedia Dynamics in Hera Databases and.
Lesson 14 Creating Formulas and Charting Data
Automating the Design of Graphical Presentations of Relational Information By Jock Mackinlay (Stanford) Presented by: Jeremy Manson and Bill Shapiro.
POLARIS Area: Data visualization & Interface design A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases By Chris Stolte.
Foundations of Relational Implementation n Defining Relational Data n Relational Data Manipulation n Relational Algebra.
Automating the Design of Graphical Presentations of Relational Information Jock MacKinlay Beth Weinstein March 14, 2001.
Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases Presented by Darren Gates for ICS 280.
Lecture Microsoft Access and Relational Database Basics.
1 i247: Information Visualization and Presentation Marti Hearst Multidimensional Graphing.
Polaris Query, Analysis, and Visualization of Large Hierarchical Relational Databases Pat Hanrahan With Chris Stolte and Diane Tang Computer Science Department.
Multiscale Visualization Using Data Cubes Chris Stolte, Diane Tang, Pat Hanrahan Stanford University Information Visualization October 2002 Boston, MA.
Infovis and data george, laura, tjerk.
Matthias Mayer The Table Lens - Ramana Rao & Stuart K. Card Information Visualization 838b - February 21st 2001 The Table Lens: Merging.
Table Lens From papers 1 and 2 By Tichomir Tenev, Ramana Rao, and Stuart K. Card.
Query, Analysis, and Visualization of Hierarchically Structured Data using Polaris Chris Stolte, Diane Tang, Pat Hanrahan July 2002.
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
Table Lens Introduction to the Table Lens concept Table Lens Implementation Projected Usage Scenarios Usage Comparison with Splus Critical Analysis.
Data Mining – Intro.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
RIZWAN REHMAN, CCS, DU. Advantages of ORDBMSs  The main advantages of extending the relational data model come from reuse and sharing.  Reuse comes.
10 December, 2013 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: DPM Meta model CWA1Page 1.
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals Presenter : Parminder Jeet Kaur Discussion Lead : Kailang.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Biostatistics, statistical software II. A brief survey of statistical program systems Krisztina Boda PhD Department of Medical Informatics, University.
CIDR, Tuesday The Schema-Independent Database UI Eirik Bakke and Edward Benson CIDR 2011 (a proposed holy grail and some suggestions)
Polaris A System for Query, Analysis and Visualization of Multidimensional Relational Databases Ugur YENIER.
ACOT Intro/Copyright Succeeding in Business with Microsoft Excel
1 Lesson 19 Creating Formulas and Charting Data Computer Literacy BASICS: A Comprehensive Guide to IC 3, 3 rd Edition Morrison / Wells.
TEA Science Workshop #3 October 1, 2012 Kim Lott Utah State University.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
Relational Databases and Statistical Processing Andrew Westlake Survey & Statistical Computing
SRI International Bioinformatics 1 The Structured Advanced Query Page Tomer Altman & Mario Latendresse Bioinformatics Research Group SRI, International.
SRI International Bioinformatics 1 The Structured Advanced Query Page Tomer Altman & Mario Latendresse Bioinformatics Research Group SRI, International.
© 2008 by Andrew Webb, Interface Ecology Lab. meta-metadata: an extensible semantic architecture for multimedia metadata definition, extraction, and presentation.
Data Warehousing.
ACCESS CHAPTER 5 FORMS AND REPORTS Learning Objectives: Build a simple form Add a label, text box, and list controls to a form Create a multi-table form.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Polaris: A System for Query, Analysis, & Visualization of Relational Databases Chris Stolte May 29 th, 2002.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Some OLAP Issues CMPT 455/826 - Week 9, Day 2 Jan-Apr 2009 – w9d21.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
Unit 42 : Spreadsheet Modelling
SRI International Bioinformatics 1 The Structured Advanced Query Page Tomer Altman Bioinformatics Research Group SRI, International February 1, 2008.
IST 210 The Relational Language Todd S. Bacastow January 2004.
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
Polaris: A System for Query, Analysis and Visualization of Multi- dimensional Relational Database by Chris Stolte & Pat Hanrahan presenter Andrew Trieu.
Visualization Four groups Design pattern for information visualization
1 2 Concepts of Database Management, 4 th Edition, Pratt & Adamski Chapter 2 The Relational Model 1: Introduction, QBE, and Relational Algebra.
NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. DATABASE.
SRI International Bioinformatics 1 The Structured Advanced Query Page Tomer Altman Mario Latendresse Bioinformatics Research Group SRI International April.
Helpful hints for planning your Wednesday investigation.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
1 Management Information Systems M Agung Ali Fikri, SE. MM.
Programming Languages Concepts Chapter 1: Programming Languages Concepts Lecture # 4.
Defects of UML Yang Yichuan. For the Presentation Something you know Instead of lots of new stuff. Cases Instead of Concepts. Methodology instead of the.
Data Mining – Intro.
Potter’s Wheel: An Interactive Data Cleaning System
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Data Warehouse.
DATA CUBE Advanced Databases 584.
Enhance BI Applications and Simplify Development
CSc4730/6730 Scientific Visualization
Data Warehousing and Data Mining
CSCI N207 Data Analysis Using Spreadsheet
Slides based on those originally by : Parminder Jeet Kaur
Presentation transcript:

Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases Chris Stolte and Pat Hanrahan Computer Science Department Stanford University

Motivation Large multi-dimensional databases have become very common – corporate data warehouses Amazon, Walmart,… – scientific projects: Human Genome Project Sloan Digital Sky Survey Need tools for exploration and analysis of these databases

The Pivot Table Interface – common interface to data warehouses – simple interface based on drag-and-drop – generate text tables from databases:

Polaris: Extending the Pivot Table Interface – generate rich table-based graphical displays rather than tables of text – single conceptual model for both graphs and tables – preserve ability to rapidly construct displays

Polaris Design Goals Interactive analysis and exploration versus static visualization Simple, consistent interface

Design Goal: Analysis & Exploration Want to extract meaning from data Process of hypothesis, experiment, and discovery Path of exploration is unpredictable

Requirements on UI for Analysis and Exploration – Data dense displays: display both many tuples & many dimensions – Multiple display types: different displays suited to different tasks – Exploratory interfaces: rapidly change data transformations and views

Design Goal: Simple, Consistent Interface Excel Pivot tables provide a simple interface for building text-based tables Graphs require multiple steps: different interfaces and conceptual models Want to unify tables, graphs, and database queries in one interface

Polaris Demo

Design Decision: Use a Formalism Why a formalism? – unification: unify tables and graphs – expressiveness: build visualizations designers did not think of – interface simplicity: clearly defined semantics and operations – code simplicity: composable language versus monolithic objects

Polaris Formalism Interface interpreted as visual specification in formal language that defines: – table configuration – type of graphic in each pane – encoding of data as visual properties of marks Specification compiled into data & graphical transformations to generate display

Formalism Example: Specifying Table Configurations Interface: define table configuration by dropping fields on shelves Formalism: shelf content interpreted as expressions in table algebra Can express extremely wide range of table configurations

Formalism Example: Specifying Table Configurations Operands are the database fields – each operand interpreted as a set {…} – quantitative and ordinal fields interpreted differently Three operators: – concatenation (+), cross (X), nest (/)

Ordinal fields - interpret domain as a set that partitions table into rows and columns: QUARTER = {Quarter1,Quarter2,Quarter3,Quarter4}  Table Algebra: Operands Quarter 1Quarter 2Quarter 3Quarter 4 31,40037,12035,60030,900 Quantitative fields – treat domain as single element set and encode spatially as axes: PROFIT = {P[0 - 65,000]} 

Table Algebra: Concatenation (+) Operator Ordered union of set interpretations: QUARTER + PRODUCT_TYPE = {QTR1,QTR2,QTR3,QTR4} + {Coffee, Tea} = {QTR1,QTR2,QTR3,QTR4, Coffee, Tea} Quarter 1Quarter 2Quarter 3Quarter 4 31,40037,12035,60030,900 CoffeeTea 37,12030,900 PROFIT + SALES = {P[0-65,000], S[0-125,000]}

Table Algebra: Cross (X) Operator Cross-product of set interpretations: QUARTER X PRODUCT_TYPE = PRODUCT_TYPE X PROFIT = Quarter 1Quarter 2Quarter 3Quarter 4 CoffeeTeaCoffeeTeaCoffeeTeaCoffeeTea {(Qtr1,Coffee), (Qtr1, Tea), (Qtr2, Coffee), (Qtr2, Tea), (Qtr3, Coffee), (Qtr3, Tea), (Qtr4, Coffee), (Qtr4,Tea)}

Table Algebra: Nest (/) Operator QUARTER X MONTH would create entry twelve entries for each quarter i.e. (Qtr1, December) QUARTER / MONTH would only create three entries per quarter

Formalism Remainder of formalism defined in paper: – specification of different graph types – encoding of data as retinal properties of marks in graphs – translation of visual specification into SQL queries

Related Work Formalisms for Graphics – Wilkinson’s Grammar of Graphics – Bertin’s Semiology of Graphics – Mackinlay’s APT Visual Queries – Trellis display, DeVise, Visage Table-based Visualizations – Table lens, Spreadsheet for Visualization

Wilkinson’s Grammar of Graphics Describes formalism for statistical graphics Different choices in the design of formalism: – non-relational data model – different operators in table algebra Further experience necessary to fairly evaluate differences between our formalisms

Conclusions Novel interface for rapidly constructing table-based graphical displays from multi- dimensional relational databases A formalism for specifying complex graphics and tables Interpretation of visual specifications as relational (SQL) queries and drawing operations.