Polaris: A System for Query, Analysis, & Visualization of Relational Databases Chris Stolte May 29 th, 2002.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Area: Data visualization & Interface design
C6 Databases.
Automating the Design of Graphical Presentations of Relational Information By Jock Mackinlay (Stanford) Presented by: Jeremy Manson and Bill Shapiro.
POLARIS Area: Data visualization & Interface design A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases By Chris Stolte.
Introduction to Databases
Automating the Design of Graphical Presentations of Relational Information Jock MacKinlay Beth Weinstein March 14, 2001.
Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases Presented by Darren Gates for ICS 280.
Managing Data Resources
1 i247: Information Visualization and Presentation Marti Hearst Multidimensional Graphing.
Using ADO.NET Chapter Microsoft Visual Basic.NET: Reloaded 1.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
Polaris Query, Analysis, and Visualization of Large Hierarchical Relational Databases Pat Hanrahan With Chris Stolte and Diane Tang Computer Science Department.
Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases Chris Stolte and Pat Hanrahan Computer Science Department.
Multiscale Visualization Using Data Cubes Chris Stolte, Diane Tang, Pat Hanrahan Stanford University Information Visualization October 2002 Boston, MA.
Implementing Business Analytics with MDX Chris Webb London September 29th.
Table Lens From papers 1 and 2 By Tichomir Tenev, Ramana Rao, and Stuart K. Card.
Query, Analysis, and Visualization of Hierarchically Structured Data using Polaris Chris Stolte, Diane Tang, Pat Hanrahan July 2002.
Table Lens Introduction to the Table Lens concept Table Lens Implementation Projected Usage Scenarios Usage Comparison with Splus Critical Analysis.
Copyright 2003 The McGraw-Hill Companies, Inc CHAPTER Application Software computing ESSENTIALS    
Automatic Data Ramon Lawrence University of Manitoba
Attribute databases. GIS Definition Diagram Output Query Results.
Data Mining – Intro.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
 A data processing system is a combination of machines and people that for a set of inputs produces a defined set of outputs. The inputs and outputs.
10 December, 2013 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: DPM Meta model CWA1Page 1.
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals Presenter : Parminder Jeet Kaur Discussion Lead : Kailang.
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
XML, distributed databases, and OLAP/warehousing The semantic web and a lot more.
CIDR, Tuesday The Schema-Independent Database UI Eirik Bakke and Edward Benson CIDR 2011 (a proposed holy grail and some suggestions)
Coding for Excel Analysis Optional Exercise Map Your Hazards! Module, Unit 2 Map Your Hazards! Combining Natural Hazards with Societal Issues.
Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3.
Software – Applications software and programming languages
IST722 Data Warehousing Business Intelligence Development with SQL Server Analysis Services and Excel 2013 Michael A. Fudge, Jr.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Polaris A System for Query, Analysis and Visualization of Multidimensional Relational Databases Ugur YENIER.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
SRI International Bioinformatics 1 The Structured Advanced Query Page Tomer Altman & Mario Latendresse Bioinformatics Research Group SRI, International.
Software – Applications software and programming languages.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Some OLAP Issues CMPT 455/826 - Week 9, Day 2 Jan-Apr 2009 – w9d21.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
C OMPUTING E SSENTIALS Timothy J. O’Leary Linda I. O’Leary Presentations by: Fred Bounds.
IST 210 The Relational Language Todd S. Bacastow January 2004.
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
Polaris: A System for Query, Analysis and Visualization of Multi- dimensional Relational Database by Chris Stolte & Pat Hanrahan presenter Andrew Trieu.
CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization 2 1.
Visualization Four groups Design pattern for information visualization
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
1 Budapest University of Technology and Economics Department of Measurement and Information Systems Budapest University of Technology and Economics Fault.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
Defects of UML Yang Yichuan. For the Presentation Something you know Instead of lots of new stuff. Cases Instead of Concepts. Methodology instead of the.
Intro to MIS – MGS351 Databases and Data Warehouses
Data Mining – Intro.
MIS2502: Data Analytics Advanced Analytics - Introduction
Potter’s Wheel: An Interactive Data Cleaning System
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Databases and Data Warehouses Chapter 3
MANAGING DATA RESOURCES
Enhance BI Applications and Simplify Development
CSc4730/6730 Scientific Visualization
Data Warehousing and Data Mining
Slides based on those originally by : Parminder Jeet Kaur
Presentation transcript:

Polaris: A System for Query, Analysis, & Visualization of Relational Databases Chris Stolte May 29 th, 2002

Motivation  Large multi-dimensional databases have become very common corporate data warehouses Amazon, Walmart,… scientific projects: Human Genome Project Sloan Digital Sky Survey  Need effective tools for exploration and analysis of these databases

Existing Tools: Charts typically provide a “gallery” of charts hard to iteratively explore simple charts can display few dimensions

Existing Tools: Pivot Tables common interface to data warehouses simple interface based on drag-and-drop generate text tables from databases:

Polaris: Extending Pivot Tables generate rich table-based graphical displays rather than tables of text single conceptual model for both graphs and tables preserve ability to rapidly construct displays

Polaris Design Goals Two main design goals:  Interactive analysis and exploration versus static visualization  Simple, consistent interface

Design Goal: Analysis & Exploration  Want to extract meaning from data  Process of hypothesis, experiment, and discovery  Path of exploration is unpredictable

UI Requirements for Exploration  Data dense displays: display both many tuples & many dimensions  Multiple display types: different displays suited to different tasks  Exploratory interfaces: rapidly change data transformations and views

Design Goal: Simple, Consistent UI  Excel Pivot tables provide a simple interface for building text-based tables  Graphs require multiple steps: different interfaces and conceptual models  Want to unify tables, graphs, and database queries in one interface

Polaris Demo

Display Types Gantt charts of events for a parallel graphics application on a 32-processor SGI machine. Flights between major airports in the USA Source code colored by cache misses for a parallel graphics application. Major wars and the births of well known scientists as a timeline.

Polaris Formalism  UI interpreted as visual specification (in XML) that defines: table configuration type of graphic in each pane encoding of data as visual properties of marks data transformations  Specification then compiled into queries & drawing commands to generate visualization

Design Decision: Use a Formalism  Why a formalism? unification: unify tables and graphs expressiveness: build visualizations designers did not think of interface simplicity: clearly defined semantics and operations code simplicity: composable language versus monolithic objects declarative: can state what, not how— allows for optimization, etc.

Example specification table configuration }

Specifying Table Configurations  Interface: define table configuration by dropping fields on shelves  Formalism: shelf content interpreted as expressions in table algebra  Can express extremely wide range of table configurations

Specifying Table Configurations  Operands are the database fields each operand interpreted as a set {…} quantitative and ordinal fields interpreted differently  Four operators: concatenation (+), cross (X), nest (/), dot (.)

 Ordinal fields: interpret domain as a set that partitions table into rows and columns: Quarter = {(Qtr1),(Qtr2),(Qtr3),(Qtr4)}  Table Algebra: Operands  Quantitative fields: treat domain as single element set and encode spatially as axes: Profit = {(Profit[-410,650])} 

Concatenation (+) operator  Ordered union of set interpretations: Quarter + ProductType = {(Qtr1),(Qtr2),(Qtr3),(Qtr4)} + {(Coffee), (Espresso)} = {(Qtr1),(Qtr2),(Qtr3),(Qtr4),(Coffee),(Espresso)} Profit + Sales = {(Profit[-310,620]),(Sales[0,1000])}

Cross (x) operator Cross-product of set interpretations: Quarter x ProductType = ProductType x Profit = {(Qtr1,Coffee), (Qtr1, Tea), (Qtr2, Coffee), (Qtr2, Tea), (Qtr3, Coffee), (Qtr3, Tea), (Qtr4, Coffee), (Qtr4,Tea)}

Nest (/) operator  Quarter x Month would create entry twelve entries for each quarter. i.e., (Qtr1, December)  Quarter / Month would only create three entries per quarter based on tuples in database not semantics can be expensive to compute

Dot (.) operator: Hierarchies  Many data warehouses have hierarchical dimensions: Time: Year, Month, Day Location: Country, State, Region  Dot (.) works like Nest (/) except it exploits the defined hierarchies based on semantics not tuples in database  Demo

Formalism  Can mix graph types in single visualization:

Polaris Formalism  Remainder of formalism defined in papers*: specification of different graph types encoding of data as retinal properties of marks in graphs data transformations translation of visual specification into SQL queries * Relevant papers: Query, Analysis, and Visualization of Hierarchically Structured Data using Polaris Chris Stolte, Diane Tang and Pat Hanrahan Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, July Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases (extended paper) Chris Stolte, Diane Tang and Pat Hanrahan IEEE Transactions on Visualization and Computer Graphics, Vol. 8, No. 1, January 2002.

Generating Queries  Database queries automatically generated from specification.  Multiple queries required if level-of-detail varies.  Algebraic manipulation can be used to determine minimal set of queries.  Current interpreters can generate SQL, MDX, or Rivet queries.

Related Visualization Projects  Formalisms for Graphics Wilkinson’s Grammar of Graphics Bertin’s Semiology of Graphics Mackinlay’s APT  Visual Exploration of Databases DeVise, Visage, DataSplash/Tioga-2  Table-based Visualizations Table Lens, Spreadsheet for Visualization

Summary  Exploratory visualization versus presentation  Multiple display types—different questions require different visualizations  Polaris: a novel interface for rapidly constructing table-based graphical displays from multi-dimensional relational databases  Formalisms: powerful declarative tool for specifying complex graphics and tables