1 i247: Information Visualization and Presentation Marti Hearst Multidimensional Graphing.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Area: Data visualization & Interface design
Relational Algebra, Join and QBE Yong Choi School of Business CSUB, Bakersfield.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
POLARIS Area: Data visualization & Interface design A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases By Chris Stolte.
Concepts of Database Management Seventh Edition
Concepts of Database Management Sixth Edition
Concepts of Database Management Seventh Edition
Excel Charts – Basic Skills Creating Charts in Excel.
Visual Analytics Research at WPI Dr. Matthew Ward and Dr. Elke Rundensteiner Computer Science Department.
Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases Presented by Darren Gates for ICS 280.
The University of Akron Dept of Business Technology Computer Information Systems The Relational Model: Query-By-Example (QBE) 2440: 180 Database Concepts.
Classifier Decision Tree A decision tree classifies data by predicting the label for each record. The first element of the tree is the root node, representing.
Polaris Query, Analysis, and Visualization of Large Hierarchical Relational Databases Pat Hanrahan With Chris Stolte and Diane Tang Computer Science Department.
Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases Chris Stolte and Pat Hanrahan Computer Science Department.
Multiscale Visualization Using Data Cubes Chris Stolte, Diane Tang, Pat Hanrahan Stanford University Information Visualization October 2002 Boston, MA.
Table Lens From papers 1 and 2 By Tichomir Tenev, Ramana Rao, and Stuart K. Card.
Evaluation of MineSet 3.0 By Rajesh Rathinasabapathi S Peer Mohamed Raja Guided By Dr. Li Yang.
Query, Analysis, and Visualization of Hierarchically Structured Data using Polaris Chris Stolte, Diane Tang, Pat Hanrahan July 2002.
Table Lens Introduction to the Table Lens concept Table Lens Implementation Projected Usage Scenarios Usage Comparison with Splus Critical Analysis.
Attribute databases. GIS Definition Diagram Output Query Results.
Build a database IV: Create queries for a new Access database Overview: Ask your data — create queries It’s time to create queries, one of the most powerful.
Quantifying Data.
INFORMATION VISUALIZATION / EXPLORATORY DATA ANALYSIS Prof. Marti A. Hearst, UC Berkeley Bay Area PHUSE Single Day Event, 9/26/2014.
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
Distributed Data Analysis & Dissemination System (D-DADS) Prepared by Stefan Falke Rudolf Husar Bret Schichtel June 2000.
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals Presenter : Parminder Jeet Kaur Discussion Lead : Kailang.
CS 235: User Interface Design November 12 Class Meeting Department of Computer Science San Jose State University Fall 2014 Instructor: Ron Mak
PY550 Research and Statistics Dr. Mary Alberici Central Methodist University.
Concepts of Database Management, Fifth Edition
Tutor: Prof. A. Taleb-Bendiab Contact: Telephone: +44 (0) CMPDLLM002 Research Methods Lecture 9: Quantitative.
IST722 Data Warehousing Business Intelligence Development with SQL Server Analysis Services and Excel 2013 Michael A. Fudge, Jr.
AGB 260: Agribusiness Information Technology Sort, Filters, Pivot Tables, and Pivot Charts.
Polaris A System for Query, Analysis and Visualization of Multidimensional Relational Databases Ugur YENIER.
Construction Change Order analysis CPSC 533C Analysis Project Presented by Chiu, Chao-Ying Department of Civil Engineering University of British Columbia.
June 6, 2014 IAT Interaction ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS +
DBSQL 3-1 Copyright © Genetic Computer School 2009 Chapter 3 Relational Database Model.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Concepts of Database Management Seventh Edition
Polaris: A System for Query, Analysis, & Visualization of Relational Databases Chris Stolte May 29 th, 2002.
National Levee Database Interactive Reports Instructions NLD Point of Contact 1 US Army Corps of Engineers.
Introduction to Enterprise Guide Jennifer Schmidt Rhonda Ellis Cassandra Hall.
Topics for our first Seminar The readings are Chapters 1 and 2 of your textbook. Chapter 1 contains a lot of terminology with which you should be familiar.
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
Unit 42 : Spreadsheet Modelling
VisDB: Database Exploration Using Multidimensional Visualization Maithili Narasimha 4/24/2001.
VizDB A tool to support Exploration of large databases By using Human Visual System To analyze mid-size to large data.
You can sort Access data so you can view records in the order you want to view them, and you can filter data so you only see the records you want to see.
Polaris: A System for Query, Analysis and Visualization of Multi- dimensional Relational Database by Chris Stolte & Pat Hanrahan presenter Andrew Trieu.
Today’s Goals Answer questions about homework and lecture 2 Understand what a query is Understand how to create simple queries using Microsoft Access 2007.
Data Abstraction and Time-Series Data CS 4390/5390 Data Visualization Shirley Moore, Instructor September 15,
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
1 2 Concepts of Database Management, 4 th Edition, Pratt & Adamski Chapter 2 The Relational Model 1: Introduction, QBE, and Relational Algebra.
DTC Quantitative Methods Summary of some SPSS commands Weeks 1 & 2, January 2012.
Data Mining: Exploring Data
PIVOT TABLES AND CHARTS Leena Razzaq CS1100 Computer Science and its Applications CS1100Pivot tables and charts1.
1 Visual Encoding Andrew Chan CPSC 533C January 20, 2003.
Exploring High-D Spaces with Multiform Matrices and Small Multiples Presented by Ray Chen and Sorelle Friedler Authors: MacEachren, A., Dai, X., Hardisty,
LAB 01: BAR AND LINE CHARTS February 3, 2015 SDS 136 Communicating with Data.
Concepts of Database Management, Fifth Edition Chapter 3: The Relational Model 2: SQL.
Relational Databases Today we will look at: Different ways of searching a database Creating queries Aggregate Queries More complex queries involving different.
Data Visualization with Tableau
EMPA Statistical Analysis
Data Mining – Intro.
DATA CUBE Advanced Databases 584.
CSc4730/6730 Scientific Visualization
Lesson 13 - Cleaning Data Lesson 14 - Creating Summary Tables
Topic 8 – Pivot tables and Charts Lesson 1 – Pivot tables
Slides based on those originally by : Parminder Jeet Kaur
Presentation transcript:

1 i247: Information Visualization and Presentation Marti Hearst Multidimensional Graphing

2 Today Discuss found visualizations Discuss Polaris paper Introducing the EDA assignment In-class practice with EDA

3 The Polaris Framework Goal: support interactive exploration of multi- dimensional relational databases –Nice overview of how to combine different standard visualizations into interactive systems. Data types: –Only ordinal and quantitative! Treats intervals as quantitative Assigns an order to all nominal fields (alphabetical) –Ordinal = dimensions = independent variable –Quantitative = measures = dependent variables Supports design principles: –Small simultaneous multiples for comparison –Data-dense display –Allows proper use of “retinal properties” (Bertin) –Cleveland’s idea regarding mapping independent and dependent variables

4 Polaris Paper Two nice examples of exploratory data analysis –Analysts form hypotheses –Create views to confirm or refute If refuted, follow leads to find new hypotheses –Look for different things Trends Anomalies

5 Specifying Table Configurations Operands are the database fields –each operand interpreted as a set {…} –quantitative and ordinal fields interpreted differently Three operators: –concatenation (+), cross product (X), nest (/)

6 Ordinal fields: interpret domain as a set that partitions table into rows and columns: Quarter = {(Qtr1),(Qtr2),(Qtr3),(Qtr4)}  Table Algebra: Operands  Quantitative fields: treat domain as single element set and encode spatially as axes: Profit = {(Profit[-410,650])} 

7 Concatenation (+) operator  Ordered union of set interpretations: Quarter + ProductType = {(Qtr1),(Qtr2),(Qtr3),(Qtr4)} + {(Coffee), (Espresso)} = {(Qtr1),(Qtr2),(Qtr3),(Qtr4),(Coffee),(Espresso)} Profit + Sales = {(Profit[-310,620]),(Sales[0,1000])}

8 Cross (x) operator Cross-product of set interpretations: Quarter x ProductType = ProductType x Profit = {(Qtr1,Coffee), (Qtr1, Tea), (Qtr2, Coffee), (Qtr2, Tea), (Qtr3, Coffee), (Qtr3, Tea), (Qtr4, Coffee), (Qtr4,Tea)}

9 Nest (/) operator  Quarter x Month would create entry twelve entries for each quarter. i.e., (Qtr1, December)  Quarter / Month would only create three entries per quarter based on tuples in database not semantics can be expensive to compute

10 Combining the Data Types Ordinal - Ordinal

11 Combining the Data Types Quant - Quant

12 Combining the Data Types Ordinal - Quantitative

13 Data Transformations Deriving Additional Fields: –Aggregation Sums Averages / Variances / Std. Deviations Min/Max LOTS of other functions (arctan …) –Counting of Ordinal Dimensions CNT(field) –Discrete Partitioning Binning (fixed-sized groups, for creating histograms) Partitioning (ad hoc sizes, good for encoding data) –Ad hoc Grouping The ordinal version of partitioning Choose meaningful groups

14 Data Transformations (cont) Sorting and Filtering –Filtering allows for choosing which values to focus on –Sorting helps find trends and anomolies Brushing and Tooltips –Brushing allows for selecting and highlighting interesting points; can then create a new dataset with them. –Tableau/Polaris is missing linking, which usually goes with brushing (it’s high on the to-do list). Linking allows you to see which items that are brushed in one view are highlighted in another Undo and Redo –A key interface capability which is well-supported here.

15 Querying the Database

16 Assignment Exploratory Data Analysis –Choose a dataset –Formulate hypotheses –Test these hypotheses and also explore the dataset using visualization tool(s) Tableau and optionally others of your choosing –We’ll supply some datasets or you can use your own –You can work in pairs but not in larger groups Due Monday February 25 (2.5 weeks)

17 EDA Practice Data from UCB Career Center What jobs do graduates get, grouped by major area?