OLAP tool for comparing time- based data 14 th May 2008 Proposed By Pimolmas Ponchaisakuldee 104915 Advisor Dr. Paul Janecek.

Slides:



Advertisements
Similar presentations
Introduction to Databases
Advertisements

Chapter 18: Data Analysis and Mining Kat Powell. Chapter 18: Data Analysis and Mining ➔ Decision Support Systems ➔ Data Analysis and OLAP ➔ Data Warehousing.
OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Data analysis and interpretation. Agenda Part 2 comments – Average score: 87 Part 3: due in 2 weeks Data analysis.
Learning about software Interfaces.  In this lab, you will examine  Excel Spreadsheet Interface  Access Database Interface  You will also learn about.
Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases Presented by Darren Gates for ICS 280.
The art and science of measuring people l Reliability l Validity l Operationalizing.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Tirgul 9 Amortized analysis Graph representation.
DEPARTMENT OF COMPUTER SCIENCE SOFTWARE ENGINEERING, GRAPHICS, AND VISUALIZATION RESEARCH GROUP 15th International Conference on Information Visualisation.
Table Lens From papers 1 and 2 By Tichomir Tenev, Ramana Rao, and Stuart K. Card.
Chapter 2 Web Site Design Principles Principles of Web Design, Third Edition.
2D or 3D ? Presented by Xu Liu, Ming Luo. Is 3D always better than 2D? NO!
Table Lens Introduction to the Table Lens concept Table Lens Implementation Projected Usage Scenarios Usage Comparison with Splus Critical Analysis.
Mgt 20600: IT Management & Applications Databases Tuesday April 4, 2006.
CSCI 1101 Intro to Computers 7.3 Learning HTML. HTML Coding - Frame sets 2 What are Frames? Frames allow independent navigation and content to two (or.
XP New Perspectives on Microsoft Access 2002 Tutorial 71 Microsoft Access 2002 Tutorial 7 – Integrating Access With the Web and With Other Programs.
1 What do People Recall about their Documents? Implications for Desktop Search Tools Tristan Blanc-Brude and Dominique L. Scapin INRIA ACM IUI 2007 (22%)
The Scientific Method.
Tips for Writing Good Scientific Lab Reports!. What makes a good scientific question?  It cannot be answered with “yes” or “no”.  It doesn’t use the.
Classroom User Training June 29, 2005 Presented by:
Chapter 2 Web Site Design Principles
South Africa Data Warehouse for PEPFAR Presented by: Michael Ogawa Khulisa Management Services
User Interface Elements of User Interface Group View.
Web Site Design Principles
Copyright © 2005, SAS Institute Inc. All rights reserved. 1.
OnLine Analytical Processing (OLAP)
-1- Philipp Heim, Thomas Ertl, Jürgen Ziegler Facet Graphs: Complex Semantic Querying Made Easy Philipp Heim 1, Thomas Ertl 1 and Jürgen Ziegler 2 1 Visualization.
Querying Structured Text in an XML Database By Xuemei Luo.
Introduction to Programming the WWW I CMSC Winter 2003 Lecture 6.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
BUSINESS ANALYTICS AND DATA VISUALIZATION
Financial OLAP Tool. “Y.A.E. Financial Consulting Ltd.” Established in 1989 the company has been engaged in Finance, Economics, I.T. and Accounting Consulting.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Organizing Data and Information
Data Structure Visualization Evaluation Aejaaz Kamal, Fanye Cui, Gowri Santhanam, Priya Shivakumar.
Chapter 1: Exploring Data
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
Scientific Method A blueprint for experiment success.
April 26, 2017 Data & Data Analysis 8th Grade Science.
How to train your users to create their own BI reports? So that you (IT) don’t have to Paras Doshi Consultant, SolidQ Chapter co-leader, PASS Business.
TO Each His Own: Personalized Content Selection Based on Text Comprehensibility Date: 2013/01/24 Author: Chenhao Tan, Evgeniy Gabrilovich, Bo Pang Source:
1 Copyright © 2008, Oracle. All rights reserved. I Course Introduction.
Graphing Notes Graphs. Set-up your notes: Graph: a visual display of information or data. Line Graphs:Bar Graphs: Circle Graphs:Variables: Independent.
Copyright © 2006, Infinite Campus, Inc. All rights reserved. Data Analysis: How to Build a Cube.
ACCESS CHAPTER 2 Introduction to ACCESS Learning Objectives: Understand ACCESS icons. Use ACCESS objects, including tables, queries, forms, and reports.
1 Introduction to Database Systems, CS420 SQL Views and Indexes.
FINANCIAL VISUALIZATION SUPPORT FOR BREAKDOWN ANALYSIS A method to visualize multiple foci polyarchies by successive breaking down of visualization based.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Managing Data Resources File Organization and databases for business information systems.
Control of Dynamic Discrete-Event Systems Lenko Grigorov Master’s Thesis, QU supervisor: Dr. Karen Rudie.
THE COMPELLING NEED FOR DATA WAREHOUSING
Expense Trends Analysis A New way of Expense Reporting
Chapter 13 The Data Warehouse
Between-Subjects, within-subjects, and factorial Experimental Designs
Analyzing and Visualizing Data
Fundamentals of Information Systems
Access Maintaining and Querying a Database
Advantages and disadvantages of types of graphs
MANAGING DATA RESOURCES
File Systems and Databases
Data Model.
Introduction of Week 9 Return assignment 5-2
Share What You Have Learned EP Session
Looking At Data.
Charts A chart is a graphic or visual representation of data
Usability and Accessibility Implications of Various Date Picker Methods Andrew Downie.
Presentation transcript:

OLAP tool for comparing time- based data 14 th May 2008 Proposed By Pimolmas Ponchaisakuldee Advisor Dr. Paul Janecek

Contents Introduction 1 Implementation 2 Experimental Evaluation 3 Conclusion 4

What is Data warehouse & OLAP  Data warehouse  a very large database with special characteristic  Data for analyzing  OLAP tool (On-Line Analytical Processing)  A tool on top of data warehouse  For data exploring and analyzing 3

OLAP tool Problems  Cube navigation  Data presentation 4

OLAP tool Problems  Cube navigation  Problems of window-explorer-like tree browser Lose in tree topology Need several clicks  Requirements Overall topology Multiple focuses and topology at the same time  Solutions DOITree browser (better topology representation, multiple focuses, and go more than one level of hierarchy per click) DOITree VS Window-Explorer-like browser 5

OLAP tool Problems  Data presentation  Tasks Trend finding over time Trend comparison between 2 period of time  Problems Pivot table: hard to do the analysis task Existing graph: complex, hard to focus Polaris table –does not support hierarchy exploring of data –Problem of composition for comparison task  Solution Focus on demand graph = graph matrix + overlaid graph 6

Objectives  To build OLAP tool  To evaluate effectiveness of DOITree visualization for OLAP-specific navigation compare to Window-Explorer-like visualization  To evaluate visual tools for time-based comparison task Graph matrix Overlaid graph 7 Focus on demand graph

Contents Introduction 1 Implementation 2 Experimental Evaluation 3 Conclusion 4

Implementation  Java to modified JRubik 9 Cube Navigator Pivot table presentation Graph matrix presentation Overlaid Graph

Design when measure is on column 10

Design when measure is on row 11

Contents Introduction 1 Implementation 2 Experimental Evaluation 3 Conclusion 4

Evaluation 1: (WD) VS (DOI) Window-Explorer Like browser (WD) of JRubik DOI tree browser (DOI) of the modified JRubik 13

14 Evaluation 2: (GE) VS (FOCUS) Overlaid Graph General graph (GE) Focus on demand graph (FOCUS) Graph Matrix + VS

Experiment design  Participants will be trained to use the 2 tools  The modified JRubik = (DOI) and (FOCUS)  JRubik = (WD) and (GE)  Experiment design with counterbalancing order 15 ParticipantsFirst ToolSecond ToolData Student01 to Student10 The modified JRubikJRubikFoodmart Student11 to Student20 JRubikThe modified JRubikFoodmart Staff1 and 2The modified JRubikJRubikAIT admission Staff3 and 4JRubikThe modified JRubikAIT admission

First Evaluation: (WD) VS (DOI)  Tasks for browser comparison  Only browsers (one attribute)  Appendix C page 83 Task1 - First Node Finding Task2 - First time Node Finding Task3 - Subtree Revisiting Task4 - Node Revisiting  The browsers as query tool (many attributes)  Appendix D page 84 Task1 - First time Node Finding Task2 - Subtree Revisiting Task3 - Node Revisiting 16 WD DOI WD DOI

Second Evaluation: (GE) VS (FOCUS)  Tasks for graph comparison  Simple Analysis Trend finding, Trend comparing  Complex Analysis Trend comparing 17

Second Experiment: (G) VS (F)  Simple Analysis  Complex Analysis 18

Variables studied using Foodmart 19 Browsers comparisonGraphs comparison (WD) VS (DOI)(GE) VS (FOCUS) Independent variables Task types Cube Structure Browser Type Screen area Participant demographics Task types Presentation type Screen area Participant demographics Dependent variables Speed to complete tasks Number of clicked nodes Satisfaction score Speed to complete tasks Number of correct answer Satisfaction score

Variables studied using AIT ADM 20 Browsers comparisonGraphs comparison (WD) VS (DOI)(GE) VS (FOCUS) Independent variables Task types Cube Structure Browser Type Screen area Participant demographics Task types Presentation type Screen area Participant demographics Dependent variables Speed to complete tasks Number of clicked nodes Satisfaction score Speed to complete tasks Number of correct answer Satisfaction score

Experiment Result  Browser comparison result  Browsers only: Pair-Sample T-test  Browser as a query tool: Pair-Sample T-test  Participants opinions: One way ANOVA  Graph comparison result  Simple and Complex Analysis Pair-Sample T-test One way ANOVA  Participants opinions One way ANOVA 21

Experiment Result  Browser comparison result: browsers only  Subtree revisiting – users can leave subtrees open 22 First time node finding First node finding Subtree revisiting Node revisiting First time node finding First node finding Subtree revisiting Node revisiting Sig. DOI Sig. WD WD DOI Sig. WD Sig. DOI DOI WD

Experiment Result  Browser comparison result: browsers only  Finding for DOITree Advantage –More information help users to recover from mistake Disadvantage –More information can distract users to go wrong path 23

Experiment Result  Browser comparison result: browser as a query tool 24 First node finding Subtree revisiting Node revisiting WD DOI Sig. DOI Sig. WD WD DOI

Experiment Result  Browser comparison result: browser as a query tool  (DOI) always take less clicks  Go more than one level per click using (DOI)  Users shrink subtree before finding new node using (WE) 25 First node finding Subtree revisiting Node revisiting 25 WD DOI Sig. DOI Sig. WD WD DOI

Experiment Result  Browser comparison result: Opinions  Likeness: auto shrink, go more than one level  Dislikeness: dizzy (many subtrees open), cannot leave subtree open 26

Experiment Result  Graph comparison result 27 FOCUS GE Sig. FOCUS Sig. GE GE FOCUS

Experiment Result  Graph comparison result : Opinions  Likeness: focus  Dislikeness: graph and label are too small 28

Experiment Result  ANOVA: 3 groups of student participants  Satisfaction score on tree browsers: no sig. was found  Satisfaction score on graphs: no sig. was found  Simple Analysis: Next slide  Complex Analysis: no sig. was found 29

Experiment Result  ANOVA: 3 groups of student participants  Simple analysis task 30 G2: OLAP using G1: OLAP using & ex G3: Non OLAP ex G1 Sig.: OLAP using & ex Sig. G3 Sig. Non OLAP ex Sig.

Experiment Result  ANOVA: 2 groups of student participants 31 G2: JRubik G1: Mod. JRubik G2: JRubik G1: Mod. JRubik

Discussion  Personal Evaluation 32 TasksSubtasksJPivotFreeAnaly sis JRubikModified JRubik T4: Analyzing visualization presentation T4.1: How easy to read text of each graph+++++ T4.2: How easy to read exact value in graph T4.3: How easy to find trend of data T4.4: How easy to compare trend of data

Contents Introduction 1 Implementation 2 Experimental Evaluation 3 Conclusion 4

 Problems  Cube navigation, Data Presentation  Objectives  to evaluate effectiveness of (DOI) compare to (WD)  to evaluate the focus on demand graph for time-based data analysis tasks  Browser comparison result: 34 TasksThe betterSig.Note Browsers only First time node finding(DOI)  - First node finding=  - Subtree revisiting(WE) Leave subtree open Node revisiting(DOI)  Some users shrink subtree Browsers as query tool First node finding(DOI) At middle depth (very deep has animation) Subtree revisiting(DOI) When subtree expanded Node revisiting(DOI) When subtree expanded WD DOI

Conclusion  Graph comparison result:  Implications  Browser comparison: Browser only: (DOI) is better for long term usage, (WE) is good for revisiting tasks Query tool: DOITree is better for unknown data and suitable for the analyzers who are always analyze unknown data  Graph comparison: separated graph in graph matrix can eliminate complexity Users can focus what they want line graph is a suitable presentation when time is on X axis 35 CriteriaThe betterSig.Note Pair-sample T-Test Simple Analysis Tasks(FOCUS) - Complex Analysis Tasks(FOCUS) - FOCUS GE