1 Visualizing Multi-dimensional Clusters, Trends, and Outliers using Star Coordinates Author : Eser Kandogan Reporter : Tze Ho-Lin 2007/5/9 SIGKDD, 2001.

Slides:



Advertisements
Similar presentations
ENV Envisioning Information Lecture 2 Simple Graphs and Charts Ken Brodlie School of Computing University of Leeds.
Advertisements

A Clustering Framework for Unbalanced Partitioning and Outlier Filtering on High Dimensional Datasets 1 Turgay Tugay Bilgin and A.Yilmaz Camurcu 2 1 Department.
Ranking Outliers Using Symmetric Neighborhood Relationship Wen Jin, Anthony K.H. Tung, Jiawei Han, and Wei Wang Advances in Knowledge Discovery and Data.
SkewReduce YongChul Kwon Magdalena Balazinska, Bill Howe, Jerome Rolia* University of Washington, *HP Labs Skew-Resistant Parallel Processing of Feature-Extracting.
Intelligent Database Systems Lab Presenter: WU, JHEN-WEI Authors: Jorge Gorricha, Victor Lobo CG Improvements on the visualization of clusters in.
Graduate : Sheng-Hsuan Wang
High Dimensional Visualization By Mingyue Tan Mar10, 2004.
A General Model for Relational Clustering Bo Long and Zhongfei (Mark) Zhang Computer Science Dept./Watson School SUNY Binghamton Xiaoyun Wu Yahoo! Inc.
1 Visualizing the Legislature Howard University - Systems and Computer Science October 29, 2010 Mugizi Robert Rwebangira.
Basic Data Mining Techniques Chapter Decision Trees.
Data Mining.
Basic Data Mining Techniques
Marrying Spatialization and Traditional Cartography Geography 200B March 12, 2003 Presented by Kirk Goldsberry.
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
國立雲林科技大學 National Yunlin University of Science and Technology 11 Discovering Personal Gazetteers: An Interactive clustering Approach Changqing Zhou, Dan.
Intelligent Database Systems Lab Presenter: YU-TING LU Authors: Laurens van der Maaten and Geoffrey Hinton ML Visualizing non-metric similarities.
SIMULATION. Simulation Definition of Simulation Simulation Methodology Proposing a New Experiment Considerations When Using Computer Models Types of Simulations.
Data Mining – Intro.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Higher Course Assessment Question Paper Structure  Section 1 20 multiple choice questions 20 marks  Section 2 restricted and extended response questions.
Enterprise systems infrastructure and architecture DT211 4
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Sofia Carlander Kinoshita Laboratory 2004/2005
Data Mining Techniques
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Data Mining Chun-Hung Chou
© The Catholic University of America Dept of Biomedical Engineering ENGR 104: Lecture 2 Statistical Analysis Using Matlab Lecturers: Dr. Binh Tran.
Inductive learning Simplest form: learn a function from examples
Intelligent Database Systems Lab Presenter: MIN-CHIEH HSIU Authors: NHAT-QUANG DOAN ∗, HANANE AZZAG, MUSTAPHA LEBBAH 2013 NN Growing self-organizing trees.
Windows Programming Parallel Coordinates Adithya Addanki(aa207) Instructor: Dr. Yingcai Xiao.
Taxonomies of Visualization Techniques CMPT 455/826 - Week 12, Day 2 w12d2 Sept-Dec
-1- Philipp Heim, Thomas Ertl, Jürgen Ziegler Facet Graphs: Complex Semantic Querying Made Easy Philipp Heim 1, Thomas Ertl 1 and Jürgen Ziegler 2 1 Visualization.
1 Multidimensional Detective Alfred Inselberg, Multidimensional Graphs Ltd Tel Aviv University, Israel Presented by Yimeng Dou
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Visualizing Ontology Components through Self-Organizing.
Reducing the Response Time for Data Warehouse Queries Using Rough Set Theory By Mahmoud Mohamed Al-Bouraie Yasser Fouad Mahmoud Hassan Wesam Fathy Jasser.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Presenter : Keng-Wei Chang Author: Yehuda.
1 K-Means+ID3 A Novel Method for Supervised Anomaly Detection by Cascading K-Means Clustering and ID3 Decision Tree Learning Methods Author : Shekhar R.
McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. 1.
Intelligent Database Systems Lab Presenter : CHANG, SHIH-JIE Authors : Peter Sarlin* 2013.PRL Decomposing the global financial crisis: A Self-Organizing.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author: Manoranjan.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Utilizing Marginal Net Utility for Recommendation in E-commerce.
DDM Kirk. LSST-VAO discussion: Distributed Data Mining (DDM) Kirk Borne George Mason University March 24, 2011.
Image Thresholding Using Type II Fuzzy Sets Source : 2005, Pattern Recognition 38, Author : Hamid R. Tizhoosh Advisor: Chen R. -C. Ph. D( 陳榮昌教授.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 A self-organizing map for adaptive processing of structured.
Intelligent Database Systems Lab Presenter : WU, MIN-CONG Authors : STEPHEN T. O’ROURKE, RAFAEL A. CALVO and Danielle S. McNamara 2011, EST Visualizing.
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
Intelligent Database Systems Lab Presenter: YU-TING LU Authors: Christopher C. Yang and Tobun Dorbin Ng TSMCA Analyzing and Visualizing Web Opinion.
LineUp: Visual Analysis of Multi-Attribute Rankings IEEE INFOVIS 2013 Samuel Gratzl, Johannes Kepler University Alexander Lex, Nils Gehlenborg, Hanspeter.
Visualization of Washing Powder Formulation ———seeking the best ingredients of washing powder.
Using category-Based Adherence to Cluster Market-Basket Data Author : Ching-Huang Yun, Kun-Ta Chuang, Ming-Syan Chen Graduate : Chien-Ming Hsiao.
Intelligent Database Systems Lab Presenter : Chang,Chun-Chih Authors : Emilio Corchado, Bruno Baruque 2012 NeurCom WeVoS-ViSOM: An ensemble summarization.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 3 Basic Data Mining Techniques Jason C. H. Chen, Ph.D. Professor of MIS School of Business.
Intelligent Database Systems Lab Presenter : YU-TING LU Authors : Hsin-Chang Yang, Han-Wei Hsiao, Chung-Hong Lee IPM Multilingual document mining.
Rethinking Choices for Multi-dimensional Point Indexing You Jung Kim and Jignesh M. Patel University of Michigan.
Sentence Similarity Based on Semantic Nets and Corpus Statistics
Twitter Community Discovery & Analysis Using Topologies Andrew McClain Karen Aguar.
Top 10 Unsolved Information Visualization Problems Chaomei Chen Drexel University.
國立雲林科技大學 National Yunlin University of Science and Technology Semantics-preserving dimensionality reduction rough and fuzzy rough based approaches Author.
The KDD Process for Extracting Useful Knowledge from Volumes of Data Fayyad, Piatetsky-Shapiro, and Smyth Ian Kim SWHIG Seminar.
Extension of Star Coordinates into Three Dimensions Nathan Cooprider University of Utah School of Computing Robert Burton Brigham Young.
Data Mining – Intro.
Chapter 13 The Data Warehouse
Higher Course Assessment.
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
cs5984: Information Visualization Chris North
Star Coordinates A Multi-dimensional Visualization Technique with Uniform Treatment of Dimensions.
MIS2502: Data Analytics Clustering and Segmentation
SYMPOSIUM EVALUATION IN HIGHER EDUCATION - AVALIES Title of the Paper
Presentation transcript:

1 Visualizing Multi-dimensional Clusters, Trends, and Outliers using Star Coordinates Author : Eser Kandogan Reporter : Tze Ho-Lin 2007/5/9 SIGKDD, 2001

2 Outline Motivation Objectives Methodology: Star Coordinates Interaction techniques Evaluation Conclusion Personal Comments

3 Motivation Real datasets contain typically more than three attributes of data, representing and making sense of multi-dimensional data has been challenging.

4 Objectives The objective for this paper is to relieve the dimensionality curse on knowledge discovery through simple data representations that are derived from familiar and easy to understand lower dimensional representations.

5 Methodology

6 j: 資料點 i: 屬性

7 Interaction techniques 1. Scaling Scaling 2. Rotation Rotation 3. Marking 4. Range Selection Range Selection 5. Histogram Histogram 6. Footprints Footprints 7. Sticks Sticks

8 Evaluation

9 Conclusion Star Coordinates, aims to let a representation of the higher dimensional space built on the well- known simple representations and also through dynamic interactions that allow users to discover trends, outliers, and clusters easily.

10 Personal Comments Application  Data visualization Advantage  Simple & Easy to understand Disadvantage  The figures in this paper is rough.

11 Scaling

12 Rotation

13 Range Selection

14 Histogram

15 Footprints

16 Sticks

17 Evaluation - Figure 12

18 Evaluation - Figure 13

19 Evaluation - Figure 14 Figure 14. Data point distribution after removing state, area code, phone number, and total minute and calls for day, evening, night, and international calls.

20 Evaluation - Figure 15 Figure 15. Data point partitioned into four clusters based on international service plan and voice plan membership.

21 Evaluation - Figure 16 Figure 16. Total day charge and number of customer service calls play the most significant role in churn for customers without international and voice mail service plans ‧.