Basics of a run of (Non-metric) MULTIDIMENSIONAL SCALING

Slides:



Advertisements
Similar presentations
CLUSTERING.
Advertisements

The transformation of an ER or EER model into a relational model
Time Demand Analysis.
Clustering: Introduction Adriano Joaquim de O Cruz ©2002 NCE/UFRJ
Chapter 5 The Simplex Method The most popular method for solving Linear Programming Problems We shall present it as an Algorithm.
CLUSTERING PROXIMITY MEASURES
Quick & Simple Introduction to Multidimensional Scaling Professor Tony Coxon Hon. Professorial Research Fellow, University of Edinburgh (
Multidimensional scaling Research Methods Fall 2010 Tamás Bőhm.
Chapter 17 Overview of Multivariate Analysis Methods
CSCE 641: Forward kinematics and inverse kinematics Jinxiang Chai.
1 Numerical geometry of non-rigid shapes Lecture II – Numerical Tools Numerical geometry of shapes Lecture II – Numerical Tools non-rigid Alex Bronstein.
1 NONLINEAR MAPPING: APPROACHES BASED ON OPTIMIZING AN INDEX OF CONTINUITY AND APPLYING CLASSICAL METRIC MDS TO REVISED DISTANCES By Ulas Akkucuk & J.
Distance Measures Tan et al. From Chapter 2.
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
Proximity matrices and scaling Purpose of scaling Similarities and dissimilarities Classical Euclidean scaling Non-Euclidean scaling Horseshoe effect Non-Metric.
Proximity matrices and scaling Purpose of scaling Similarities and dissimilarities Classical Euclidean scaling Non-Euclidean scaling Horseshoe effect Non-Metric.
1 Numerical geometry of non-rigid shapes Non-Euclidean Embedding Non-Euclidean Embedding Lecture 6 © Alexander & Michael Bronstein tosca.cs.technion.ac.il/book.
Proximity matrices and scaling Purpose of scaling Classical Euclidean scaling Non-Euclidean scaling Non-Metric Scaling Example.
Distance Measures Tan et al. From Chapter 2. Similarity and Dissimilarity Similarity –Numerical measure of how alike two data objects are. –Is higher.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Introduction to the gradient analysis. Community concept (from Mike Austin)
Principal Coordinate Analysis, Correspondence Analysis and Multidimensional Scaling: Multivariate Analysis of Association Matrices BIOL4062/5062 Hal Whitehead.
A B CA’ B’ C’ Similar Shapes The following diagram shows an enlargement by a scale factor 3 of triangle ABC Note Each length on the enlargement A’B’C’
Multidimensional scaling MDS  G. Quinn, M. Burgman & J. Carey 2003.
1 Cluster Analysis Objectives ADDRESS HETEROGENEITY Combine observations into groups or clusters such that groups formed are homogeneous (similar) within.
Slide 14.1 Nonmetric Scaling MathematicalMarketing Chapter 14 Nonmetric Scaling Measurement, perception and preference are the main themes of this section.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Clustering COMP Research Seminar BCB 713 Module Spring 2011 Wei Wang.
Chapter 2: Getting to Know Your Data
Magic Camera Master’s Project Defense By Adam Meadows Project Committee: Dr. Eamonn Keogh Dr. Doug Tolbert.
Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.
Unsupervised Learning
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Multidimensional Scaling and Correspondence Analysis © 2007 Prentice Hall21-1.
Tutorial I: Missing Value Analysis
Multidimensional Scaling
Cluster Analysis, an Overview Laurie Heyer. Why Cluster? Data reduction – Analyze representative data points, not the whole dataset Hypothesis generation.
Clustering Wei Wang. Outline What is clustering Partitioning methods Hierarchical methods Density-based methods Grid-based methods Model-based clustering.
CS685 : Special Topics in Data Mining, UKY The UNIVERSITY of KENTUCKY Dimensionality Reduction CS 685: Special Topics in Data Mining Spring 2008 Jinze.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Dimensionality Reduction Part 1: Linear Methods Comp Spring 2007.
Step 1: Specify a null hypothesis
Squashed How many times must the transformation given by the matrix
Deep Feedforward Networks
Statistical Data Analysis - Lecture /04/03
Lecture 2-2 Data Exploration: Understanding Data
Boosting and Additive Trees (2)
Chapter 5 The Simplex Method
Even More on Dimensions
Similarity and Dissimilarity
Multidimensional Scaling and Correspondence Analysis
School of Computer Science & Engineering
Multidimensional Scaling
Clustering and Multidimensional Scaling
Dimensionality Reduction
APPLICATIONS of FRACTALS
Residuals and Residual Plots
Models for Simulation & Optimization – An Introduction
Multidimensional Scaling
Clustering Wei Wang.
Principal Component Analysis
Multidimensional Scaling
Adequacy of Linear Regression Models
Dataset: Time-depth-recorder (TDR) raw data 1. Date 2
Adequacy of Linear Regression Models
Group 9 – Data Mining: Data
Adequacy of Linear Regression Models
Adequacy of Linear Regression Models
Image Stitching Linda Shapiro ECE/CSE 576.
Exercise 1: Gestational age and birthweight
Levels of Measurement Where were we: ah, yes! Representational measurement … NOW … (Numerical) Representations of a measurement structure are not *usually.
Presentation transcript:

Basics of a run of (Non-metric) MULTIDIMENSIONAL SCALING Data = Function [Model] Dis/ Ordinal Euclidean Similarities (Monotonic) Distance Any measure that Kruskal: Weak Mon (Triangle satisfies simple Guttman: Strong Inequality) requirements Transformations (includes, but much more than, PM) aka Proximities

Basics of a run of non-metricMDS INPUT SPECIFICATION Given a [SSM/LTM/LTD] data matrix of [non-negative] dis/similarities between p objects SPECIFY: dimensionality/ies of solution (5 ->1!) Options for details of type of transformation (default is weak mon; primary ties) Options for model (Minkowski: Euclidean) CHECK: Adequacy of constraints: DCR (Data Compression Ratio) (#data / #params)>2

MULTIDIMENSIONAL SCALING: The Run How does it work? It’s iterative Current trial configuration Distances in current configuration are calculated See how well they fit data – by monotone regression And calculate b-o-f measure, Stress Look at each point and its residuals And see how (and how far) it needs to be moved Move configuration into one of better fit How? Method of steepest descent. Well, there was this aviator, see? … Back to next iteration

RUN: Each iteration:

Run of MULTIDIMENSIONAL SCALING to understand in detail, you need to know about: How and why of MONOTONE / Isotonic / Ordinal Regression How to interpret “Stress” (fit) How to decide whether the fit of solution/configuration is acceptably low If so, how to interpret a configuration How to compare configurations

OUTPUT from Run of MULTIDIMENSIONAL SCALING (this does NOT follow the order of output!) Look at ADEQUACY OF FIT: Stress value (compare to simulation values) Shepard Diagram. To see form of transformation Signs of more regular transformation? Re-run with another transformation Residuals/Point contribution to Stress to locate ill-fitting pts If not acceptable, don’t even bother looking at “Solution”! ONLY THEN… LOOK AT Solution Configuration Consider fixed vs arbitrary aspects (rotation …) And then … GET INTO THE SERIOUS BUSINESS OF INTERPRETATION !