Cluster Analysis & Multidimensional Scaling Looking for like approaches (and an introduction to Systat)

Slides:



Advertisements
Similar presentations
CLUSTERING.
Advertisements

Different types of data e.g. Continuous data:height Categorical data ordered (nominal):growth rate very slow, slow, medium, fast, very fast not ordered:fruit.
Copyright Jiawei Han, modified by Charles Ling for CS411a
Clustering Clustering of data is a method by which large sets of data is grouped into clusters of smaller sets of similar data. The example below demonstrates.
Numeracy & Quantitative Methods: Numeracy for Professional Purposes Laura Lake.
Discrimination amongst k populations. We want to determine if an observation vector comes from one of the k populations For this purpose we need to partition.
Clustering.
Conceptualization and Measurement
4/25/2015 Marketing Research 1. 4/25/2015Marketing Research2 MEASUREMENT  An attempt to provide an objective estimate of a natural phenomenon ◦ e.g.
Part II Sigma Freud & Descriptive Statistics
Multivariate statistical analysis Introductions and basic data analysis.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT
5/15/2015Marketing Research1 MEASUREMENT  An attempt to provide an objective estimate of a natural phenomenon ◦ e.g. measuring height ◦ or weight.
Chapter 17 Overview of Multivariate Analysis Methods
Problem Solving Profiles Approaching learning with different views.
Chapter 7 Measuring Instruments. ALL VARIABLES ARE NOT MEASURED THE SAME  Nominal Variables  Ordinal Variables  Interval Variables  Ratio Variables.
Measurement Concepts. CONSTRUCT VALIDITY OF MEASURES Indicators of Construct Validity Face validity Aggression questionnaire Not enough to really determine.
Nominal Level Measurement n numbers used as ways to identify or name categories n numbers do not indicate degrees of a variable but simple groupings of.
1 Data Analysis  Data Matrix Variables ObjectsX1X1 X2X2 X3X3 …XPXP n.
Clustering with FITCH en UPGMA Bob W. Kooi, David M. Stork and Jorn de Haan Theoretical Biology.
1 Measurement PROCESS AND PRODUCT. 2 MEASUREMENT The assignment of numerals to phenomena according to rules.
Chapter 6 Measuring Instruments. ALL VARIABLES ARE NOT MEASURED THE SAME Nominal Variables Ordinal Variables Interval Variables Ratio Variables.
1 Measurement Adapted from The Research Methods Knowledge Base, William Trochim (2006). & Methods for Social Researchers in Developing Counries, The Ahfad.
Traditional Approach Using Questionnaires. 2 Introduction  Questionnaires  for acquiring large amount of information  Questionnaires allow us to study:
Business Research Method Measurement, Scaling, Reliability, Validity
Chapter 1 The Nature of Probability and Statistics.
Scientific Method/Statistics. Scientific Method 1. Observe 2. Hypothesize/Predict 3. Experiment Try again! 4. Analyze 5. Conclude 6. Report.
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides 1.
What is a Measurement? Concept of measurement is intuitively simple  Measure something two concepts involved  The thing you are measuring  The measurement.
1 Multivariate Analysis (Source: W.G Zikmund, B.J Babin, J.C Carr and M. Griffin, Business Research Methods, 8th Edition, U.S, South-Western Cengage Learning,
AN INTRODUCTION DATA COLLECTION AND TERMS POSTGRADUATE METHODOLOGY COURSE.
Measurement Theory Michael J. Watts
Chapter 6: Using Questionnaire Lecture 6 Topics –Question types –Scales –Formatting the questionnaire –Administering the questionnaire –Web questionnaires.
Cluster analysis 포항공과대학교 산업공학과 확률통계연구실 이 재 현. POSTECH IE PASTACLUSTER ANALYSIS Definition Cluster analysis is a technigue used for combining observations.
Learning Objective Chapter 9 The Concept of Measurement and Attitude Scales Copyright © 2000 South-Western College Publishing Co. CHAPTER nine The Concept.
Week #12 Assignment For your Week #12 assignment, you will write your Methods and Results Chapters for your descriptive statistics.
1 Cluster Analysis Objectives ADDRESS HETEROGENEITY Combine observations into groups or clusters such that groups formed are homogeneous (similar) within.
Methodology v v Type of research v v Design v v Data gathering procedures v v Data analysis procedures v v Population v v Sample and sampling.
Multivariate Data Analysis Chapter 1 - Introduction.
K-Means Algorithm Each cluster is represented by the mean value of the objects in the cluster Input: set of objects (n), no of clusters (k) Output:
Data Mining Techniques Clustering. Purpose In clustering analysis, there is no pre-classified data Instead, clustering analysis is a process where a set.
Cluster Analysis.
SOCI 2003B: Sociological Methods Colleen Anne Dell, Ph.D. Carleton University, Department of Sociology & Anthropology Canadian Centre on Substance Abuse.
Université d’Ottawa / University of Ottawa 2001 Bio 8100s Applied Multivariate Biostatistics L10.1 Lecture 10: Cluster analysis l Uses of cluster analysis.
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 5-1 Data Mining Methods: Classification Most frequently used DM method Employ supervised.
Author(s): Carl Berger, 2011 License: Unless otherwise noted, this material is made available under the terms of the Attribution – Share Alike 3.0 license.
Definition Finding groups of objects such that the objects in a group will be similar (or related) to one another and different from (or unrelated to)
McGraw-Hill/IrwinCopyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved. MEASUREMENT Chapter 11.
Planning the Data Analysis. Statistical and Data Processing Packages 1. Today, in most cases, the computer is used for data processing and analysis. 2.
Multidimensional Scaling
1 Cluster Analysis – 2 Approaches K-Means (traditional) Latent Class Analysis (new) by Jay Magidson, Statistical Innovations based in part on a presentation.
MULTIVARIATE ANALYSIS. Multivariate analysis  It refers to all statistical techniques that simultaneously analyze multiple measurements on objects under.
Chapter_20 Cluster Analysis Naresh K. Malhotra
BSHS 382 Week 2 DQ 2 Check this A+ tutorial guideline at 382/UOP%20BSHS-382-Week-2-DQ-2 Give two example of each:
Measurement Theory Michael J. Watts
Measurement in Marketing Research
Ch. 5 Measurement Concepts.
Computing Reliability
Integration of Clustering and Multidimensional Scaling to Determine Phylogenetic Trees as Spherical Phylograms Visualized in 3 Dimensions  Introduction.
Methods Chapter Format Sources of Data Measurements
Author(s): Carl Berger, 2011
Clustering and Multidimensional Scaling
Chapter 6 Using Questionnaires
CSCI N317 Computation for Scientific Applications Unit Weka
Clustering Wei Wang.
CONCEPT TO BE INCLUDED Variables Values Recoding Ratio.
Chapter_20 Cluster Analysis
Group 9 – Data Mining: Data
CONCEPT TO BE INCLUDED Variable Value.
Presentation transcript:

Cluster Analysis & Multidimensional Scaling Looking for like approaches (and an introduction to Systat)

Cluster Analysis A way of grouping data May be by cases  You want to find who is like who else  How alike are the cases? May be by variables  Are some variables like other variables  If so you can reduce the number of variables you work with  Or you can verify that they are similar by the way folks have responded

Attributes Most cluster analysis is exclusive, that is, any variable or case cannot be in two clusters at the same time Several kinds of clustering  Hierarchical, additive and partitioned Based on some kind of correlation of the data  Some clustering techniques are swayed by having different scales while others are not. Stay tuned.

Data Uses a variable by case format Can also use a correlation matrix Data can be nominal, ordinal, interval or ratio but each should have a different way to join the clusters

Output Generally a tree, dendogram or icicle May show several user defined groups and how well each case (or variable) fits with it’s average or mean group Can be refined and localized Have face and relational reliability Works best with ~20 or less variables or cases

Looking at the PSP How do people in the class cluster? How do Profiles cluster? Profiles  Global Local  Alone Collaboration  Help Persistence  Innovation Tried  Plan Serendipity

Correlation matrix of the variables GLBLLCL HNTPRSNC INVTNTRDTRU PLNSRDPT ALNOTHRS GLBLLCL1.000 HNTPRSNC INVTNTRDTRU PLNSRDPT ALNOTHRS

Help Persistence Innovation Tried and True Global Local Alone Others Planned Serendipity

Join command (Systat only)

Multidimensional Scaling Multidimensional Scaling is a method to fit a set of points in space that best represents the dissimilarity between all the points.