Ungraded quiz Unit 7.

Slides:



Advertisements
Similar presentations
Clustering Basic Concepts and Algorithms
Advertisements

Dimension reduction : PCA and Clustering Agnieszka S. Juncker Slides: Christopher Workman and Agnieszka S. Juncker Center for Biological Sequence Analysis.
EGR 106 – Week 2 – Arrays Definition, size, and terminology Construction methods Addressing and sub-arrays Some useful functions for arrays Character arrays.
Discrim Continued Psy 524 Andrew Ainsworth. Types of Discriminant Function Analysis They are the same as the types of multiple regression Direct Discrim.
IR Models: Latent Semantic Analysis. IR Model Taxonomy Non-Overlapping Lists Proximal Nodes Structured Models U s e r T a s k Set Theoretic Fuzzy Extended.
GRA 6020 Multivariate Statistics Factor Analysis Ulf H. Olsson Professor of Statistics.
Exploring Microarray data Javier Cabrera. Outline 1.Exploratory Analysis Steps. 2.Microarray Data as Multivariate Data. 3.Dimension Reduction 4.Correlation.
Tables, Figures, and Equations
Lines Identifying Different Types of Lines. LINES There are many different types of lines. Can you think of any? vertical horizontal diagonal perpendicular.
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
Chapter 2 Dimensionality Reduction. Linear Methods
1 Vector Space Model Rong Jin. 2 Basic Issues in A Retrieval Model How to represent text objects What similarity function should be used? How to refine.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
CPS120: Introduction to Computer Science Arrays. Arrays: A Definition A list of variables accessed using a single identifier May be of any data type Can.
Cluster Analysis Forming Groups within the Sample of Respondents.
Some matrix stuff.
Basic concepts in ordination
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
What is the determinant of What is the determinant of
Principal Components Analysis. Principal Components Analysis (PCA) A multivariate technique with the central aim of reducing the dimensionality of a multivariate.
Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.
Chapter 13.  Both Principle components analysis (PCA) and Exploratory factor analysis (EFA) are used to understand the underlying patterns in the data.
Lines Identifying Different Types of Lines Next Type your name and send: Next.
Principal Component Analysis Zelin Jia Shengbin Lin 10/20/2015.
Market analysis for the S&P500 Giulio Genovese Tuesday, December
Principal Components Analysis ( PCA)
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
Clustering (2) Center-based algorithms Fuzzy k-means Density-based algorithms ( DBSCAN as an example ) Evaluation of clustering results Figures and equations.
Unsupervised Learning
Cluster analysis Chong Ho Yu
Exploring Microarray data
Showcasing the use of Factor Analysis in data reduction: Research on learner support for In-service teachers Richard Ouma University of York SPSS Users.
Dimension Reduction via PCA (Principal Component Analysis)
Keyboarding True or false
Cluster analysis Chong Ho Yu
Ungraded quiz Unit 6.
Ungraded quiz Unit 3.
PCA, EFA, and PA Chong Ho Yu.
Ungraded quiz Unit 1.
Ungraded quiz Unit 1.
Step-By-Step Instructions for Miniproject 2
Ungraded quiz Unit 5.
Multiplying Matrices.
Ungraded quiz Unit 4.
Multivariate Statistical Methods
Dimension reduction : PCA and Clustering
X.1 Principal component analysis
Week 4: Ungraded review questions
An Introduction to Correlational Research
Advanced Algebra Unit 1 Vocabulary
Identifying Different Types of Lines
Eigenvalues and Eigenvectors
Ungraded quiz Unit 7.
Ungraded quiz Unit 6.
Ungraded quiz Unit 6.
Ungraded quiz Unit 7.
Ungraded quiz Unit 5.
Ungraded quiz Unit 3.
PCA, EFA, and PA Chong Ho Yu.
Ungraded quiz Unit 10.
Ungraded quiz Unit 1.
Ungraded quiz Unit 4.
Ungraded quiz Unit 11.
Ungraded quiz Unit 1.
Unsupervised Learning
Ungraded quiz Unit 2.
Ungraded quiz Unit 3.
Ungraded quiz Unit 9.
Ungraded quiz Unit 8.
Presentation transcript:

Ungraded quiz Unit 7

Show me your fingers Do not shout out the answer, or your classmates will follow what you said. Use your fingers One finger (the right finger) = A Two fingers = B Three fingers = C Four fingers = D No finger = I don’t know. I didn’t study

What is the difference between discriminant analysis (DA) and cluster analysis (CA)? In DA the analyst needs to assign the number of clusters in advance while CA is entirely exploratory.. DA aims to assign observations into existing groups while CA is intended to discover new grouping patterns. DA starts with a centroid but CA can start anywhere. All of the above.

Which of the following statements is true? In normal mixtures the data must be categorical. In latent class analysis the data must be continuous. In K-mean clustering the data can be continuous and categorical. Two-step clustering accepts both continuous and categorical data.

Which of the following about DBSCAN is true? DBSCAN is available in SAS. K-mean clustering might fail to detect a string-like clustering pattern but DBSCAN can detect any shape of clustering. DBSCAN separates concentrated data from sparse data. All of the above.

Which of the following about hierarchical clustering (HC) is true? HC can be top-down or bottom-up. HC and MDS can work hand in hand if the row items and the column items are the same in a symmetrical data matrix (e.g. R1 = C1, R2 = C2…etc.). Besides grouping observations into a few clusters for data reduction, HC can also be used for matching two individuals into a pair. All of the above

Which of the following about two-step clustering is true? In two-step clustering a pre-cluster is created in step 1. In two-step clustering if the model quality index is less than 0.5, then it is considered a poor model. Two-step clustering tends to create asymmetrical clusters. All of the above.

Which of the following about EFA and PCA is true? When the user requests factor analysis in SPSS, the default is PCA. PCA is a data reduction method that does not require any causal structure or theoretical justification. EFA can be used to confirm the factor structure. All of the above.

Which of the following statement about vector is true? A vector consists of scalars. A vector contains both the numeric and directional information. Vectors are used in the subject space. Eigenvectors visually depict eigenvalues. All of the above.

What of the following is true? A loading plot shows the relationship and the grouping patterns of variables, which are represented by vectors. The cosine of the angle between two vectors is the same as Pearson’s r. The biplot shows both the observations and the vectors. All of the above.

Which of the following statement about parallel analysis (PA) is true? PA is based on resampling (bootstrapping). PA with PCA tends to over-factoring. PA with EFA tends to under-factoring. PA can be run in SPSS only.