googling (or duckduckgoing, etc): "topic of interest" type:.csv

Slides:



Advertisements
Similar presentations
10.4 Complex Vector Spaces.
Advertisements

Latent Semantic Analysis
Lecture 19 Singular Value Decomposition
1cs542g-term High Dimensional Data  So far we’ve considered scalar data values f i (or interpolated/approximated each component of vector values.
Motion Analysis Slides are from RPI Registration Class.
Computer Graphics Recitation 6. 2 Last week - eigendecomposition A We want to learn how the transformation A works:
Multivariable Control Systems Ali Karimpour Assistant Professor Ferdowsi University of Mashhad.
5. Topic Method of Powers Stable Populations Linear Recurrences.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 4 March 30, 2005
1 Efficient Clustering of High-Dimensional Data Sets Andrew McCallum WhizBang! Labs & CMU Kamal Nigam WhizBang! Labs Lyle Ungar UPenn.
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
3D Geometry for Computer Graphics
Lecture 20 SVD and Its Applications Shang-Hua Teng.
Lecture 18 Eigenvalue Problems II Shang-Hua Teng.
Olga Sorkine’s slides Tel Aviv University. 2 Spectra and diagonalization A If A is symmetric, the eigenvectors are orthogonal (and there’s always an eigenbasis).
1cs542g-term Notes  Extra class next week (Oct 12, not this Friday)  To submit your assignment: me the URL of a page containing (links to)
Matrices CS485/685 Computer Vision Dr. George Bebis.
CS246 Topic-Based Models. Motivation  Q: For query “car”, will a document with the word “automobile” be returned as a result under the TF-IDF vector.
Next. A Big Thanks Again Prof. Jason Bohland Quantitative Neuroscience Laboratory Boston University.
Linear algebra: matrix Eigen-value Problems
Class 27: Question 1 TRUE or FALSE: If P is a projection matrix of the form P=A(A T A) -1 A T then P is a symmetric matrix. 1. TRUE 2. FALSE.
Distances Between Genes and Samples Naomi Altman Oct. 06.
§ Linear Operators Christopher Crawford PHY
Spectral Analysis based on the Adjacency Matrix of Network Data Leting Wu Fall 2009.
Image transformations Digital Image Processing Instructor: Dr. Cheng-Chien LiuCheng-Chien Liu Department of Earth Sciences National Cheng Kung University.
Eigenvalues The eigenvalue problem is to determine the nontrivial solutions of the equation Ax= x where A is an n-by-n matrix, x is a length n column.
Introduction to Linear Algebra Mark Goldman Emily Mackevicius.
EIGENSYSTEMS, SVD, PCA Big Data Seminar, Dedi Gadot, December 14 th, 2014.
INTRODUCTION TO MATLAB DAVID COOPER SUMMER Course Layout SundayMondayTuesdayWednesdayThursdayFridaySaturday 67 Intro 89 Scripts 1011 Work
Advanced Computer Graphics Spring 2014 K. H. Ko School of Mechatronics Gwangju Institute of Science and Technology.
Instructor: Mircea Nicolescu Lecture 8 CS 485 / 685 Computer Vision.
Market analysis for the S&P500 Giulio Genovese Tuesday, December
Welcome to MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Week 1: Introduction to Topological Data.
1 Chapter 8 – Symmetric Matrices and Quadratic Forms Outline 8.1 Symmetric Matrices 8.2Quardratic Forms 8.3Singular ValuesSymmetric MatricesQuardratic.
3D Geometry for Computer Graphics Class 3. 2 Last week - eigendecomposition A We want to learn how the transformation A works:
Chapter 13 Discrete Image Transforms
Pamela Leutwyler. Find the eigenvalues and eigenvectors next.
CSCE822 Data Mining and Warehousing
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
Review of Linear Algebra
We propose a method which can be used to reduce high dimensional data sets into simplicial complexes with far fewer points which can capture topological.
Lecture: Face Recognition and Feature Reduction
Main Project total points: 500
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.

See also mappersummary2a
Lecture on Linear Algebra
Warm Up!!! ÷ ×3 2 ×2 − 2 2.
SVD: Physical Interpretation and Applications
CS485/685 Computer Vision Dr. George Bebis
6-4 Symmetric Matrices By毛.
Christopher Crawford PHY
This is where R scripts will load
By Charlie Fractal Mentor: Dr. Vignesh Subbian
Parallelization of Sparse Coding & Dictionary Learning
Data Preprocessing Copyright, 1996 © Dale Carnegie & Associates, Inc.
Project HW 6 (Due 3/4) points You are given the following dataset to analyze using TDA Mapper a.) What do you expect the output of TDA mapper to.
Outline Singular Value Decomposition Example of PCA: Eigenfaces.
Symmetric Matrices and Quadratic Forms
Principal Components What matters most?.
This is where R scripts will load
Data Preprocessing Copyright, 1996 © Dale Carnegie & Associates, Inc.
Suppose your data set consists of points living on the trefoil knot, drawn above. Using the above overlapping bins, what will the output of TDA mapper.
Detecting Important Nodes to Community Structure
Subject :- Applied Mathematics
Linear Algebra: Matrix Eigenvalue Problems – Part 2
Lecture 20 SVD and Its Applications
Principal Components What matters most?.
Symmetric Matrices and Quadratic Forms
Marios Mattheakis and Pavlos Protopapas
Presentation transcript:

googling (or duckduckgoing, etc): "topic of interest" type:.csv Lab 3 goals   1.)  Create slide(s) similar to last week where slide includes picture of data set and several different pythonmapper outputs for this data set (similar to last week's sample.pptx).  Upload to Lab 2 slides on ICON. Include slides from a “real” data set. Put your name on these slides. They will be collated and posted on ICON so that the entire class can see what data sets you are interested in. 2.) Take Survey 2/1 on ICON. 3.)  Work with pythonmapper (see mappersummary2a.pdf and http://danifold.net/mapper/index.html ) on a linux machine. 4.)  Create figures that you can use to illustrate TDA mapper both for your poster and project.  Consider constructing some artificial data sets by modifying createArtificalDataSets2.r  and/or using the datasets in pythonmapper and/or examples in TDAmapper README. 5.)  Spend time working with a real data set.  You can find a real data set via googling (or duckduckgoing, etc):  "topic of interest"   type:.csv http://archive.ics.uci.edu/ml/datasets.html https://www.kaggle.com/datasets Using the following command in R: data()

You can also clean up data using excel, python scripts, etc. From: createArtificalDataSets2.r You can also clean up data using excel, python scripts, etc.

First 10 rows from iris data

?write

?write.table

From: uploadDatatoRetc.R

mapper.filters.dm_eigenvector(data, k=0, mean_center=True, metricpar={}, verbose=True, callback=None) Return the k-th eigenvector of the distance matrix. The matrix of pairwise distances is symmetric, so it has an orthonormal basis of eigenvectors. The parameter k can be either an integer or an array of integers (for multi-dimensional filter functions). The index is zero-based, and eigenvalues are sorted by absolute value, so k=0 returns the eigenvector corresponding to the largest eigenvalue in magnitude. If mean_center is True, the distance matrix is double-mean-centered before the eigenvalue decomposition. Reference: [R6], subsection “Principal metric SVD filters”.

Distance Matrix Eigenvector Order of eigenvector: 0

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 0

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 0 5 intervals, 50% Overlap

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1 5 intervals, 50% Overlap

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1 20 intervals, 20% Overlap

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1 20 intervals, 50% Overlap

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1 20 intervals, 80% Overlap

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1 20 intervals, 80% Overlap --Balanced

Filter Function: Eccentricity with exponent = 1 Metric: Euclidian Filter Function: Eccentricity with exponent = 1 Cover: Uniform 1-d cover Clustering: Single modified slide from Maria Gommel

knn distance with k = 5 and 3 bins??? [ ( ) ) ] (