Main Project total points: 500

Slides:



Advertisements
Similar presentations
Your title here: Maybe add some pictures and/or school logo on the left and right authors and affiliation (First names of authors increase interaction.
Advertisements

Lecture 5: Triangulations & simplicial complexes (and cell complexes). in a series of preparatory lectures for the Fall 2013 online course MATH:7450 (22M:305)
Presenting your paper with PowerPoint: Module 8. Why write with PowerPoint? To supplement an oral presentation To incorporate visual and audio media into.
 A complete presentation graphics program that allows you to produce professional looking presentations.  A presentation is also referred to as a slide.
Presentation Template
Abstract # 0000 Make the Main Title with Large Bold Type Your Name Here Your Department Here Texas A&M Health Science Center Make the Main Title with Large.
Top 10 PowerPoint Tips. 1. Keep It Simple –Slide should be simple, the audience should be focused more on you –Avoid complicated or busy slides –Leave.
Use a Large Bold Type for the Main Title Use Smaller Type for the Subtitle. Above type is 96 pt, this type is 66 pt Make Authors’ names smaller. This is.
Make the Main Title with Large Bold Type Your Name and Title Here Your Department Here Texas A&M Health Science Center Make the Main Title with Large Bold.
Abstract # 0000 Make the Main Title with Large Bold Type Use Smaller Type for the Subtitle. Above Type is 105pt. This Type is 70pt. Make authors’ names.
Creating an Effective PowerPoint Presentation
 A complete presentation graphics program that allows you to produce professional looking presentations.
Writing with PowerPoint: A Workshop Brought to You by the Purdue Writing Lab.
MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Sept 9, 2013: Create your own homology. Fall 2013.
Welcome to MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Week 1: Introduction to Topological Data.
Make the Main Title with Large Bold Type Use Smaller Type for the Subtitle. Above Type is 110pt. This Type is 80pt. Make authors’ names smaller. This is.
Unit 5: Developing the Training Program 1 © SHRM 2009.
Creating a cell complex = CW complex Building block: n-cells = { x in R n : || x || ≤ 1 } 2-cell = open disk = { x in R 2 : ||x || < 1 } Examples: 0-cell.
How to use Microsoft PowerPoint
Your title here: Maybe add some pictures and/or school logo on the left authors and affiliation (First names of authors increase interaction potential)
The poster title goes here and here
Computer Fundamentals 1
Main Project total points: 500
Microsoft PowerPoint This is the introduction to PowerPoint.
Permeability (% of Control)
Template for a 52” x 42” Poster Presentation with a 2” Margin
Permeability (% of Control)
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
Evaluating-Ayasdi’s-Topological-Data-Analysis-For-Big-Data_HKim2015
Headline. Poster size: 36 x 48
Lainie Chang Grand Canyon University January 20, 2010 TEC 542
We propose a method which can be used to reduce high dimensional data sets into simplicial complexes with far fewer points which can capture topological.
Creating a cell complex = CW complex
Objectives At the end of this session, students will be able to:
Visual Aids Chapter 14.
Microsoft PowerPoint This is the introduction to PowerPoint.
Main Project total points: 500
The headline should be a complete sentence that makes an assertion which is no more than two lines Photograph, drawing, diagram, or graph supporting.
The headline should be a sentence that makes an assertion which is no more than two lines Photograph, drawing, diagram, or graph supporting the headline.
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
I'd like to suggest that our Ph.D. programs often do students a disservice in two ways. First, I don't think.
Your title here: Maybe add some pictures and/or school logo on the left authors and affiliation (First names of authors increase interaction potential)
Mike Renfro, Center for Manufacturing Research

Mapper.
Guest lecturer: Isabel K. Darcy
Main Project total points: 500
Clustering Via Persistent Homology
Topological Data Analysis
Will you capture their attention?
Main Project total points: 500
Applications Software
Microsoft PowerPoint This is the introduction to PowerPoint.
Poster Template for a 920mmX 750mm (A1) (portrait) poster presentation
Project HW 6 (Due 3/4) points You are given the following dataset to analyze using TDA Mapper a.) What do you expect the output of TDA mapper to.
Guest lecturer: Isabel K. Darcy
Power Point 101.
Your title here: Maybe add some pictures and/or school logo on the left authors and affiliation (First names of authors increase interaction potential)
Headline. Poster size: 36 x 48
Author’s Name/s Goes Here Title goes here
Your title here: Maybe add some pictures and/or school logo on the left authors and affiliation (First names of authors increase interaction potential)
Microsoft PowerPoint This is the introduction to PowerPoint.
Names of Authors in Arial, 44 Points, Bold
Lecture 5: Triangulations & simplicial complexes (and cell complexes).
Use a Large Bold Type for the Main Title (80 pt):
Permeability (% of Control)
Use a Large Bold Type for the Main Title (80 pt):
Use a Large Bold Type for the Main Title (70 pt):
Permeability (% of Control)
Use a Large Bold Type for the Main Title (70 pt):
Presentation transcript:

Main Project total points: 500 200/500 = 40% finished by March 27 Introduction, Background, Partial Results/Discussion, Acknowledgement, Author contribution, funding/conflicts, References 250/500 = 50% finished by April 5 400/500 = 80% finished by April 17 500/500 = 100% finished by April 26

See LABS/ directory for useful R files But you may want to focus on Euclidean distance AFTER normalizing: From databasics3900.r: > # one way to normalize data > scaledata2 <- scale(data2) # scales data so that mean = 0, sd = 1 > colMeans(scaledata2) # faster version of apply(scaled.dat, 2, mean) # shows that mean of each column is 0 Sepal.Length Sepal.Width Petal.Length Petal.Width -4.480675e-16 2.035409e-16 -2.844947e-17 -3.714621e-17 > apply(scaledata2, 2, sd) # shows that standard deviation # of each column is 1 Sepal.Length Sepal.Width Petal.Length Petal.Width 1 1 1 1 ------------------------------------------------------------------------------------------- P<- select(tbl_df(scaledata2), Petal.Length) # Choose filter m1 <- mapper1D( # Apply mapper distance_matrix = dist(data.frame(scaledata2)), filter_values = P, num_intervals = 10, percent_overlap = 50, num_bins_when_clustering = 10) See LABS/ directory for useful R files # save data to current working # directory as a text file write.table(scaledata2, "data.txt", sep=" ", row.names = FALSE, col.names = FALSE)

https://writingcenter.uiowa.edu/#services

Mini-presentations in class: End of March or beginning of April. Over anything related to your project. 5 – 10 minutes/person. Visit speaking center before presentation. Submit summary of visit – what did you learn.

https://speakingcenter.uiowa.edu/about-us

Modified from http://www.garrreynolds.com/preso-tips/design/ 1. Keep it Simple Lots of white space is good: The less clutter you have on your slide, the more powerful your visual message will become. 2. Limit bullet points & text 3. Limit transitions & builds (animation) Only use animations that illustrate a point. Don’t use unnecessary animations. 4. Use high-quality graphics 5. Have a visual theme, but avoid using PowerPoint templates 6. Use appropriate charts 7. Use color well 8. Choose your fonts well use the same font set throughout your entire slide presentation, and use no more than two complementary sans-serif fonts (e.g., Arial and Arial Bold). 9. Use video or audio when appropriate. 10. Organize your talk: Spend time in the slide sorter (or print out your slides at least 6 to a page).

In an assertion-evidence slide, the headline is a sentence that succinctly states the slide’s main message Photograph, drawing, diagram, or graph supporting the headline message (no bulleted list) This file presents a template for making Assertion–Evidence (A–E) slides in a technical presentation. The design advocated by this template arises from pages 113-152 of The Craft of Scientific Presentations (Springer, 2003) and from the first Google listing for “presentation slides”: http://www.writing.engr.psu.edu/slides.html To follow this template, make sure that you create the slide within this PowerPoint file. Working with a New Slide (under Insert in older versions or as a button on the Home tab in version 2007) , you should first craft a sentence headline that states an assertion about your topic. Having no assertion translates to having no slide. In the body of the slide, you should then support that headline assertion visually: photographs, drawings, diagrams, equations, or words arranged visually. Use supporting text only where necessary. Do not use bulleted lists, because bulleted lists do not reveal the connections between details. This slide shows one orientation for the image and supporting text. Other orientations exist, as shown in the sample slides that follow. Call-out(s), if needed: no more than two lines PowerPoint Template: http://writing.engr.psu.edu/AE_template_PSU.ppt 7

http://www.bigdata.uni-frankfurt.de/wp-content/uploads/2015/10/ Evaluating-Ayasdi’s-Topological-Data-Analysis-For-Big-Data_HKim2015.pdf

We are not (currently) covering persistent homology including barcodes

Icon Quiz 9 Reading (10 points; Due 4/6 at 7:00 AM) over first page:

Introduction The purpose of this paper is to introduce a new method for the qualitative analysis, simplification and visualization of high dimensional data sets, as well as the qualitative analysis of functions on these data sets.

Introduction The purpose of this paper is to introduce a new method for the qualitative analysis, simplification and visualization of high dimensional data sets, as well as the qualitative analysis of functions on these data sets.

Example: Point cloud data representing a hand. A) Data Set Example: Point cloud data representing a hand. B) Function f : Data Set  R Example: x-coordinate f : (x, y, z)  x Put data into overlapping bins. Example: f-1(ai, bi) Cluster each bin & create network. Vertex = a cluster of a bin. Edge = nonempty intersection between clusters http://www.nature.com/srep/2013/130207/srep01236/full/srep01236.html

Introduction The purpose of this paper is to introduce a new method for the qualitative analysis, simplification and visualization of high dimensional data sets, as well as the qualitative analysis of functions on these data sets.

Different types of data sets

Creating overlapping bins

Filter function: eccentricity

knn distance with k = 5, 3 intervals, 50% overlap

knn distance with k = 5 3 intervals, 50% overlap [ ( ) ) ] (

knn distance with k = 5, 50% overlap 3 intervals 5 intervals 10 intervals 100 intervals

knn distance with k = 50, 50% overlap 3 intervals 5 intervals 10 intervals 100 intervals

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 0 5 intervals, 50% Overlap

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1 5 intervals, 50% Overlap

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1 20 intervals, 20% Overlap

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1 20 intervals, 50% Overlap

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1 20 intervals, 80% Overlap

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1 20 intervals, 80% Overlap --Balanced

Introduction The purpose of this paper is to introduce a new method for the qualitative analysis, simplification and visualization of high dimensional data sets, as well as the qualitative analysis of functions on these data sets. Ex: 1.) f(x) = ||x|| 2.) g(x1, …, xn-1) = xn 3.) DSGA decomposition of the original tumor vector into the Normal component its linear models fit onto the Healthy State Model and the Disease component vector of residuals. http://www.pnas.org/content/early/2011/04/07/1102826108

Some quantitative analysis is also possible

3.2.2.2 Insight by Ranked Variables Going back to the Titanic example, the result of the KS-statistic show, that the variable “Sex” is the most strongly related to passengers death. We could generally assume that men conceded the places in lifeboats to women. Furthermore, it is feasible to deduct the subtle reasons of the death of each group. The passengers in group A died because of two reasons: they were man and the cabin class type was low. The passengers in the group B died because they were man. Finally, the passengers in the group C died because they were staying at third class even though most of them were women.

Topological Methods for the Analysis of High Dimensional Data Sets and 3D Object Recognition Singh, Gurjeet; Memoli, Facundo; Carlsson, Gunnar http://diglib.eg.org/handle/10.2312/SPBG.SPBG07.091-100

We propose a method which can be used to reduce high dimensional data sets into simplicial complexes with far fewer points which can capture topological and geometric information at a specified resolution.

We propose a method which can be used to reduce high dimensional data sets into simplicial complexes with far fewer points which can capture topological and geometric information at a specified resolution.

Building blocks for a simplicial complex 0-simplex = vertex = v 1-simplex = edge = {v1, v2} Note that the boundary of this edge is v2 + v1 e v1 v2 2-simplex = triangle = {v1, v2, v3} v2 e2 e1 e3 v1 v3 The building blocks for a simplicial complex consist of zero simplices which are zero dimensional vertices, one simplices which are one-dimensional edges, and 2-simplices which are two dimensional triangles, Note that the boundary of this triangle is the cycle e1 + e2 + e3 = {v1, v2} + {v2, v3} + {v1, v3}

Creating a simplicial complex 1.) Next add 1-dimensional edges (1-simplices). Note: These edges must connect two vertices. I.e., the boundary of an edge is two vertices

Creating a simplicial complex 1.) Next add 1-dimensional edges (1-simplices). Note: These edges must connect two vertices. I.e., the boundary of an edge is two vertices

Creating a simplicial complex In step n, we can add n-simplices, but only if its boundary, a sum of (n-1) dimensional simplices was created in the previous step, the n-1 step where we added n-1 - simplices. In our example, the highest dimensional simplex is a 3-simplex, this solid tetrahedron, so we have a 3-dimensional simplicial complex. This partitioning of an object into 0-simplices, 1-simplices, 2-simplices, etc. is called a triangulation of the object. An object is called a simplicial complex if it has a triangulation. Let us now triangulate a couple of familiar examples. n.) Add n-dimensional n-simplices, {v1, v2, …, vn+1}. Boundary of a n-simplex = a cycle consisting of (n-1)-simplices.

Example: Triangulating the disk. disk = { x in R2 : ||x || ≤ 1 } = Recall that a triangle is topologically equivalent to a circular disk. So as long as I have 3 edges forming a triangle, I can add a 2-dimensional face.

Example: Triangulating the circle. disk = { x in R2 : ||x || ≤ 1 } I now get to punch my topological triangle in order to form the bottom hemisphere. Fist image from http://openclipart.org/detail/1000/a-raised-fist-by-liftarn

Example: Triangulating the sphere. sphere = { x in R3 : ||x || = 1 } And finally, we fill in the topological triangles with two dimensional faces, We fill in this front face with a triangle. We fill in this side face with the triangle, as well as this back face. The bottom hemisphere is also a topological triangle. It’s boundary consists of three edges so we can also fill it in with a face. In case you don’t see that,

Triangulation of a torus http://mangrovetds.sourceforge.net/documentation/html/torus_complex.png

Distance Matrix Eigenvector, Mean Centered Distance Matrix Order of eigenvector: 1 20 intervals, 80% Overlap

http://www.math.wsu.edu/math/faculty/bkrishna/AbsAlgTopo_PSB2016/Slides_Topology.pdf

Create overlapping bins: http://www.nature.com/srep/2013/130207/srep01236/full/srep01236.html

Create overlapping bins: http://www.nature.com/srep/2013/130207/srep01236/full/srep01236.html

Which choice refers to resolution? We propose a method which can be used to reduce high dimensional data sets into simplicial complexes with far fewer points which can capture topological and geometric information at a specified resolution. Resolution means ???? Which choice refers to resolution?

The idea is to provide another tool for a generalized notion of coordinatization for high dimensional data sets. Coordinatization can of course refer to a choice of real valued coordinate functions on a data set,

Dimensionality Reduction: Given dataset D RN Want: embedding f: D  Rn where n << N which “preserves” the structure of the data. Many reduction methods: f1: D  R, f2: D  R, … fn: D  R (f1, f2, … fn): D  Rn Many are linear, M: RN  Rn, Mx = y But there are also non-linear dimensionality reduction algorithms. U

Example: Principle component analysis (PCA) http://en.wikipedia.org/wiki/File:GaussianScatterPCA.png

https://en.wikipedia.org/wiki/Nonlinear_dimensionality_reduction

f1: D  R, f2: D  R, … fn: D  R (f1, f2, … fn): D  Rn

f: D  S1 which “preserves” the structure of the data. Goal f: D  S1 which “preserves” the structure of the data. circle courtesy of knotplot.com

circle courtesy of knotplot.com

The idea is to provide another tool for a generalized notion of coordinatization for high dimensional data sets. Coordinatization can of course refer to a choice of real valued coordinate functions on a data set, but other notions of geometric representation (e.g., the Reeb graph [Ree46]) are often useful and reflect interesting information more directly.

https://en.wikipedia.org/wiki/Reeb_graph http://www.nature.com/srep/2013/130207/srep01236/full/srep01236.html