Classification of tissues and samples 指導老師：藍清隆演講者：張許恩、王人禾.

Classification of tissues and samples 指導老師：藍清隆演講者：張許恩、王人禾

introduction The use of microarrays to find group of genes that can be used diagnostically to determine the disease that an individual is suffering from, or prognostically to predict the success of a course of therapy or results of an experiment. Purpose: find a small number of genes that can predict to which group each individual belongs

Data set 9A Bone marrow samples are taken from 27 patients suffering from acute lymphoblastic leukemia(ALL) and 11 patients suffering from acute myeloid leukemia(AML)

Method of classification Describe methods that allow you to predict the class to which an individual belongs, based on gene expression measurements Assume that we have already selected a small number of genes whose expression measurements we use, and are not using all the genes on the microarray

Two concept central to classification Separability and linearity

Separability Separable: the different groups to which the samples belong occupy different regions of the gene expression space Non-Separable: the different groups to which the samples belong are mixed together in the same region of gene expression space

linearity Linearly separable: possible to partition the space between the two (or more) group using straight lines Non- Linearly separable: separable, but are not possible to partition the groups using straight lines

Method of classification 1.K-nearest neighbours 2.Nearest centroid 3.Linear discriminant analysis 4.Neural networks 5.Support vector machines

K-nearest neighbours Steps: 1.We look at the gene expression measurements for the sample we are trying to classify 2.Find the nearest of the known samples as measured by an appropriate distance measure 3.The class of the sample is the class of the nearest sample

K-nearest neighbours 2 parameter: k & l

Centroid classification Steps: 1.For each class, calculate the center of mass of the points of the representative samples 2.Calculate the distance between the position of the sample to be classified and each of the centers of mass of the classes using an appropriate distance measure 3.Assign the sample to the class whose center of mass is nearest to it

Centroid classification

Linear discriminant analysis Steps: 1.Calculate a straight line (in two dimensions) or hyperplane (in more than two dimensions) that separates two known classes so as to minimise the within class variance on either side of the line and maximise the between class variance(figure 9.4) 2.The class of the unknown sample is determined by the side of the hyperplane on which the sample lies.

Linear discriminant analysis

Neural networks Steps: 1.Train the neural network using the samples with known classes 2.Apply the neural network to the new individual to determine it’s class

Neural networks

Support vector machines Steps: 1.Project the data from the known classes into a suitable high-dimensional space 2.Identify a hyperplane that separates two classes 3.The class of the new individual is determined by the side of hyperplane on which the sample lines

Support vector machines

謝謝大家的支持！

Classification of tissues and samples 指導老師：藍清隆演講者：張許恩、王人禾.

Similar presentations

Presentation on theme: "Classification of tissues and samples 指導老師：藍清隆演講者：張許恩、王人禾."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Classification of tissues and samples 指導老師：藍清隆 演講者：張許恩、王人禾.

Similar presentations

Presentation on theme: "Classification of tissues and samples 指導老師：藍清隆 演講者：張許恩、王人禾."— Presentation transcript:

Similar presentations

About project

Feedback

Classification of tissues and samples 指導老師：藍清隆演講者：張許恩、王人禾.

Presentation on theme: "Classification of tissues and samples 指導老師：藍清隆演講者：張許恩、王人禾."— Presentation transcript: