Spanish Inquisition Final Project Week 4 - 5/21/09
Breast Cancer Gene Expression Data
Leon Kay, Yan Tran, Chris Thomas

Cluster Analysis - SAM
Refined clusters using TMeV's SAM statistical analysis.
Significance Analysis of Microarrays (SAM):
- determines whether changes in gene expression are statistically significant
- identifies statistically significant genes by measuring the strength of the relationship between gene expression and a response variable
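As a rough illustration of the kind of statistic SAM ranks genes by, the sketch below computes the two-class relative difference d = (mean1 - mean2) / (s + s0) for a single gene. It assumes a two-class unpaired comparison, which the slide does not confirm. The function name, the s0 values, and the toy expression values are hypothetical, and the permutation step SAM uses to decide significance is left out.

```python
from math import sqrt


def sam_relative_difference(group1, group2, s0=0.0):
    """Relative difference d for one gene across two sample groups.

    s0 is the small positive "fudge factor" added to the denominator so
    that genes with very low variance do not dominate the ranking. It is
    passed in by hand here (an assumption made for this sketch).
    """
    n1, n2 = len(group1), len(group2)
    mean1 = sum(group1) / n1
    mean2 = sum(group2) / n2
    # Pooled within-group sum of squares.
    ss = sum((x - mean1) ** 2 for x in group1) + sum((x - mean2) ** 2 for x in group2)
    # Gene-specific scatter (standard error of the mean difference).
    s = sqrt((1 / n1 + 1 / n2) * ss / (n1 + n2 - 2))
    return (mean1 - mean2) / (s + s0)


# Hypothetical expression values for one gene in two patient groups.
d_plain = sam_relative_difference([5, 6, 7], [1, 2, 3])
d_damped = sam_relative_difference([5, 6, 7], [1, 2, 3], s0=1.0)
print(round(d_plain, 3), round(d_damped, 3))
```

A larger s0 shrinks d for every gene, but it shrinks it most for genes whose scatter s is small, which is the reason the term is there.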

MeV SAM Analysis - Results
- Creation of SAM file: used Excel 2007 to manually create the SAM load file.
- SAM splits the 1544 total genes into 265 significant genes and 1279 non-significant genes.
- SAM analysis reduces the number of genes to 17% of the original total.

MeV SAM Analysis – Significant Genes Graph

MeV SAM Analysis – Non-significant Genes Graph

Kaplan-Meier Survival Analysis
- Used to estimate the overall likelihood of survival, given a set of lifetime data.
- Generated using the Excel plug-in (thanks, Sri!).
- A plot of the Kaplan-Meier estimate of the survival function is a series of horizontal steps of declining magnitude which, when a large enough sample is taken, approaches the true survival function for that population.
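The step-shaped estimate described above can be sketched in a few lines. This is not the Excel plug-in's code. It is a minimal version of the product-limit formula S(t) = product over event times t_i <= t of (1 - d_i / n_i), where d_i is the number of events at t_i and n_i the number of patients still at risk. The function name and the toy follow-up data are hypothetical.

```python
from collections import Counter


def kaplan_meier(durations, observed):
    """Return [(time, survival_probability)] at each time an event occurs.

    durations: follow-up time for each patient.
    observed:  True if the event (e.g. death) was seen at that time,
               False if the patient was censored (lost to follow-up).
    """
    events = Counter(t for t, seen in zip(durations, observed) if seen)
    exits = Counter(durations)  # events plus censored patients at each time
    at_risk = len(durations)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = events.get(t, 0)
        if d:
            # Each event time multiplies the curve by the fraction surviving.
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        # Everyone who left at time t is no longer at risk afterwards.
        at_risk -= exits[t]
    return curve


# Hypothetical data: five patients, two of them censored.
curve = kaplan_meier([1, 2, 2, 3, 4], [True, True, False, True, False])
for time, prob in curve:
    print(time, round(prob, 3))
```

Censored patients never produce a step of their own. They only reduce the number at risk, which makes later steps larger.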

Survival Analysis – Breast Cancer Type

Survival Analysis - Overall

Relapse Probability
- 30 out of 270 patients relapsed. Only 270 patients in the clinical data have relapse status recorded one way or the other.
- This gives a relapse rate of 0.1111, or 11.11%.
- At the 99% confidence level, the probability of relapse is 11.11% +/- 4.9%, for a min and max range of (6.21%, 16.01%).
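The relapse figures can be checked with a short calculation. The slide does not say which interval method was used. The sketch below assumes the usual normal-approximation (Wald) interval, p +/- z * sqrt(p(1 - p) / n), and the function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def proportion_confidence_interval(events, total, confidence=0.99):
    """Normal-approximation (Wald) interval for a proportion (assumed method)."""
    p = events / total
    # Two-sided critical value, e.g. about 2.576 for 99% confidence.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z * sqrt(p * (1 - p) / total)
    return p, margin, (p - margin, p + margin)


rate, margin, (low, high) = proportion_confidence_interval(30, 270)
print(f"{rate:.4f} +/- {margin:.4f} -> ({low:.4f}, {high:.4f})")
```

Under this assumption the unrounded margin is about 4.93%, which matches the slide's 4.9% after rounding. The slide's range of (6.21%, 16.01%) comes from applying the rounded 4.9% to 11.11%.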

Relevance Networks
- The MeV manual states that a “relevance network is a group of genes whose expression profiles are highly predictive of one another.”
- Clusters are drawn as genes connected by lines, showing that each linked pair has a correlation coefficient R^2 within preset thresholds.
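The idea of linking genes whose R^2 falls within preset thresholds can be sketched as follows. This is not MeV's implementation. It assumes Pearson correlation, treats each connected group of linked genes as one network, and uses hypothetical gene names, thresholds, and expression values.

```python
from itertools import combinations
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is flat)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def relevance_networks(profiles, min_r2=0.8, max_r2=1.0):
    """Group genes into networks linked by R^2 within [min_r2, max_r2]."""
    neighbors = {gene: set() for gene in profiles}
    for a, b in combinations(profiles, 2):
        r2 = min(pearson_r(profiles[a], profiles[b]) ** 2, 1.0)  # clamp rounding error
        if min_r2 <= r2 <= max_r2:
            neighbors[a].add(b)
            neighbors[b].add(a)
    networks, seen = [], set()
    for gene in profiles:
        if gene in seen or not neighbors[gene]:
            continue  # already placed, or linked to nothing
        component, stack = set(), [gene]
        while stack:  # depth-first walk over the links
            current = stack.pop()
            if current not in component:
                component.add(current)
                stack.extend(neighbors[current] - component)
        seen |= component
        networks.append(sorted(component))
    return networks


# Hypothetical expression profiles across five samples.
profiles = {
    "gene_a": [1, 2, 3, 4, 5],
    "gene_b": [2, 4, 6, 8, 11],
    "gene_c": [5, 1, 4, 2, 3],
    "gene_d": [1, 5, 2, 4, 3],
}
print(relevance_networks(profiles, min_r2=0.8))
```

Because the threshold is on R^2 rather than R, strongly anti-correlated genes such as gene_c and gene_d in the toy data are linked just like positively correlated ones.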

Relevance Networks
The breast cancer data yielded 14 relevance networks.

GATA3
- In week two we mentioned the GATA3 gene, which is linked to estrogen receptor alpha.
- It offers a method for providing prognosis because its expression profile is very different between Basal-like and Luminal tumors.
- Will GATA3 show up as a significant gene after SAM analysis, and will we find the gene associated with estrogen receptor alpha alongside it?

Relevance Networks
GATA3 and ESR1 are in network 2.
