A Similarity Analysis of Curves: A Comparison of the Distribution of Gangliosides in Brains of Old and Young Rats. Yolanda Munoz Maldonado Department of.

Slides:



Advertisements
Similar presentations
Yinyin Yuan and Chang-Tsun Li Computer Science Department
Advertisements

CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Chapter 10 Curve Fitting and Regression Analysis
Multiple Regression. Outline Purpose and logic : page 3 Purpose and logic : page 3 Parameters estimation : page 9 Parameters estimation : page 9 R-square.
Objectives (BPS chapter 24)
1 Chapter 4 Interpolation and Approximation Lagrange Interpolation The basic interpolation problem can be posed in one of two ways: The basic interpolation.
Environmental Data Analysis with MatLab Lecture 23: Hypothesis Testing continued; F-Tests.
Chapter 13 Multiple Regression
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
CF-3 Bank Hapoalim Jun-2001 Zvi Wiener Computational Finance.
Chapter 12 Multiple Regression
1 Chapter 3 Multiple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Correlation Patterns. Correlation Coefficient A statistical measure of the covariation or association between two variables. Are dollar sales.
Lecture 6 Notes Note: I will homework 2 tonight. It will be due next Thursday. The Multiple Linear Regression model (Chapter 4.1) Inferences from.
Chapter 11 Multiple Regression.
Simple Linear Regression Analysis
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. by Lale Yurttas, Texas A&M University Chapter 171 CURVE.
5-3 Inference on the Means of Two Populations, Variances Unknown
Lorelei Howard and Nick Wright MfD 2008
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Chapter 15 Nonparametric Statistics
CpE- 310B Engineering Computation and Simulation Dr. Manal Al-Bzoor
Correlation and Regression
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Chapter 11 Simple Regression
Statistics Pooled Examples.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Correlation.
Essential Statistics in Biology: Getting the Numbers Right
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 4 Curve Fitting.
V. Space Curves Types of curves Explicit Implicit Parametric.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Class 4 Simple Linear Regression. Regression Analysis Reality is thought to behave in a manner which may be simulated (predicted) to an acceptable degree.
Chapter 8 Curve Fitting.
Model Comparison for Tree Resin Dose Effect On Termites Lianfen Qian Florida Atlantic University Co-author: Soyoung Ryu, University of Washington.
1 10 Statistical Inference for Two Samples 10-1 Inference on the Difference in Means of Two Normal Distributions, Variances Known Hypothesis tests.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
BIOSYST-MeBioSwww.biw.kuleuven.be The potential of Functional Data Analysis for Chemometrics Dirk De Becker, Wouter Saeys, Bart De Ketelaere and Paul Darius.
1 Estimating the Term Structure of Interest Rates for Thai Government Bonds: A B-Spline Approach Kant Thamchamrassri February 5, 2006 Nonparametric Econometrics.
1 Objective Compare of two population variances using two samples from each population. Hypothesis Tests and Confidence Intervals of two variances use.
Linear correlation and linear regression + summary of tests Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Correlation & Regression Analysis
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
Continuous Representations of Time Gene Expression Data Ziv Bar-Joseph, Georg Gerber, David K. Gifford MIT Laboratory for Computer Science J. Comput. Biol.,10, ,
Math 4030 Final Exam Review. Probability (Continuous) Definition of pdf (axioms, finding k) Cdf and probability (integration) Mean and variance (short-cut.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Principal Warps: Thin-Plate Splines and the Decomposition of Deformations 김진욱 ( 이동통신망연구실 ; 박천현 (3D 모델링 및 처리연구실 ;
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Estimating standard error using bootstrap
Chapter 4 Basic Estimation Techniques
Part 5 - Chapter
Part 5 - Chapter 17.
Fundamentals of regression analysis
Part Three. Data Analysis
Part 5 - Chapter 17.
Statistical Process Control
Product moment correlation
Presentation transcript:

A Similarity Analysis of Curves: A Comparison of the Distribution of Gangliosides in Brains of Old and Young Rats. Yolanda Munoz Maldonado Department of Statistics Texas A&M University E-mail: ymunoz@stat.tamu.edu Dr. Joan Staniswalis Department of Mathematical Sciences University of Texas at El Paso E-mail: joan@math.utep.edu This project was partially supported by RCMI grant 5G12-RR08124 from the National Institute of Health.

Overview Introduction of the Biological Problem Methodology Simulation Data Analysis Summary

Thin Silica Gel Plate

Ganglioside Standards

Standard Curves

Functional Object The intensity of the gangliosides is considered as a function of distance, so the first step in the analysis is to reconstruct the entire profile on a closed interval so that it can be evaluated at any point (Ramsay and Silverman, 1998). Regression splines are used for this purpose (Eubank, 1988).

Regression Splines The sampled curve Y(t), t in G, is interpolated by fitting a linear combination of B-splines . This involves the minimization of over .

Splines A spline of order K with knots , is any function of the form:

B-splines The i th normalized B-spline of order K for the knot sequence is denoted by and satisfy the properties:

Cubic B-Splines

Ganglioside Profiles

Warping Functions The registration of the curves requires: a monotone transformation w for each curve Y(t) such that the registered curves have more or less identical argument values for any of the characteristic features.

Individual Curves

Properties

Warping function The warping functions were estimated using the Penalized Least-Squares Error Criterion by minimizing The minimizer of this is expression is a natural cubic spline (Shoenberg 1946). Since we want to preserve the area under the curve, the registered curve is given by

Warping Functions

Aligned Curves

Similarity Similarity is based upon comparison of the functions evaluated on a common grid G . The index of similarity between two curves uses the Pearson’s sample correlation coefficient.

Test Statistics Three test statistics were considered: The pooled mean similarity within groups: The pooled variance similarity within groups: . 3. The ratio of the pooled-mean to the square-root of the pooled variance:

Permutation Distribution The permutation distribution of each test statistic under the null hypothesis is obtained by permuting the 10 curves, and then dividing them into two groups “old” and “young”. The p-value for the pooled-mean and the ratio is given by the number of permutations which yield a value of the test statistic greater than the observed value. The p-value for the pooled-variance is obtained by the number of permutations which yield a value of the test statistic that is less than the observed value.

Simulation The noisy data were simulated according to is the normal pdf. is generated following a is the vector of the center of the peaks of the original data. is the variance-covariance matrix of these points.

Simulation The follow a chi-square distribution with the following degrees of freedom: 20, 45, 30, 20, 20. The are normally distributed with mean 0 and covariance . The are independent, uniformly distributed coefficients on the intervals: min = ( 0.175, 0.25, .0.2, .0.08, 0.1) max = (0.5 ,0.7, 0.5, 0.417, 0.4)

Simulated Curves under OLD YOUNG

Size of the Test 0.032 * 0.008 0.024 * 0.01 0.068 0.064 0.05 0.108 0.104 0.080 0.1 Ratio Pooled Mean/Variance Pooled Variance Mean Proportion of Rejections Significance Level

Simulation under

Power function at a=0.05

Data Analysis Three data sets are studied: Medulla Locus Coeruleus Hippocampus The last data set was expected to show no differences between old and young rats.

Registered, Cut and Normalized Profiles

Analysis Result 0.333 235.4329 Ratio of Means/Variances 0.706 0.0059 Hippocampus 0.805 0.8347 Mean 0.014 109.815 0.028 0.00987 Locus Coeruleus 0.009 0.736441 0.027 588.8824 0.0024 Medulla 0.046 0.84114 P-value Test Statistic Brain Region

Conclusions The result confirms the biologists expectations of differences in ganglioside concentration in the Medulla region and no difference for Hippocampus. The result for Locus Coeruleus region provides new evidence for a significant development shift in ganglioside pattern.

References Irwin, L.N. (1984). Ontogeny and Phylogeny of vertebrate brain gangliosides. In Ganglioside Structure, Function and Biomedical Potential. New York: Plenum. Edited by Leeden, R.W., Yu, R.K., Rapport, M.M. and Suzuki, K, pp. 319-329. Eubank, Randall (1988). Spline smoothing and nonparametric regression. New York: Marcel Dekker, Inc. Heckman, N. (1997). The Theory and Application of Penalized Least Squares Methods or Reproducing Kernel Hilbert Spaces Made Easy. http://www.stat.ubc.ca/people/nancy. Kimeldorf, G. and Wahba, G. (1971). Some Results on Tchebycheffian Spline Functions. Journal of Mathematical Analysis and Applications, vol. 33, pp.82-95. Kneip, A. and Gasser, T. (1992). Statistical Tools to Analyze Data Representing a Sample of Curves. The Annals of Statistics, Vol. 20, No. 3, pp. 1266-1305. Ramsay, J.O. and Silverman B.W. (1997). Functional Data Analysis. New York: Springer Series in Statistics. Ramsay, J.O. and Silverman B.W. (1998). S-Plus Functions for FDA. http://www.psych.mcgill.ca/faculty/ramsay.