Multivariate Linkage Continued


Sarah Medland, Queensland Institute of Medical Research

Running a loop: An alternative method for running all markers is to run a ‘repeat’ script (multi_repeat.mx). Start with #repeat n, where n is the number of times the loop will run. Use ‘exit’ at the end of your last group. End the script with #end repeat. The number of the current repeat, i, can be referred to using $repeat_number.

Results: change in chi-square

Probability: Calculating the p-value can be problematic for MV (multivariate) linkage. Start with the univariate case (1 QTL estimate).

Univariate linkage (1 QTL estimate): Under standard conditions, twice the difference in natural log-likelihood between models is asymptotically distributed as χ² with degrees of freedom equal to the difference in the number of parameters between the models. BUT in linkage analysis, the likelihood ratio test is conducted under non-standard conditions: the true values of some of the parameters under the null hypothesis (i.e. σ²_q = 0) lie on the boundary of the parameter space defined by the alternative hypothesis. Under these conditions, the likelihood ratio statistic is distributed as a mixture of χ² distributions, with the mixing proportions determined by the geometry of the parameter space. For example, in a univariate VC linkage analysis, the test statistic is asymptotically distributed as a 50:50 mixture of χ²₁ and a point mass at zero (Self & Liang, 1987).

In practical terms… Univariate linkage: 50:50 mixture of 0 and χ²₁ (χ² on 1 df). Calculate the p-value for χ²₁ and divide by 2. LOD score = Δχ²/4.6
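The univariate rule above can be sketched in a few lines of Python. This is a minimal illustration, not part of the workshop scripts: the function names are made up here, and the constant 4.6 is taken to be 2·ln(10) ≈ 4.605, which is what converts a chi-square difference into a base-10 LOD score.

```python
import math

# 2 * ln(10) is the "4.6" used to turn a chi-square difference into a LOD score.
TWO_LN10 = 2.0 * math.log(10.0)


def chi2_sf_1df(x):
    """Upper-tail probability P(chi-square on 1 df >= x), for x >= 0."""
    return math.erfc(math.sqrt(x / 2.0))


def univariate_p(delta_chi2):
    """P-value under the 50:50 mixture of a point mass at zero and chi-square(1)."""
    if delta_chi2 <= 0:
        # The whole mixture lies at or above zero.
        return 1.0
    return 0.5 * chi2_sf_1df(delta_chi2)


def lod_score(delta_chi2):
    """LOD score = delta chi-square / (2 ln 10)."""
    return delta_chi2 / TWO_LN10


if __name__ == "__main__":
    for d in (1.0, 3.84, 13.8):
        print(f"delta chi2 = {d:5.2f}  p = {univariate_p(d):.5f}  LOD = {lod_score(d):.2f}")
```

Only the standard library is used: for 1 df the chi-square tail reduces to the complementary error function, so no statistics package is needed.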

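In practical terms… Bivariate linkage: 25:50:25 mixture of 0, χ²₁ and χ²₂. Calculate the p-value as the weighted sum of the tail probabilities: 0.5 × P(χ²₁ ≥ Δχ²) + 0.25 × P(χ²₂ ≥ Δχ²). The point mass at zero contributes nothing when Δχ² > 0.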
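A hedged sketch of the bivariate calculation, assuming the 25:50:25 mixing proportions stated on the slide. The function names are illustrative only. Both tail probabilities have closed forms, so the standard library is enough.

```python
import math


def chi2_sf_1df(x):
    """Upper-tail probability of a chi-square on 1 df, for x >= 0."""
    return math.erfc(math.sqrt(x / 2.0))


def chi2_sf_2df(x):
    """Upper-tail probability of a chi-square on 2 df, for x >= 0."""
    return math.exp(-x / 2.0)


def bivariate_p(delta_chi2):
    """P-value under a 25:50:25 mixture of a point mass at zero,
    chi-square(1) and chi-square(2)."""
    if delta_chi2 <= 0:
        return 1.0
    # The point mass at zero never exceeds a positive statistic,
    # so only the chi-square(1) and chi-square(2) terms remain.
    return 0.5 * chi2_sf_1df(delta_chi2) + 0.25 * chi2_sf_2df(delta_chi2)


if __name__ == "__main__":
    for d in (2.0, 5.0, 10.0):
        print(f"delta chi2 = {d:5.2f}  p = {bivariate_p(d):.5f}")
```

For the same Δχ², this p-value is larger than the univariate one, because part of the weight moves to the heavier-tailed χ²₂ component.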

3+: The mixture starts becoming very complex. It begins to approach …, where q is the number of QTL parameters estimated (Marlow et al., 2003). Simulation is the best approach. Alternatively, one can use … but this will be a conservative test.

Graphical representation: Either plot -log10(p), or back-convert the p-value to a chi-square on 1 df and compute the LOD score as Δχ²/4.6. Graph the p-values.
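Both plotting quantities can be computed from a p-value as sketched below. This assumes the back-conversion inverts a plain χ² on 1 df, as the slide words it; whether the 50:50 mixture should be inverted instead is left open here. The function names are illustrative.

```python
import math
from statistics import NormalDist

# 2 * ln(10), the "4.6" in LOD = delta chi-square / 4.6.
TWO_LN10 = 2.0 * math.log(10.0)


def neg_log10_p(p):
    """-log10(p), the first plotting option."""
    return -math.log10(p)


def chi2_1df_from_p(p):
    """Back-convert a p-value to the chi-square (1 df) value x
    satisfying P(chi-square(1) >= x) = p."""
    if not 0.0 < p <= 1.0:
        raise ValueError("p must be in (0, 1]")
    # chi-square(1) is a squared standard normal, so use the
    # lower-tail normal quantile at p/2 and square it.
    z = NormalDist().inv_cdf(p / 2.0)
    return z * z


def lod_from_p(p):
    """LOD score implied by the back-converted chi-square."""
    return chi2_1df_from_p(p) / TWO_LN10


if __name__ == "__main__":
    for p in (0.05, 0.001, 0.00002):
        print(f"p = {p:<8g} -log10(p) = {neg_log10_p(p):.2f}  LOD = {lod_from_p(p):.2f}")
```

Back-converting puts univariate and multivariate results on a common LOD-like scale, which is what makes them comparable on one graph.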

Viewpoint (Harry Beeby): Graph the linkage results using viewpoint. Open by double clicking. Go to File and open the file uni-graph.txt. Choose a univariate plot. Go to Edit - select plotted columns, and select -log(10)p & backconvert. Go to Edit - line attributes to change colours.

Viewpoint: A nested model was run in which the proportion of QTL variance was equated at each time point. The result is called ‘LOD1_parameter’; add it back into the graph. Does the equated model ‘perform’ better than the full model? Why?/Why not?

Check the path coefficients: Explore the linkage results using viewpoint. Go to File and open the file multi-graph.txt. Choose a multivariate plot. Explore the path coefficients. Compare to the multivariate graph of demo-prints.txt. More information about viewpoint is in viewpoint.ppt.

Viewdist: Find the families that contribute the most and the least to the LOD score using viewdist. Input data will be the %p files from the null and linkage models (marker 58). Open by double clicking. Go to File and open a difference file. The first file to read in is null.p. The second is marker58.p.

Viewdist: Choose an ‘internal plot’. Define column mappings. Graph column 2. Do not change other defaults. Find the 3 ‘highest’ families and look to see if they are outliers at marker58. Open file marker58.p. Choose a ‘normal plot’. Graph column 4.

Viewdist, conclusion: Are they outliers? If so, what would this mean? If so, you may want to rerun the linkage in this region excluding these families. More information about viewdist is in viewdist.ppt.
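The idea behind the viewdist exercise, ranking families by their contribution to the LOD score, can be sketched as follows. This is an illustration under stated assumptions, not a reader for the actual %p files: it assumes per-family -2lnL values from the null and linkage models have already been loaded into dictionaries keyed by family ID, and all names and numbers below are hypothetical.

```python
import math

# 2 * ln(10), the "4.6" in LOD = delta chi-square / 4.6.
TWO_LN10 = 2.0 * math.log(10.0)


def family_lod_contributions(null_m2ll, linkage_m2ll):
    """Per-family LOD contribution: (null -2lnL - linkage -2lnL) / (2 ln 10).

    Both arguments map a family ID to that family's -2lnL (hypothetical
    input layout). Families missing from either model are skipped.
    """
    return {
        fam: (null_m2ll[fam] - linkage_m2ll[fam]) / TWO_LN10
        for fam in null_m2ll
        if fam in linkage_m2ll
    }


def top_families(contributions, k=3):
    """Family IDs with the k largest contributions, largest first."""
    return sorted(contributions, key=contributions.get, reverse=True)[:k]


if __name__ == "__main__":
    # Made-up values for illustration only.
    null_fit = {"F1": 20.0, "F2": 15.0, "F3": 30.0, "F4": 12.0}
    linkage_fit = {"F1": 15.4, "F2": 15.5, "F3": 20.8, "F4": 11.0}
    contrib = family_lod_contributions(null_fit, linkage_fit)
    for fam in top_families(contrib):
        print(fam, round(contrib[fam], 2))
```

Because the total Δχ² is the sum of the per-family differences, a handful of families with very large contributions can drive a linkage peak, which is why they are worth checking as outliers.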