Copy folder (and subfolders) F:\sarah\linkage2. Linkage in Mx Sarah Medland.

Slides:



Advertisements
Similar presentations
Bivariate analysis HGEN619 class 2007.
Advertisements

Gene by environment effects. Elevated Plus Maze (anxiety)
Multivariate Mx Exercise D Posthuma Files: \\danielle\Multivariate.
:NEUROPSYCHIATRIC GENETICS [BIOSTATISTICS|BIOINFORMATICS] CORE BIOSTATISTIC/BIOINFORMATIC TOOLS FOR GENETICS DATA: DATA MANAGEMENT AND ANALYSIS RICHARD.
(Re)introduction to Mx Sarah Medland. KiwiChinese Gooseberry.
Extended sibships Danielle Posthuma Kate Morley Files: \\danielle\ExtSibs.
(Re)introduction to Mx. Starting at the beginning Data preparation Mx expects 1 line per case/family Almost limitless number of families and variables.
Introduction to Linkage
Viewpoint Plotting Program for Linkage Results Harry Beeby & Sarah Medland Queensland Institute of Medical Research.
Univariate Analysis Hermine Maes TC19 March 2006.
Mx Practical TC18, 2005 Dorret Boomsma, Nick Martin, Hermine H. Maes.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
Viewdist Plotting Program for quick visualisation of distributions Harry Beeby & Sarah Medland Queensland Institute of Medical Research.
Slide 1 Analyzing Patterns of Missing Data While SPSS contains a rich set of procedures for analyzing patterns of missing data, they are not included in.
Power and Sample Size Adapted from: Boulder 2004 Benjamin Neale Shaun Purcell I HAVE THE POWER!!!
Linkage Analysis in Merlin
Statistical Power Calculations Boulder, 2007 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Karri Silventoinen University of Helsinki Osaka University.
Copy the folder… Faculty/Sarah/Tues_merlin to the C Drive C:/Tues_merlin.
Karri Silventoinen University of Helsinki Osaka University.
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
F:\sarah\fri_MV. Multivariate Linkage and Association Sarah Medland and Manuel Ferreira -with heavy borrowing from Kate Morley and Frühling Rijsdijk.
Linkage in selected samples Manuel Ferreira QIMR Boulder Advanced Course 2005.
Whole genome approaches to quantitative genetics Leuven 2008.
Power and Sample Size Boulder 2004 Benjamin Neale Shaun Purcell.
Power of linkage analysis Egmond, 2006 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
The importance of the “Means Model” in Mx for modeling regression and association Dorret Boomsma, Nick Martin Boulder 2008.
Regression-Based Linkage Analysis of General Pedigrees Pak Sham, Shaun Purcell, Stacey Cherny, Gonçalo Abecasis.
Combined Linkage and Association in Mx Hermine Maes Kate Morley Dorret Boomsma Nick Martin Meike Bartels Boulder 2009.
Univariate Analysis Hermine Maes TC21 March 2008.
Linkage and association Sarah Medland. Genotypic similarity between relatives IBS Alleles shared Identical By State “look the same”, may have the same.
Errors in Genetic Data Gonçalo Abecasis. Errors in Genetic Data Pedigree Errors Genotyping Errors Phenotyping Errors.
Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.
Meetings this summer June 3-6: Behavior Genetics Association (Amsterdam, The Netherlands, see: June 8-10: Int. Society Twin Studies (Ghent,
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
Family Based Association Danielle Posthuma Stacey Cherny TC18-Boulder 2005.
Means, Thresholds and Moderation Sarah Medland – Boulder 2008 Corrected Version Thanks to Hongyan Du for pointing out the error on the regression examples.
Powerful Regression-based Quantitative Trait Linkage Analysis of General Pedigrees Pak Sham, Shaun Purcell, Stacey Cherny, Gonçalo Abecasis.
Mx Practical TC20, 2007 Hermine H. Maes Nick Martin, Dorret Boomsma.
David M. Evans Multivariate QTL Linkage Analysis Queensland Institute of Medical Research Brisbane Australia Twin Workshop Boulder 2003.
Using Merlin in Rheumatoid Arthritis Analyses Wei V. Chen 05/05/2004.
Introduction to Genetic Theory
Welcome  Log on using the username and password you received at registration  Copy the folder: F:/sarah/mon-morning To your H drive.
Linkage in Mx & Merlin Meike Bartels Kate Morley Hermine Maes Based on Posthuma et al., Boulder & Egmond.
Efficient calculation of empirical p- values for genome wide linkage through weighted mixtures Sarah E Medland, Eric J Schmitt, Bradley T Webb, Po-Hsiu.
More on thresholds Sarah Medland. A plug for OpenMx? Very few packages can handle ordinal data adequately… OpenMx can also be used for more than just.
QTL Mapping Using Mx Michael C Neale Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University.
Genetic mapping and QTL analysis - JoinMap and QTLNetwork -
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Regression Models for Linkage: Merlin Regress
Ordinal Data Sarah Medland.
Introduction to Multivariate Genetic Analysis
Linkage and Association in Mx
Re-introduction to openMx
Genome Wide Association Studies using SNP
Ordinal data, matrix algebra & factor analysis
Can resemblance (e.g. correlations) between sib pairs, or DZ twins, be modeled as a function of DNA marker sharing at a particular chromosomal location?
Univariate modeling Sarah Medland.
Regression-based linkage analysis
Remark Exam-Grading Instructions
Linkage in Selected Samples
(Re)introduction to Mx Sarah Medland
Sarah Medland faculty/sarah/2018/Tuesday
More on thresholds Sarah Medland.
Multivariate Linkage Continued
Lecture 9: QTL Mapping II: Outbred Populations
Univariate Linkage in Mx
Power Calculation Practical
Multivariate Genetic Analysis: Introduction
Presentation transcript:

Copy folder (and subfolders) F:\sarah\linkage2

Linkage in Mx Sarah Medland

In this session: Preparing your data for variance components linkage in Mx Showing the progression to linkage via path models Running linkage in Mx  Introducing the loop function  Sibship size 5  Efficient algebra

From Merlin to Mx Merlin gives us… Pihats Phenotypes Mx wants something like…

From Merlin to Mx Different ways to go about this  Shell or Perl scripts in Unix/Linux   Alsort (for pairwise data)   SPSS, SAS, R etc 

Alsort.exe Takes an all-possible pairs approach rather than a full sib-ship approach  If a family has a sibship of 2 then 1 pair  If a family has a sibship of 3 then 3 pairs  If a family has a sibship of 4 then 6 pairs You can run alsort and then convert to a full sib-ship approach

Alsort.exe – to go through at home Usage: alsort [-vfpm] [-c] [-i] [-t] [-x...] -v Verbose (implies -vfpm) -vf Print family ID list -vp Print marker positions -vm Print missing p-values -c Create output file per chromosome -i Include 'self' values (id1=id2) -x Exclude list; id-values separated by spaces -t Write tab as separator character Thanks to Danielle Posthuma for this slide and this practical example

Practical Alsort.exe There is a copy of alsort in the folder: pract-alsort Open a dos prompt, go to the directory where alsort.exe is and type Alsort test.ibd sorted.txt –c –x –t Thanks to Danielle Posthuma for this slide and this practical example

Once you have your data… Estimates of QTL effects can easily be incorporated into ACE/ADE models ‘Simple’ extension of the path model and script

ACE path model A+C+E | A+C+E ; Where z =.5/1

ACQE path model A+C+Q+E | A+C+Q+E ; Where z =.5/1 & p =

Test for linkage Drop Q from the model Note  although you will have to run your linkage analysis model many times the fit of the sub-model (or base-model) will always remain the same  So… run it once and use the command option sub=,

Using MZ twins in linkage MZ pairs that are incorrectly modelled lead to spurious results

Using MZ twins in linkage An MZ pair will not contribute to your linkage signal  BUT correctly including MZ twins in your model allows you to partition A and C or A and E  AND if the MZ pair have a (non-MZ) sibling the ‘MZ-trio’ contributes more information than a regular (DZ) sibling pair – but less than a ‘DZ-trio’

Running linkage Multiple jobs  Run ‘at the Marker’  Run ‘at a Grid’ Approx 3500 cM Every 1/2/5/ cM? So… we want the analyses to be efficent

Running a loop Pre-prepare your data files  One per-chromosome or one per-marker Include a loop function in your mx script to select the data you want to analyse

Running a loop (page 52) At the top of the loop  #loop $ start stop increment #loop $file Within the loop  Rectangular file =grid$file`.rec Rectangular file =grid39.rec At the end of the loop  #endloop

Practical Example Absolute Finger Ridge Count Sum of total number of ridges between the triradii (deltas) and cores of all ten fingers

Practical Example Australian adolescent twins and sibs Sibships of up to 5 siblings Two 10cM genome scans interleaved = Approx 5cM scan

Covariance structure It is important to correctly structure the coefficients for the QTL variance component abcde a1 b1 c1 d1 e1

Covariance structure In the Mx script: cov_loop_linkage.mx

Covariance structure In the Mx script: cov_loop_linkage.mx

Your task… You have been given data for a section of a chromosome Edit the loop in the script cov_loop_linkage.mx Based on where you are sitting Run linkage for your assigned grids

Huh?!? Front of the room Grid 39 – 43 Which maps to cM Grid 44 – 48 Which maps to cM Grid 49 – 53 Which maps to cM Grid 54 – 59 Which maps to cM

Your task… Edit the loop in the script cov_loop_linkage.mx Based on where you are sitting Run linkage for your assigned grids Note the change in chi-square for each grid Come and add your results to the graph on the excel sheet up the front

One I prepared earlier…

LOD=(Univariate)Δ χ²/4.67

Significance What is the p-value at 250cM (Grid 51)?  Δ χ² = , LOD = 2.74  Mixture distribution (MORE on this next year) ½ point mass zero ½ χ² df=1 Called  From a practical point of view divide the p by 2  So…p=  Lander and Kruglyak LOD = 3.6, Chi-sq = 16.7, P =

Alternatively Estimate significance taking into account the informative-ness of your sample and your genotypic data  Simulate 1,000-10,000 null replicates  MORE on this next year

From a practical point of view Speed is important!!! Have a look at kron_loop_linkage.mx

Covariance structure In the Mx script: kron_loop_linkage.mx

From a practical point of view Speed is important!!! Have a look at kron_loop_linkage.mx Run it – how does it compare  In terms of speed?  Do you get the same results?

For more on these topics Come back next year or in the meantime read the recommended readings for next year