GenABEL: an R package for Genome Wide Association Analysis

Slides:



Advertisements
Similar presentations
Statistical methods for genetic association studies
Advertisements

Population structure.
PLINK: a toolset for whole genome association analysis
BST 775 Lecture PLINK – A Popular Toolset for GWAS
Association Tests for Rare Variants Using Sequence Data
GBS & GWAS using the iPlant Discovery Environment
HW1b: Background information Oct 8 th, 2013 Kyrylo Bessonov 1.
Presented by Qing Duan Dr. Yun Li group UNC at Chapel Hill
Ingredients for a successful genome-wide association studies: A statistical view Scott Weiss and Christoph Lange Channing Laboratory Pulmonary and Critical.
Statistical association of genotype and phenotype.
Association Modeling With iPlant
More Powerful Genome-wide Association Methods for Case-control Data Robert C. Elston, PhD Case Western Reserve University Cleveland Ohio.
:NEUROPSYCHIATRIC GENETICS [BIOSTATISTICS|BIOINFORMATICS] CORE BIOSTATISTIC/BIOINFORMATIC TOOLS FOR GENETICS DATA: DATA MANAGEMENT AND ANALYSIS RICHARD.
Aspects of Genetics and Genomics in Cancer Research Li Hsu Biostatistics and Biomathematics Program Fred Hutchinson Cancer Research Center.
Genome-wide association studies Usman Roshan. SNP Single nucleotide polymorphism Specific position and specific chromosome.
IGES 2003 How many markers are necessary to infer correct familial relationships in follow-up studies? Silvano Presciuttini 1,3, Chiara Toni 2, Fabio Marroni.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Introduction to Linkage Analysis March Stages of Genetic Mapping Are there genes influencing this trait? Epidemiological studies Where are those.
Integrating domain knowledge with statistical and data mining methods for high-density genomic SNP disease association analysis Dinu et al, J. Biomedical.
Using biological networks to search for interacting loci in genome-wide association studies Mathieu Emily et. al. European journal of human genetics, e-pub.
Data Description Tables and Graphs Data Reduction.
Genome-Wide Association Studies
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Design Considerations in Large- Scale Genetic Association Studies Michael Boehnke, Andrew Skol, Laura Scott, Cristen Willer, Gonçalo Abecasis, Anne Jackson,
Analysis of genome-wide association studies
Practical aspects of GWAS
Polymorphism and Variant Analysis Lab
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
ConceptS and Connections
Genetic Variation Influences Glutamate Concentrations in Brains of Patients with Multiple Sclerosis Robby Bonanno.
Population Stratification
Polymorphism & Variant Analysis Lab Saurabh Sinha Polymorphism and Variant Analysis Lab v1 | Saurabh Sinha 1 Powerpoint by Casey Hanson.
Linear Modelling III Richard Mott Wellcome Trust Centre for Human Genetics.
Figure S1. Quantile-quantile plot in –log10 scale for the individual studies The red line represents concordance of observed and expected values. The shaded.
1 Association Analysis of Rare Genetic Variants Qunyuan Zhang Division of Statistical Genomics Course M Computational Statistical Genetics.
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Complex Traits Most neurobehavioral traits are complex Multifactorial
Quantitative Genetics
Overview of developments. Nested Association Mapping (NAM) Jianming Yu, James B. Holland, Michael D. McMullen and Edward S. Buckler, Genetics, Vol. 178,
Jianfeng Xu, M.D., Dr.PH Professor of Public Health and Cancer Biology Director, Program for Genetic and Molecular Epidemiology of Cancer Associate Director,
Future Directions Pak Sham, HKU Boulder Genetics of Complex Traits Quantitative GeneticsGene Mapping Functional Genomics.
Host disease genetics: bovine tuberculosis resistance in
QTL Cartographer A Program Package for finding Quantitative Trait Loci C. J. Basten Z.-B. Zeng and B. S. Weir.
Statistical Issues in Genetic Association Studies
Errors in Genetic Data Gonçalo Abecasis. Errors in Genetic Data Pedigree Errors Genotyping Errors Phenotyping Errors.
PLINK / Haploview Whole genome association software tutorial
C2BAT: Using the same data set for screening and testing. A testing strategy for genome-wide association studies in case/control design Matt McQueen, Jessica.
Resources at HapMap.Org HapMap3 Tutorial Marcela K. Tello-Ruiz Cold Spring Harbor Laboratory.
Why you should know about experimental crosses. To save you from embarrassment.
Linkage. Announcements Problem set 1 is available for download. Due April 14. class videos are available from a link on the schedule web page, and at.
Younghun Han Department of Epidemiology UT MD Anderson Cancer Center
Linkage. Announcements Problem set 1 is available for download. Due April 14. class videos are available from a link on the schedule web page, and at.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL FVGWAS: Fast Voxelwise Genome Wide Association Analysis of Large-scale Imaging Genetic Data Tutorial: pipeline,
Statistical Analysis of Candidate Gene Association Studies (Categorical Traits) of Biallelic Single Nucleotide Polymorphisms Maani Beigy MD-MPH Student.
Genetic mapping and QTL analysis - JoinMap and QTLNetwork -
Population stratification
Quantitative genetics
Genome Wide Association Studies Zhiwu Zhang Washington State University.
Development of an interactive pipeline for Genome wide association analysis Falola Damilare & Adigun Taiwo – Covenant University Bioinformatics research.
upstream vs. ORF binding and gene expression?
Multiple testing etc. Benjamin Neale Leuven 2008.
Bioinformatics challenges for “genome-wide association” studies
Genome Wide Association Studies using SNP
R Packages for Genome-Wide Association Studies
Population stratification
Introduction to Data Formats and tools
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
GENOME WIDE ASSOCIATION STUDIES (GWAS)
Presentation transcript:

GenABEL: an R package for Genome Wide Association Analysis Younghun Han Department of Epidemiology UT MD Anderson Cancer Center

Outline Introduction GeneABEL: GWAA.data class Importing data to GenABEL Genetic data QC GWA association analysis

Introduction GenABEL is an R library developed to facilitate Genome-Wide Association analysis of binary and quantitative traits. Features of GenABEL : specific facilities for storage and manipulation of large data QC Maximum Likelihood estimation of linear, logistic and Cox regression on Genome-wide scale Specific functions to analyze and display the results

GeneABEL: GWAA.data class

GWAA.data class >library(GenABEL) >load(“lung2291.Rdata”)

GWAA.data class

Importing data to GenABEL Need a phenotypic and genotypic data Example of a phenotype file : Example of a genotypic data (PLINK tped files) TPED- file TFAM -file

Importing data to GenABEL Convert the data to GenABEL raw format : > convert.snp.affymetrix() > convert.snp.illumina() > convert.snp.mach() > convert.snp.ped() > convert.snp.text() > convert.snp.tped() Load the data into GenABEL >load.gwaa.data() Example : > convert.snp.tped(tped="lung2291.tped", tfam="lung2291.tfam",out="lung2291.raw", strand=“u") > lung2291<- load.gwaa.data(phe="pheno.txt", gen="lung2291.raw",force = T)

Genetic data QC summary.snp.data() : Number of observed genotypes, allelic frequency, genotypic distrbution, P-value of the exact test for HWE check.trait() : summary of phenotypic data check for outliers (using False Discovery Rate framework) and plots the raw data check.marker() : The major genetic data QC function of GenABEL HWE.show() : showing HWE tables, Chi2 and exact HWE P-values perid.summary() : call rate and heterozygosity per person ibs() : matrix of average IBS for a group of people hom() : average homozygosity (inbreeding) for a set of people, across multiple markers EXAMPLE of QC

GWA association analysis Descriptives of the phenotypic and marker data : descriptives.trait(lung2291) descriptives.trait(lung2291, by=case_control) descriptives.marker(lung2291) descriptives.marker(lung2291,ids=(case_control==0)) Score test an0 <- qtscore(case_control, data=lung2291, trait = "binomial") an1 <- qtscore(case_control~sex, data=lung2291, trait = "binomial") Chi-squre test for binary trait an2 <- ccfast("case_control", data=lung2291) SNP association test using glm in R library Scan.glm(“case_control~sex+CRSNP”, family=binomial) Scan.glm(“case_control~sex*CRSNP”, family=binomial) # no G*E test Scan.glm.2D(“case_control~sex+CRSNP”, family=binomial) # 2-snp interaction scan Note : formula must contain CRSNP variable to be replaced with the analysis SNPs

GWA association analysis GWAS results from qtscore, ccfast, scan.glm P1df : P-values of 1-d.f. (additive or allelic) test P2df : P-values of 2-d.f. (genotypic) test for association Pc1df : P-values of 1-d.f. test: the statistics is corrected for possible inflation effB: Effect of the B allele(second allele from coding) in allelic test (OR for ccfast) effAB : Effect of the AB effBB : Effect of the BB map : list of map positions of the SNPs chromosome : list of chromosomes the SNPs belong to idnames : list of people used in analysis lambda : inflation factor estimate formula : formula/function call was used to compute P-values family : family of the link function

Example of descriptives.trait() and descriptives.marker()

Example of qtscore() and ccfast()

Example of scan.glm() and scan.glm.2D()

Thank you!!!