Host Genomics in WIHS  The WIHS GWAS data set  Concept Sheet  Data use agreement  Data transfer  Analytic support.

Slides:



Advertisements
Similar presentations
Global Drug Development To Bridge or Not to Bridge.
Advertisements

Lecture 2 Strachan and Read Chapter 13
Copyright © 2008, SAS Institute Inc. All rights reserved. Discovering Meaningful Patterns in Genomics Data with JMP Genomics Jordan Hiller JMP Genomics.
Analysis of imputed rare variants
CZ5225 Methods in Computational Biology Lecture 9: Pharmacogenetics and individual variation of drug response CZ5225 Methods in Computational Biology.
Why I chose: First reading results seemed counterintuitive Introduction full of references I didn’t know Useful? Or Gee Whizz so what?...Needed to read.
Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant.
Genetic research designs in the real world Vishwajit L Nimgaonkar MD, PhD University of Pittsburgh
CLOUD Surname DNA Project. Genetic Genealogy A Report on The CLOUD DNA Project. 1.Our Data Examined 2.Intro to Genetic Genealogy & DNA Genealogical.
Genetic Basis of Agronomic Traits Connecting Phenotype to Genotype Yu and Buckler (2006); Zhu et al. (2008) Traditional F2 QTL MappingAssociation Mapping.
Objectives Cover some of the essential concepts for GWAS that have not yet been covered Hardy-Weinberg equilibrium Meta-analysis SNP Imputation Review.
Standardization within the consortium Cancer consortia Paolo Boffetta IARC.
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
The Extraction of Single Nucleotide Polymorphisms and the Use of Current Sequencing Tools Stephen Tetreault Department of Mathematics and Computer Science.
FINAL EXAM: TAKE-HOME Assessment of Significance in Cancer Gene SNPs.
Positional Cloning LOD Sib pairs Chromosome Region Association Study Genetics Genomics Physical Mapping/ Sequencing Candidate Gene Selection/ Polymorphism.
SNPs Human Genome. SNP Typing Allele specific hybridization ASO probes usually with the polymorphic base in a central position in the probe sequence.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Selecting TagSNPs in Candidate Genes for Genetic Association Studies Shehnaz K. Hussain, PhD, ScM Assistant Professor Department of Epidemiology, UCLA.
Design Considerations in Large- Scale Genetic Association Studies Michael Boehnke, Andrew Skol, Laura Scott, Cristen Willer, Gonçalo Abecasis, Anne Jackson,
Georgia Wiesner, MD CREC June 20, GATACAATGCATCATATG TATCAGATGCAATATATC ATTGTATCATGTATCATG TATCATGTATCATGTATC ATGTATCATGTCTCCAGA TGCTATGGATCTTATGTA.
Molecular and Genetic Epidemiology Kathryn Penney, ScD January 5, 2012.
Single Nucleotide Polymorphism
Utilizing Science & Technology and Innovation for Development A Genome Wide Association Study for Type 2 Diabetes Susceptibility Gene and Treatment in.
A Primer on Genetic Variation Variety Lawrence Brody - NHGRI.
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
Oat Molecular Toolbox: Toward Better Oats Nick Tinker, 2014-March-5 Agriculture and Agri-Food Canada.
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
CS177 Lecture 10 SNPs and Human Genetic Variation
A Genome-wide association study of Copy number variation in schizophrenia Andrés Ingason CNS Division, deCODE Genetics. Research Institute of Biological.
Gene Hunting: Linkage and Association
From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics
Genome-Wide Association Study (GWAS)
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Polysubstance Abuse–Vulnerability Genes: Genome Scans for Association, Using 1,004 Subjects and 1,494 Single-Nucleotide Polymorphisms Uhl, Liu, Walther,
E XOME SEQUENCING AND COMPLEX DISEASE : practical aspects of rare variant association studies Alice Bouchoms Amaury Vanvinckenroye Maxime Legrand 1.
Lab 13: Association Genetics December 5, Goals Use Mixed Models and General Linear Models to determine genetic associations. Understand the effect.
1 Ethical issues in genomics research Bernard Lo, M.D. March 3, 2009.
SHARe. What is WHI-SHARe? SHARe – stands for SNP Health Association Resource. It is an NHLBI program for genome-wide association studies (GWAS). GWAS.
Gene Mapping ROBERT SANTOS ENGLISH 100 ESP NOVEMBER
Statistical Issues in Genetic Association Studies

George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Select Sires’
FaceBase Kick-Off Meeting Nov 15-16, 2009 Bethesda Oral Clefts: Moving from Genome Wide Studies Toward Functional Genomics TH Beaty for Alan F Scott, Ingo.
The International Consortium. The International HapMap Project.
Use of Data from the Electronic Medical Record in Research Opportunities & Pit Falls Kristin West.
Copyright © 2010 Pearson Education, Inc. publishing as Benjamin Cummings Lectures by Greg Podgorski, Utah State University Current Issues in Biology, Volume.
Lectures 7 – Oct 19, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall.
Supplemental Figure 1. False trans association due to probe cross-hybridization and genetic polymorphism at single base extension site. (A) The Infinium.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College
Whole genome selection and the 2000 bull project at USMARC Larry Kuehn Research Geneticist.
Analysis of Next Generation Sequence Data BIOST /06/2015.
Statistical Analysis of Candidate Gene Association Studies (Categorical Traits) of Biallelic Single Nucleotide Polymorphisms Maani Beigy MD-MPH Student.
Crafting a Quality Grant Proposal March, 2016 ACCELERATED COLLEGE CREDIT GRANT.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Genome-Wides Association Studies (GWAS) Veryan Codd.
Power and Meta-Analysis Dr Geraldine M. Clarke Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for.
Armenian Genome Project
071126_EAS56_0057_FC – lanes 1-8 read 2 b a _EAS56_0057_FC – lanes 1-8 read 1 Table S1. Summary tables for a read 1 and b read 2 of a.
Pharmacogenomic Profiles from 1,092 Human Genomes
Zhengzheng Tang and Danyu Lin March 26, 2013
ESP6800 BP Analysis (recessive model)
Power to detect QTL Association
Sequencing the IL4 locus in African Americans implicates rare noncoding variants in asthma susceptibility  Gabe Haller, BA, Dara G. Torgerson, PhD, Carole.
Presentation transcript:

Host Genomics in WIHS  The WIHS GWAS data set  Concept Sheet  Data use agreement  Data transfer  Analytic support

Host Genomics in WIHS  The WIHS GWAS data set  Concept Sheet  Data use agreement  Data transfer  Analytic support

The GWAS Data set  3700 / 3740 WIHS participants submitted for GWAS  Approximately 5 millions single nucleotide polymorphisms (SNPs)  2.5 million “common” SNPs (>5% MAF)  2.5 million “rare” SNPs (<5% MAF)  Imputation (additional 8 million SNPs)

The GWAS Data set Quality control analyses revealed excellent quality.  Failed samples (i.e., low call rate, insufficient DNA)  2.8% (95 samples); 57 of 95 passed repeat analysis  DNA sample call rate (passed SNP/total SNP):  100% with call rates exceeding 97.5%.  SNP call rate (proportion of samples with valid genotypes)  2,420,602 of 2,443,179 assays (99.1%) had Gentrain scores ≥ 0.8.  2,253,850 exceeded a call rate of 99%  2,391,865 exceeded a call rate of 97.5%  2,419,923 exceeded a call rate of 95.0%  Only 678 assays (0.028%) displayed call rates less than 95%.  Duplicate genotype concordance (2,443,179 SNP assays)  Among 62 pair duplicate samples exceeded 97.6%.  Batch and array level resampling  No evidence of batch effects was found

Host Genomics in WIHS  The WIHS GWAS data set  Concept Sheet  Data use agreement  Data transfer  Analytic support

Concept Sheet: host genomics Considerations for host genomics Table of genes required for candidate gene study If using GWAS dataset, sections 5 & 6 not required Section 5: laboratory methods Section 6: QA/QC If proposing new genotyping, you must substantiate why the GWAS data is not sufficient Examples non-SNP poorly captured by available SNPs Region containing the SNP poorly covered by GWAS Pre-submission review offered

Host Genomics in WIHS  The WIHS GWAS data set  Concept Sheet  Data use agreement  Data transfer  Analytic support

Data Use Agreement  Agreement between investigator, WIHS contact, and WIHS to  Pursue maximum reasonable security measures  Agree to destroy the genetic data files upon successful completion of the study (i.e., publication)  Notify WDMAC in the event of a breach of security/loss of confidentiality

Host Genomics in WIHS  The WIHS GWAS data set  Concept Sheet  Data use agreement  Data transfer  Analytic support

Foundation for secure transfer Request verified by examining Concept Sheet

Encrypting data

Decrypting data

WIHS Assay Validation Report

Host Genomics in WIHS  The WIHS GWAS data set  Concept Sheet  Data use agreement  Data transfer  Analytic support  Special note on racial and ethnic heterogeneity

Analytic Support  Pre-submission Concept Sheet review  Evaluation of and assistance with study design and the data analysis plan.  Potential involvement as a co- Investigator to provide  Analytic support  Assistance with dissemination

Approaches to Race/ethnicity  Self-report only  Genomic estimates of self-reported race and ethnicity  Both  So, how do we estimate genetic ancestry?

Estimating race/ethnicity  Select “ancestry informative markers” from across the genome  Estimate latent subgroups using ancestry informative markers  Note that this is a somewhat circular process and is not perfect  Principle component analysis  Use these estimates jointly as covariates  These PCs (n=10) are provided with all genomic data requests

By Racial and Ethnic Group, then by Caucasian Component gradient

By Racial and Ethnic Group, then by Site, then by Caucasian Component gradient

Principle components: PC1 vs PC2

Principle components: PC1 vs PC2, by site