Haplotypes and imputed genotypes in diverse human populations Noah Rosenberg April 29, 2009.

Slides:



Advertisements
Similar presentations
Which Phenotypes Can be Predicted from a Genome Wide Scan of Single Nucleotide Polymorphisms (SNPs): Ethnicity vs. Breast Cancer Mohsen Hajiloo, Russell.
Advertisements

Lab 3 : Exact tests and Measuring Genetic Variation.
Presented by Qing Duan Dr. Yun Li group UNC at Chapel Hill
Sharlee Climer, Alan R. Templeton, and Weixiong Zhang
Perspectives from Human Studies and Low Density Chip Jeffrey R. O’Connell University of Maryland School of Medicine October 28, 2008.
INTRODUCTION Genome-wide association studies are now feasible. Measuring allele frequencies of pools of cases and controls, instead of between individuals,
Objectives Cover some of the essential concepts for GWAS that have not yet been covered Hardy-Weinberg equilibrium Meta-analysis SNP Imputation Review.
High resolution detection of IBD Sharon R Browning and Brian L Browning Supported by the Marsden Fund.
Outline to SNP bioinformatics lecture
The role of variation in finding functional genetic elements Andy Clark – Cornell Dave Begun – UC Davis.
A coalescent computational platform for tagging marker selection for clinical studies Gabor T. Marth Department of Biology, Boston College
Poster Title 95 point, Arial font Primary heading Xx xxx x xx xxxxx xx xxxx xxx xx xx x x xxxxx xxxx xx xxxxx xx xxxxxx xxx xx. X xx xxxx xxxxx xx xxxx.
Population structure identification BNFO 602 Roshan.
Human Migrations Saeed Hassanpour Spring Introduction Population Genetics Co-evolution of genes with language and cultural. Human evolution: genetics,
CSE 291: Advanced Topics in Computational Biology Vineet Bafna/Pavel Pevzner
CSE182-L17 Clustering Population Genetics: Basics.
SNPs DNA differs between humans by 0.1%, (1 in 1300 bases) This means that you can map DNA variation to around 10,000,000 sites in the genome Almost all.
Presenter Name Presentation/event title xxxxx
SNP Selection University of Louisville Center for Genetics and Molecular Medicine January 10, 2008 Dana Crawford, PhD Vanderbilt University Center for.
MULTIPLES OF 2 By Preston, Lincoln and Blake. 2 X 1 =2 XX 2X1=2 1+1=2.
Differential relatedness of African Americans to populations within West Africa Katarzyna Bryc 1**, Amy Williams 1**, Nick Patterson 2, Solomon Musani.
Origins of Host Specific Populations of Puccinia triticina Revealed by SNP Markers (Preliminary) M. Liu and J. A. Kolmer USDA-ARS Cereal Disease Laboratory,
Design Considerations in Large- Scale Genetic Association Studies Michael Boehnke, Andrew Skol, Laura Scott, Cristen Willer, Gonçalo Abecasis, Anne Jackson,
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
A single-nucleotide polymorphism tagging set for human drug metabolism and transport Kourosh R Ahmadi, Mike E Weale, Zhengyu Y Xue, Nicole Soranzo, David.
EveMark Ed Enter question here. Click the sound icon to listen to each solution. Who has the best solution?
Divide by 8 page – groups of 8 Division Sentence 0 ÷ 8 = 0.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
A Genome-wide association study of Copy number variation in schizophrenia Andrés Ingason CNS Division, deCODE Genetics. Research Institute of Biological.
Gene Hunting: Linkage and Association
Informative SNP Selection Based on Multiple Linear Regression
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Detection of positive selection in humane genome.
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
California Pacific Medical Center
The HapMap Project and Haploview
The International Consortium. The International HapMap Project.
Populations: defining and identifying. Two major paradigms for defining populations Ecological paradigm A group of individuals of the same species that.
Canadian Bioinformatics Workshops
International Workshop on Bioinformatics Research and Applications, May 2005 Phasing and Missing data recovery in Family Trios D. Brinza J. He W. Mao A.
L4: Counting Recombination events
Genomic Patterns of Homozygosity in Worldwide Human Populations
Haplotype Diversity in Immune Related Genes Between Different Geographical Regions and Populations Haplotype frequencies, by color, for each of the 52.
Gamma Theta Upsilon Recent GTU Initiates
Presenter Name Presentation/event title xxxxx
Instructions and Examples Version 1.5
Title—48 pt. Arial Bold (use two lines of text if necessary)
Volume 26, Issue 7, Pages (April 2016)
Introgression of Neandertal- and Denisovan-like Haplotypes Contributes to Adaptive Variation in Human Toll-like Receptors  Michael Dannemann, Aida M.
Haplotype Estimation Using Sequencing Reads
Gene Discovery for Complex Traits: Lessons from Africa
Proportioning Whole-Genome Single-Nucleotide–Polymorphism Diversity for the Identification of Geographic Population Structure and Genetic Ancestry  Oscar.
A Predominantly Indigenous Paternal Heritage for the Austronesian-Speaking Peoples of Insular Southeast Asia and Oceania  Cristian Capelli, James F. Wilson,
Basic SNP characteristics using different filter settings for missing data. Basic SNP characteristics using different filter settings for missing data.
Volume 24, Issue 8, Pages (April 2014)
Title—48 pt. Arial Bold (use two lines of text if necessary)
Perspectives from Human Studies and Low Density Chip
Identifying Darwinian Selection Acting on Different Human APOL1 Variants among Diverse African Populations  Wen-Ya Ko, Prianka Rajan, Felicia Gomez, Laura.
Trevor J. Pemberton, Chaolong Wang, Jun Z. Li, Noah A. Rosenberg 
TITLE XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Long Runs of Homozygosity Are Enriched for Deleterious Variation
GLOBAL CHANNEL TRENDS, 2013 Shifting patterns in regional channel growth LOUISE HOWARTH May 2013 Senior Retail Analyst.
Evaluating the Effects of Imputation on the Power, Coverage, and Cost Efficiency of Genome-wide SNP Platforms  Carl A. Anderson, Fredrik H. Pettersson,
Genotype-Imputation Accuracy across Worldwide Human Populations
Introgression of Neandertal- and Denisovan-like Haplotypes Contributes to Adaptive Variation in Human Toll-like Receptors  Michael Dannemann, Aida M.
Presentation transcript:

Haplotypes and imputed genotypes in diverse human populations Noah Rosenberg April 29, 2009

Human Genome Diversity Cell Line Panel 525,910 single-nucleotide polymorphisms in 29 populations M Jakobsson et al. (2008) Nature 451:

How do we measure and compare haplotype diversity across populations? Imputation in diverse populations Overview

Which populations and genomic sites have more haplotype diversity? X0XX0X000X00X X0XXX00XX0X00000X XX00X0XX 000X0XX000000XXX000XX0000 0X00X00XX0X00000X0000X0XX 0X000X000X00X X0X000X00X X00XX00XX0X00000X X000XX000000XXX000XX X00XX0X00000X0000X0XX 0X00X00XX0X00000X0000X0XX 0X0X XX X XX XX000000XXX000XX0000 0X000XX000000XXX000XX0000 X0XX0X000X0XX X X00XX0X00000X X0X0X000X00X X0XX 0X00000XX0X00000X0000X0XX 0X0X0XX000000XXX000XXX0XX Population 1 Population 2

Which populations and genomic sites have more haplotype diversity? XXXXXX XXXXXXXX XXXXX XXXXXXXX XXXXXXXXX XXXX XXXX XXXXXXX XXXXXXXX XXXXXXXX XXXXXXXXX XXXX XXX XXXXXXX XXXXXXXX XXXXXXXX XXXXX XXXXXXXX XXXXXXXX XXXXXXXXXXXX Population 1 Population 2

Which populations and genomic sites have more haplotype diversity? XXXXXXXX XXXXXXXX XXXX XXXX XXXXXXX XXXXXXXX XXXXXXXXX XXXXXXXX XXXXX XXXXXXXX Population 1 XXXXXX XXXXX XXXXXXXXX XXXXXXXX XXXX XXX XXXXXXX XXXXXXXX XXXXXXXX XXXXXXXXXXXX Population 2 P Scheet, M Stephens (2006) AJHG 78:

Which populations and genomic sites have more haplotype diversity? XXXXXXXX XXXXXXXX XXXX XXXX XXXXXXX XXXXXXXX XXXXXXXXX XXXXXXXX XXXXX XXXXXXXX Population 1 Blue

Which populations and genomic sites have more haplotype diversity? 111X1X XXXXXXXX XXXXXXXX XXXX XXXX XXXXXXX XXXXXXXX XXXXXXXXX XXXXXXXX XXXXX XXXXXXXX Population 1 Blue Green

Which populations and genomic sites have more haplotype diversity? 111X1X XXXXXXXX XXXXXXXX XXXX XXXX XXXXXXX XXXXXXXX XXXXXXXXX XXXXXXXX XXXXX XXXXXXXX Population 1 Blue Green Orange

Which populations and genomic sites have more haplotype diversity? 111X1X XXXXXXXX XXXXXXXX XXXX XXXX XXXXXXX XXXXXXXX XXXXXXXXX XXXXXXXX XXXXX XXXXXXXX Population 1 Blue Green Orange Pink

Which populations and genomic sites have more haplotype diversity? 111X1X XXXXXXXX XXXXXXXX XXXX XXXX XXXXXXX XXXXXXXX XXXXXXXXX XXXXXXXX XXXXX XXXXXXXX Population 1 Blue Green Orange Pink Yellow

Which populations and genomic sites have more haplotype diversity?

XXXXXXXX XXXXXXXX XXXX XXXX XXXXXXX XXXXXXXX XXXXXXXXX XXXXXXXX XXXXX XXXXXXXX Population 1 Less diversity XXXXXX XXXXX XXXXXXXXX XXXXXXXX XXXX XXX XXXXXXX XXXXXXXX XXXXXXXX XXXXXXXXXXXX Population 2 More diversity

Haplotype cluster frequencies for a “typical” genomic region M Jakobsson et al. (2008) Nature 451:

More haplotype diversity in Africa Africa Europe Middle East Asia Oceania America C Asia M Jakobsson et al. (2008) Nature 451:

Less haplotype homozygosity and more haplotype diversity in Africa M Jakobsson et al. (2008) Nature 451:

Genetic diversity declines with distance from Africa Haplotype heterozygosity

Haplotype clusters recover population structure Africa Middle East Europe Central/South Asia Oceania America East Asia M Jakobsson et al. (2008) Nature 451:

Haplotype clusters recover population structure M Jakobsson et al. (2008) Nature 451:

Low haplotype diversity in the lactase region in Europe Africa Europe Middle East Asia Oceania America C Asia M Jakobsson et al. (2008) Nature 451:

Haplotype cluster homozygosity as a test for selection Random region Lactase region M Jakobsson et al. (2008) Nature 451:

Haplotype clusters can be used to encode haplotypes pointwise for measurement of diversity Haplotype cluster diversity is greatest in Africa Low haplotype cluster diversity can potentially be used to detect selection Haplotype diversity – summary

Measuring haplotype diversity using haplotype clusters Imputation in diverse populations Overview

Study sample Genotyped positions Reference panel Imputed genotypes can be tested for disease association Genotypes can be imputed using a reference panel – but imperfectly

443 individuals in 29 populations from the Human Genome Diversity Panel Genotypes at >500,000 SNPs (Jakobsson et al. Nature 451: , 2008) 420 HapMap reference haplotypes of ~2,000,000 SNPs, omitting offspring in trios Randomly hide 15% genotypes in HGDP individuals and impute with MACH Measure the proportion of alleles imputed correctly Evaluating imputation accuracy in worldwide populations

Imputation accuracy is predicted by haplotype diversity Imputation accuracy L Huang et al. (2008) AJHG 84:

Imputation accuracy is greatest with a close reference panel L Huang et al. (2008) AJHG 84:

Highest-accuracy reference panels match geographic locations Africa Europe/ W Asia E Asia/ Oceania/ Americas L Huang et al. (2008) AJHG 84:

Instead of imputing based on separate HapMap panels, impute from mixtures Choose mixtures to have optimal size given specified ratios Imputation accuracy can be increased using HapMap mixtures L Huang et al. (2008) AJHG 84:

Imputation accuracy can be increased using HapMap mixtures L Huang et al. (2008) AJHG 84:

Strategies to improve imputation studies -Increased sample size -Improved imputation algorithms -Improved use of reference panels -Development of additional reference panels -Improved haplotyping -Use of additional data from relatives Summary – imputation accuracy

Imputation error and sample size inflation are greatest in Africa Imputation – summary Several strategies may be available for improving imputation, including use of mixtures

Rosenberg lab James Degnan Mike DeGiorgio Lucy Huang Mattias Jakobsson Trevor Pemberton Paul Scheet Zach Szpiech Jenna VanLiere Chaolong Wang Collaborators Goncalo Abecasis (Michigan) Raph Gibbs (NIA) John Hardy (UCL) Yun Li (Michigan) Sonja Scholz (NIA) Andy Singleton (NIA) Funding Alfred P. Sloan Foundation Burroughs Wellcome Fund National Institutes of Health U of M Rackham Graduate School [M DeGiorgio] U of M Center for Genetics in Health and Medicine [M Jakobsson]